Scalable and extensible reinforcement learning for LM agents.
nlp training agent framework reinforcement-learning system tool mcp rl multi-modal reward reasoning tool-use vision-language chat-template llm agentic agentrl agent-rl agentfly
-
Updated
Oct 18, 2025 - Python