Highlights
Stars
A PyTorch native platform for training generative AI models
Collection of scripts and notebooks for OpenAI's latest GPT OSS models
A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
Get your documents ready for gen AI
[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"
[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents
[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
A natively parallel dataloader for Python, written in Rust. Serving data at GB/s speeds, while covering aspect ratio bucketing, crop and resize for image ML workloads.
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
aider is AI pair programming in your terminal
Efficient Triton Kernels for LLM Training
FlashInfer: Kernel Library for LLM Serving
A throughput-oriented high-performance serving framework for LLMs
[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Efficient and general syntactical decoding for Large Language Models
BullMQ - Message Queue and Batch processing for NodeJS and Python based on Redis
The multi-agent framework and runtime. Fast, elegant and performant at scale.