Stars
北京科技大学计算机组成原理课程设计
The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
Easy Data Preparation with latest LLMs-based Operators and Pipelines.
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Train your Agent model via our easy and efficient framework
完全免费, 自动获取新账号,一键重置新额度, 解决机器码问题, 自动满额度
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!