- Shanghai, China
-
03:41
(UTC +08:00)
Highlights
-
-
DeepSeekV3 Public
Simple and efficient implementation of 671B DeepSeek V3 that trainable with FSDP+EP and minimal requirement of 256x A100/H100, targeted for HuggingFace ecosystem
-
openevolve Public
Forked from codelion/openevolveOpen-source implementation of AlphaEvolve
Python Apache License 2.0 UpdatedSep 22, 2025 -
torch-triton-pair Public
Pair of PyTorch and Triton code from open-sourced kernel libraries
Python UpdatedSep 19, 2025 -
dynamo Public
Forked from ai-dynamo/dynamoA Datacenter Scale Distributed Inference Serving Framework
Rust Apache License 2.0 UpdatedSep 17, 2025 -
-
dlBLAS Public
Forked from DeepLink-org/DLBlasdlBLAS: clean and efficient kernels
Python BSD 3-Clause "New" or "Revised" License UpdatedSep 8, 2025 -
NVSHMEM Public
Forked from NVIDIA/nvshmemNVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process communication and coordination overheads by allowing programmer…
-
MiroFlow Public
Forked from MiroMindAI/MiroFlowBuild, manage, and scale your AI agents with ease.
Python Apache License 2.0 UpdatedSep 4, 2025 -
MiroThinker Public
Forked from MiroMindAI/MiroThinkerMiroThinker is open-source agentic models trained for deep research and complex tool use scenarios.
Python Apache License 2.0 UpdatedSep 3, 2025 -
varlen_mamba Public
Forked from state-spaces/mambaMamba SSM architecture that supports training on variable-length sequences
-
MiroRL Public
Forked from MiroMindAI/MiroRLMiroRL is an MCP-first reinforcement learning framework for deep research agent.
Python Apache License 2.0 UpdatedAug 27, 2025 -
MiroTrain Public
Forked from MiroMindAI/MiroTrainMiroTrain is an efficient and algorithm-first framework for post-training large agentic models.
Python Apache License 2.0 UpdatedAug 27, 2025 -
mixture_of_recursions Public
Forked from raymin0223/mixture_of_recursionsMixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation
Python Apache License 2.0 UpdatedJul 28, 2025 -
InternEvo Public
Forked from InternLM/InternEvoInternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.
Python Apache License 2.0 UpdatedJul 18, 2025 -
HolisticTraceAnalysis Public
Forked from facebookresearch/HolisticTraceAnalysisA library to analyze PyTorch traces.
Python MIT License UpdatedJul 14, 2025 -
quack Public
Forked from Dao-AILab/quackA Quirky Assortment of CuTe Kernels
Python Apache License 2.0 UpdatedJul 10, 2025 -
FlagGems Public
Forked from FlagOpen/FlagGemsFlagGems is an operator library for large language models implemented in the Triton Language.
Python Apache License 2.0 UpdatedJul 8, 2025 -
Liger-Kernel Public
Forked from linkedin/Liger-KernelEfficient Triton Kernels for LLM Training
Python BSD 2-Clause "Simplified" License UpdatedJul 6, 2025 -
real_KernelBench Public
Forked from ScalingIntelligence/KernelBenchKernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems
Python Other UpdatedJul 6, 2025 -
CZ_pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedJul 3, 2025 -
CZ_cpython Public
Forked from python/cpythonThe Python programming language
Python Other UpdatedJul 3, 2025 -
CZ_cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedJun 27, 2025 -
flashinfer Public
Forked from flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 UpdatedJun 25, 2025 -
cz_verl Public
Forked from volcengine/verlveRL: Volcano Engine Reinforcement Learning for LLM
Python Apache License 2.0 UpdatedJun 19, 2025 -
DeepSpeed Public
Forked from deepspeedai/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python Apache License 2.0 UpdatedJun 17, 2025 -
caesar Public
Forked from ScalingIntelligence/caesarThroughput-oriented multi-turn inference engine for KernelBench [ICML '25]
Python UpdatedMay 27, 2025 -
viztracer Public
Forked from gaogaotiantian/viztracerA debugging and profiling tool that can trace and visualize python code execution
Python Apache License 2.0 UpdatedMay 25, 2025 -
torchtitan Public
Forked from pytorch/torchtitanA PyTorch native platform for training generative AI models
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 20, 2025 -
ao Public
Forked from pytorch/aoPyTorch native quantization and sparsity for training and inference
Python BSD 3-Clause "New" or "Revised" License UpdatedMay 17, 2025