- Irvine
-
21:30
(UTC -07:00) - in/austin362667
Lists (1)
Sort Name ascending (A-Z)
Stars
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
Roblox Foundation Model for 3D Intelligence
How to ensure correctness and ship LLM generated kernels in PyTorch
Minimalistic 4D-parallelism distributed training framework for education purpose
Implementation for FP8/INT8 Rollout for RL training without performence drop.
Open-source simulator for autonomous driving research.
Infrastructure for Machine Learning Guided Optimization (MLGO) in LLVM.
Library for reading and processing ML training data.
AMD RAD's multi-GPU Triton-based framework for seamless multi-GPU programming
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
Optimize Julia Functions With MLIR and XLA for High-Performance Execution on CPU, GPU, TPU and more.
Backward compatible ML compute opset inspired by HLO/MHLO
A machine learning compiler for GPUs, CPUs, and ML accelerators
[ICRA 2022] An opensource framework for cooperative detection. Official implementation for OPV2V.
Gthulhu optimizes cloud-native workloads using the Linux Scheduler Extension for different application scenarios.
Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
Allow torch tensor memory to be released and resumed later
A parallel programming training mini app simulating weather-like flows
FFmpeg libav tutorial - learn how media works from basic to transmuxing, transcoding and more. Translations: 🇺🇸 🇨🇳 🇰🇷 🇪🇸 🇻🇳 🇧🇷
Fast caching software with a focus on low latency and cpu efficiency.
A Python-embedded modeling language for convex optimization problems.
vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
Physics-Informed Neural networks for Advanced modeling