Stars
The data plane for agents. Arch is a models-native proxy server that handles the plumbing work in AI: agent routing & hand off, guardrails, end-to-end logs and traces, unified access to LLMs from O…
Run the latest LLMs and VLMs across GPU, NPU, and CPU with PC (Python/C++) & mobile (Android & iOS) support, running quickly with OpenAI gpt-oss, Granite4, Qwen3VL, Gemma 3n and more.
Web-based tool converts GitHub repository contents into a single formatted text file
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-fri…
Metal GPU implementation of the Qwen3 transformer model on macOS with complete Apple Silicon compute shader acceleration.
A workshop that teaches you how to build your own coding agent. Similar to Roo code, Cline, Amp, Cursor, Windsurf or OpenCode.
A high-throughput and memory-efficient inference and serving engine for LLMs
ControlNet++: All-in-one ControlNet for image generations and editing!
Portable file server with accelerated resumable uploads, dedup, WebDAV, FTP, TFTP, zeroconf, media indexer, thumbnails++ all in one file, no deps
Run Orpheus 3B Locally With LM Studio
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale…
Faster Whisper transcription with CTranslate2
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Whisper realtime streaming for long speech-to-text transcription and translation
No fortress, purely open ground. OpenManus is Coming.
A cert-manager webhook to perform DNS01 challenge through websupport DNS API
A simple screen parsing tool towards pure vision based GUI agent
🤗 smolagents: a barebones library for agents that think in code.
Simple, unified interface to multiple Generative AI providers
Everything about the SmolLM and SmolVLM family of models
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"