Stars
State-of-the-art 2D and 3D Face Analysis Project
Production-ready platform for agentic workflow development.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Official repository for "MaskControl: Spatio-Temporal Control for Masked Motion Synthesis" ICCV 2025 (Oral)
[ICLR2025] DisPose: Disentangling Pose Guidance for Controllable Human Image Animation
API and WebSocket server for SenseVoice. It inherits some enhanced features, such as voice activity detection (VAD), real-time streaming recognition, and speaker verification.
TinyVM is a small, fast, lightweight virtual machine written in pure ANSI C.
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
Real-time interactive streaming digital human
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
🦜🔗 Build context-aware reasoning applications
Robust Speech Recognition via Large-Scale Weak Supervision
WordPress, Git-ified. This repository is just a mirror of the WordPress subversion repository. Please do not send pull requests. Submit pull requests to https://github.com/WordPress/wordpress-devel…
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
A curated list of awesome research papers, projects, code, datasets, workshops, etc. related to virtual try-on.
Master programming by recreating your favorite technologies from scratch.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍
DeepFashion2 Dataset https://arxiv.org/pdf/1901.07973.pdf
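"A Guide to Consuming Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying open-source large language models (LLMs) and multimodal large language models (MLLMs) from China and abroad in a Linux environment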
idiap / coqui-ai-TTS
Forked from coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…