ml
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
A curated list of awesome projects which use Machine Learning to generate synthetic content.
Low-code framework for building custom LLMs, neural networks, and other AI models
😈Awful AI is a curated list to track current scary usages of AI - hoping to raise awareness
The "Python Machine Learning (1st edition)" book code repository and info resource
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 300+ supported cars.
Automatic License Plate Recognition library
Port of OpenAI's Whisper model in C/C++
Stable Diffusion for real-time music generation (web app)
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Pod transcription with OpenAI Whisper and AWS
FauxPilot - an open-source alternative to GitHub Copilot server
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Rust+OpenCL+AVX2 implementation of LLaMA inference code
Faster Whisper transcription with CTranslate2
Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools, with 10x faster training through evolutionary hyperparameter optimization.
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
TextGAN is a PyTorch framework for text generation models based on Generative Adversarial Networks (GANs).
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models