Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
-
Updated
Nov 13, 2024 - Jupyter Notebook
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (e.g gpt, claude, gemini, llama, qwen, mistral).
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
A speech-to-text framework and bot for Discord. Take control of your Discord server using speech and voice commands. Can also be useful for hearing impaired and deaf people.
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Live transcription in Next.js by Deepgram
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
Synchronized Translation for Videos. Video dubbing
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.
Running speech to text model (whisper.cpp) in Unity3d on your local machine.
Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.
A collection of resources to make a smart speaker
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Open STT
Add a description, image, and links to the stt topic page so that developers can more easily learn about it.
To associate your repository with the stt topic, visit your repo's landing page and select "manage topics."