stt

Here are 421 public repositories matching this topic...

alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Updated Nov 13, 2024
Jupyter Notebook

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (e.g gpt, claude, gemini, llama, qwen, mistral).

Updated Nov 24, 2024
Python

pannous / tensorflow-speech-recognition

Sponsor

Star

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

deep-learning neural-network tensorflow speech-recognition speech-to-text stt

Updated Jan 17, 2024
Python

inevolin / DiscordEarsBot

Star

A speech-to-text framework and bot for Discord. Take control of your Discord server using speech and voice commands. Can also be useful for hearing impaired and deaf people.

discord discord-bot speech speech-synthesis speech-recognition speech-to-text discord-js stt speech-processing hearing-aids hearing-impaired

Updated Dec 29, 2023
JavaScript

snakers4 / silero-models

Star

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Updated Oct 18, 2023
Jupyter Notebook

jianchang512 / stt

Star

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具，输出json、srt字幕、纯文字格式

speech speech-recognition speech-to-text stt

Updated Nov 20, 2024
Python

coqui-ai / STT

Star

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

deep-learning tensorflow voice-recognition speech-recognition automatic-speech-recognition speech-to-text stt asr speech-recognizer speech-recognition-api

Updated Mar 11, 2024
C++

deepgram-starters / nextjs-live-transcription

Star

Live transcription in Next.js by Deepgram

websocket nextjs realtime live speech-to-text transcription stt deepgram

Updated Sep 18, 2024
TypeScript

bbc / react-transcript-editor

Star

A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress

react transcript kaldi transcription stt bbc-news-labs news-labs transcript-editor textav

Updated Feb 12, 2024
JavaScript

R3gm / SoniTranslate

Star

Synchronized Translation for Videos. Video dubbing

text-to-speech translation tts speech-to-text stt audio-processing asr document-translator dubbing diarization automatic-dubbing subtitle-to-speech translate-audio translate-video video-dubbing

Updated Oct 23, 2024
Python

coqui-ai / open-speech-corpora

Star

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

text-to-speech tts speech-synthesis voice-recognition speech-recognition speech-to-text stt speech-processing voice-activity-detection speech-separation speech-emotion-recognition voice-cloning

Updated Jun 6, 2024

lenML / Speech-AI-Forge

Star

🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Updated Nov 11, 2024
Python

Macoron / whisper.unity

Star

Running speech to text model (whisper.cpp) in Unity3d on your local machine.

unity3d speech-recognition openai speech-to-text stt whisper asr

Updated Nov 2, 2024
C#

abus-aikorea / voice-pro

Star

Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover, Transcription, Text-to-Speech (Edge-TTS, F5-TTS), and Translation.

text-to-speech translator web-ui tts speech-synthesis subtitles speech-recognition translate webui speech-to-text transcription gradio stt whisper voice-cloning demucs yt-dlp faster-whisper uvr5

Updated Nov 22, 2024
Python

deepgram-devs / deepgram-ai-agent-demo

Star

Deepgram Conversational AI demo

react nextjs tts stt asr deepgram vercel

Updated Nov 20, 2024
TypeScript

pluja / whishper

Star

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

golang ui web ai subtitles webapp speech-recognition speech-to-text transcription stt whisper audio-to-text sveltekit web-whisper

Updated Sep 17, 2024
Svelte

gia-guar / JARVIS-ChatGPT

Star

A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.

python ai pytorch tts speech-recognition openai stt ibm-watson tacotron jarvis-ai chatgpt chatgpt-api chat-gpt-3 elevenlabs

Updated Sep 7, 2023
Python

voice-engine / make-a-smart-speaker

Star

A collection of resources to make a smart speaker

nlu tts stt voice-assistant beamforming kws aec

Updated Dec 20, 2019

inevolin / DiscordSpeechBot

Star

A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.

music music-player discord discord-bot speech speech-recognition speech-to-text discord-js stt speech-processing

Updated Oct 7, 2021
JavaScript

snakers4 / open_stt

Star

Open STT

dataset russian automatic-speech-recognition speech-to-text stt asr

Updated Mar 11, 2022
Python

Improve this page

Add a description, image, and links to the stt topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the stt topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stt

Here are 421 public repositories matching this topic...

alphacep / vosk-api

khoj-ai / khoj

pannous / tensorflow-speech-recognition

inevolin / DiscordEarsBot

snakers4 / silero-models

jianchang512 / stt

coqui-ai / STT

deepgram-starters / nextjs-live-transcription

bbc / react-transcript-editor

R3gm / SoniTranslate

coqui-ai / open-speech-corpora

lenML / Speech-AI-Forge

Macoron / whisper.unity

abus-aikorea / voice-pro

deepgram-devs / deepgram-ai-agent-demo

pluja / whishper

gia-guar / JARVIS-ChatGPT

voice-engine / make-a-smart-speaker

inevolin / DiscordSpeechBot

snakers4 / open_stt

Improve this page

Add this topic to your repo