- Seoul, Republic of Korea
-
07:09
(UTC +09:00) - https://www.linkedin.com/in/kdrkdrkdr
- https://elnino.kr
Highlights
Lists (3)
Sort Name ascending (A-Z)
Stars
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
finetune llm part for spark-tts model
TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation
Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)
산업기능요원 (현역 / 보충역) 회사 알짜 리스트!
Official code for"DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation"
High-performance safetensors model loader
woct0rdho / triton-windows
Forked from triton-lang/tritonFork of the Triton language and compiler for Windows support and easy installation
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
✨ Build a machine learning model from a prompt
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
사업자없이 결제 가능한 PG인 PayApp 서비스 FastAPI Demo 프로젝트입니다.
State-of-the-art TTS model under 25MB 😻
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
A web application for chatting with AI characters that anyone can easily set up and deploy.
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
Collection of leaked system prompts
pyauth / pyotp
Forked from mdp/rotpPython One-Time Password Library
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning