-
Tencent
- Shanghai
-
14:27
(UTC +08:00)
Stars
Expressive Anechoic Recordings of Speech (EARS)
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
A TTS model capable of generating ultra-realistic dialogue in one pass.
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", which is accepted by Information Fusion.
Official Jax Implementation of MaskGIT
Real-Time De-noising and De-reverbing with Tiny Recurrent UNet
The personal information dashboard for your terminal
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"
经济学人(含音频)、纽约客、卫报、连线、大西洋月刊等英语杂志免费下载,支持epub、mobi、pdf格式, 每周更新
Speaker verification evaluation protocols simulating speaker diarisation
BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION
Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement
This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)
vits2 backbone with multilingual-bert
Voice activity detection (VAD) library, based on WebRTC's VAD engine
A list of publicly available room impulse response datasets and scripts to download them.
wsj0-{2, 3, 4, 5} mix generation scripts, in Python.
Official implementation of "Separate Anything You Describe"