Interrelate Training and Clustering for Online Speaker Diarization
Abstract
References
Index Terms
- Interrelate Training and Clustering for Online Speaker Diarization
Recommendations
Online Neural Speaker Diarization With Target Speaker Tracking
This paper proposes an online target speaker voice activity detection (TS-VAD) system for speaker diarization tasks that does not rely on prior knowledge from clustering-based diarization systems to obtain target speaker embeddings. By adapting ...
Graph attention-based deep embedded clustering for speaker diarization
AbstractDeep speaker embedding extraction models have recently served as the cornerstone for modular speaker diarization systems. However, in current modular systems, the extracted speaker embeddings (namely, speaker features) do not effectively leverage ...
Highlights- A graph constructed from speaker embeddings to utilize the local structural information among embeddings.
- Employed Multi-layer graph attention networks as an encoder module to learn latent speaker embeddings.
- Multi-objective ...
Speaker diarization system using MKMFCC parameterization and WLI-fuzzy clustering
Speaker diarization is the process of determining "who speak when?" with appropriate speaker labels with respect to the time regions where they spoke. Accordingly, in the previous work, a model based speaker diarization using the tangential weighted Mel ...
Comments
Please enable JavaScript to view thecomments powered by Disqus.Information & Contributors
Information
Published In

Publisher
IEEE Press
Publication History
Qualifiers
- Research-article
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- View Citations1Total Citations
- 9Total Downloads
- Downloads (Last 12 months)9
- Downloads (Last 6 weeks)0
Other Metrics
Citations
View Options
Login options
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in