default search action
Odyssey 2012: Singapore
- Haizhou Li, Bin Ma, Kong-Aik Lee:
Odyssey 2012: The Speaker and Language Recognition Workshop, Singapore, June 25-28, 2012. ISCA 2012
Plenary Session
- Niko Brümmer:
The role of proper scoring rules in training and evaluating probabilistic speaker and language recognizers. - Li Deng:
Being deep and being dynamic - new-generation models and methodology for advancing speech technology. - Alvin F. Martin:
The NIST speaker recognition evaluations.
Speaker Recognition - Compact Representation
- Patrick Kenny:
A small footprint i-vector extractor. 1-6 - Sandro Cumani, Pietro Laface, Vasileios Vasilakakis:
Memory and computation effective approaches for i - vector extraction. 7-13 - Srikanth R. Madikeri:
A hybrid factor analysis and probabilistic PCA-based system for dictionary learning and encoding for robust speaker recognition. 14-20 - Haris B. C., Rohit Sinha:
On exploring the similarity and fusion of i-vector and sparse representation based speaker verification systems. 21-27
Speaker Recognition - Generative Modeling
- Ahilan Kanagasundaram, Robbie Vogt, David Dean, Sridha Sridharan:
PLDA based speaker recognition on short utterances. 28-33 - Ahilan Kanagasundaram, David Dean, Sridha Sridharan, Robbie Vogt:
PLDA based speaker verification with weighted LDA techniques. 34-38 - Carlos Vaquero:
Dataset shift in PLDA based speaker verification. 39-46 - Jesús Antonio Villalba López, Eduardo Lleida:
Bayesian adaptation of PLDA based speaker recognition to domains with scarce development data. 47-54 - Mitchell McLaren, Miranti Indar Mandasari, David A. van Leeuwen:
Source normalization for language-independent speaker recognition using i-vectors. 55-61
Forensic Speaker Recognition
- Geoffrey Stewart Morrison, Felipe Ochoa, Tharmarajah Thiruvaran:
Database selection for forensic voice comparison. 62-77 - Ewald Enzinger, Cuiling Zhang, Geoffrey Stewart Morrison:
Voice source features for forensic voice comparison - an evaluation of the GLOTTEX software package. 78-85 - Yosef A. Solewicz, Timo Becker, Gaëlle Jardine, Stefan G. Gfrörer:
Comparison of speaker recognition systems on a real forensic benchmark. 86-91
Neural Network for Speaker Recognition
- Sri Garimella, Hynek Hermansky:
Factor analysis of mixture of auto-associative neural networks for speaker verification. 92-97 - Samuel Thomas, Sri Harish Reddy Mallidi, Sriram Ganapathy, Hynek Hermansky:
Adaptation transforms of auto-associative neural networks as features for speaker verification. 98-104 - Sibel Yaman, Jason W. Pelecanos, Ruhi Sarikaya:
Bottleneck features for speaker recognition. 105-108 - Themos Stafylakis, Patrick Kenny, Mohammed Senoussaoui, Pierre Dumouchel:
Preliminary investigation of Boltzmann machine classifiers for speaker recognition. 109-116 - Mohammed Senoussaoui, Najim Dehak, Patrick Kenny, Réda Dehak, Pierre Dumouchel:
First attempt of boltzmann machines for speaker verification. 117-121
Speaker Diarization
- Hagai Aronowitz, Yosef A. Solewicz, Orith Toledo-Ronen:
Online two speaker diarization. 122-129 - Jordi Luque, Javier Hernando:
On the use of agglomerative and spectral clustering in speaker diarization of meetings. 130-137 - Itshak Lapidot, Jean-François Bonastre:
Generalized Viterbi-based models for time-series segmentation applied to speaker diarization. 138-145 - Mickael Rouvier, Sylvain Meignier:
A global optimization framework for speaker diarization. 146-150 - Sashin Kajarekar, Aparna Khare, Matthias Paulik, Neha Agrawal, Panchi Panchapagesan, Ananth Sankar, Satish Gannu:
Cisco's speaker segmentation and recognition system. 151-156
Speaker Recognition - Channel Robustness
- Pierre-Michel Bousquet, Anthony Larcher, Driss Matrouf, Jean-François Bonastre, Oldrich Plchot:
Variance-spectra based normalization for i-vector standard and probabilistic linear discriminant analysis. 157-164 - Wei Rao, Man-Wai Mak:
Utterance partitioning with acoustic vector resampling for i-vector based speaker verification. 165-171 - Sheng Chen, Mingxing Xu, Emlyn Pratt:
Study on the effects of intrinsic variation using i-vectors in text-independent speaker verification. 172-179 - William M. Campbell, Douglas E. Sturim, Bengt J. Borgström, Robert B. Dunn, Alan McCree, Thomas F. Quatieri, Douglas A. Reynolds:
Exploring the impact of advanced front-end processing on NIST speaker recognition microphone tasks. 180-186 - Bengt J. Borgström, Alan McCree:
Linear prediction modulation filtering for speaker recognition of reverberant speech. 187-193
Language Recognition Evaluation
- Luis Javier Rodríguez-Fuentes, Amparo Varona, Mireia Díez, Mikel Peñagarikano, Germán Bordel:
Evaluation of spoken language recognition technology using broadcast speech: performance and challenges. 194-201 - Stephanie M. Strassel, Kevin Walker, Karen Jones, David Graff, Christopher Cieri:
New resources for recognition of confusable linguistic varieties: the LRE11 corpus. 202-208 - Elliot Singer, Pedro A. Torres-Carrasquillo, Douglas A. Reynolds, Alan McCree, Fred Richardson, Najim Dehak, Douglas E. Sturim:
The MITLL NIST LRE 2011 language recognition system. 209-215 - Niko Brümmer, Sandro Cumani, Ondrej Glembek, Martin Karafiát, Pavel Matejka, Jan Pesán, Oldrich Plchot, Mehdi Soufifar, Edward de Villiers, Jan Cernocký:
Description and analysis of the Brno276 system for LRE2011. 216-223 - Gang Liu, Chi Zhang, John H. L. Hansen:
A linguistic data acquisition front-end for language recognition evaluation. 224-228
Features for Speaker Recognition
- Sriram Ganapathy, Samuel Thomas, Hynek Hermansky:
Feature extraction using 2-d autoregressive models for speaker recognition. 229-235 - Cemal Hanilçi, Tomi Kinnunen, Rahim Saeidi, Jouni Pohjalainen, Paavo Alku, Figen Ertas:
Regularization of all-pole models for speaker verification under additive noise. 236-242 - Taufiq Hasan, John H. L. Hansen:
Factor analysis of acoustic features using a mixture of probabilistic principal component analyzers for robust speaker verification. 243-247 - Rahim Saeidi, Antti Hurmalainen, Tuomas Virtanen, David A. van Leeuwen:
Exemplar-based sparse representation and sparse discrimination for noise robust speaker identification. 248-255 - Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
On the use of asymmetric-shaped tapers for speaker verification using i-vectors. 256-262
Speaker Recognition Evaluation
- George R. Doddington:
The effect of target/non-target age difference on speaker recognition performance. 263-267 - Ville Hautamäki, Kong-Aik Lee, Anthony Larcher, Tomi Kinnunen, Bin Ma, Haizhou Li:
Variational Bayes logistic regression as regularized fusion for NIST SRE 2010. 268-274 - Craig S. Greenberg, Alvin F. Martin, Mark A. Przybocki:
The 2011 BEST speaker recognition interim assessment. 275-282 - Juliette Kahn, Olivier Galibert, Matthieu Carré, Aude Giraudel, Philippe Joly, Ludovic Quintard:
The REPERE challenge: finding people in a multimodal context. 283-290 - Kevin Walker, Stephanie M. Strassel:
The RATS radio traffic collection system. 291-297
Speaker Recognition - Application
- Andreas Stolcke, Martin Graciarena, Luciana Ferrer:
Effects of audio and ASR quality on cepstral and high-level speaker verification systems. 298-303 - Tomi Kinnunen, Rahim Saeidi, Jussi Leppänen, Jukka Saarinen:
Audio context recognition in variable mobile environments from short segments using speaker and language recognizers. 304-311 - Hagai Aronowitz:
Text dependent speaker verification using a small development set. 312-316 - Luciana Ferrer, Lukás Burget, Oldrich Plchot, Nicolas Scheffer:
A unified approach for audio characterization and its application to speaker recognition. 317-323 - Themos Stafylakis, Vassilis Katsouros, Patrick Kenny, Pierre Dumouchel:
Mean shift algorithm for exponential families with applications to speaker clustering. 324-329
Language Recognition - Feature, Classifier and Fusion
- Oldrich Plchot, Martin Karafiát, Niko Brümmer, Ondrej Glembek, Pavel Matejka, Edward de Villiers, Jan Cernocký:
Speaker vectors from subspace Gaussian mixture model as complementary features for language identification. 330-333 - Zhiyi Li, Wei-Qiang Zhang, Liang He, Jia Liu:
Complementary combination in i-vector level for language recognition. 334-337 - Chang Huai You, Haizhou Li, Eliathamby Ambikairajah, Kong-Aik Lee, Bin Ma:
Bhattacharyya-based GMM-SVM system with adaptive relevance factor for pair language recognition. 338-345 - Mohamed Faouzi BenZeghiba, Jean-Luc Gauvain, Lori Lamel:
Fusing language information from diverse data sources for phonotactic language recognition. 346-352
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.