Speech Communication, Volume 52
Volume 52, Number 1, January 2010
- Xugang Lu, Shigeki Matsuda, Masashi Unoki, Satoshi Nakamura:
Temporal contrast normalization and edge-preserved smoothing of temporal modulation structures of speech for robust speech recognition. 1-11
- Tomi Kinnunen, Haizhou Li:
An overview of text-independent speaker recognition: From features to supervectors. 12-40
- Kentaro Ishizuka, Tomohiro Nakatani, Masakiyo Fujimoto, Noboru Miyazaki:
Noise robust voice activity detection based on periodic to aperiodic component ratio. 41-60
- Teruhisa Misu, Tatsuya Kawahara:
Bayes risk-based dialogue management for document retrieval system with speech interface. 61-71
- Soundararajan Srinivasan, DeLiang L. Wang:
Robust speech recognition by integrating speech separation and hypothesis testing. 72-81
Volume 52, Number 2, February 2010
- Tingyao Wu, Jacques Duchateau, Jean-Pierre Martens, Dirk Van Compernolle:
Feature subset selection for improved native accent identification. 83-98
- Elizabeth K. Hanson, David R. Beukelman, Jana Kahl Heidemann, Erin Shutts-Johnson:
The impact of alphabet supplementation and word prediction on sentence intelligibility of electronically distorted speech. 99-105
- Yen-Liang Shue, Stefanie Shattuck-Hufnagel, Markus Iseli, Sun-Ah Jun, Nanette Veilleux, Abeer Alwan:
On the acoustic correlates of high and low nuclear pitch accents in American English. 106-122
- Jesús Vicente-Peña, Fernando Díaz-de-María, W. Bastiaan Kleijn:
The synergy between bounded-distance HMM and spectral subtraction for robust speech recognition. 123-133
- John H. L. Hansen, Xianxian Zhang:
Analysis of CFA-BF: Novel combined fixed/adaptive beamforming for robust speech recognition in real car environments. 134-149
- Andrew Beng Jin Teoh, Lee-Ying Chong:
Secure speech template protection in speaker verification system. 150-163
- Michael Pucher, Dietmar Schabus, Junichi Yamagishi, Friedrich Neubarth, Volker Strom:
Modeling and interpolation of Austrian German and Viennese dialect in HMM-based speech synthesis. 164-179
Volume 52, Number 3, March 2010
- Sharon Goldwater, Daniel Jurafsky, Christopher D. Manning:
Which words are hard to recognize? Prosodic, lexical, and disfluency factors that increase speech recognition error rates. 181-200
- Francisco Torreira, Martine Adda-Decker, Mirjam Ernestus:
The Nijmegen Corpus of Casual French. 201-212
- Fabio Valente:
Multi-stream speech recognition based on Dempster-Shafer combination rule. 213-222
- Jen-Tzung Chien, Chuang-Hua Chueh:
Joint acoustic and language modeling for speech recognition. 223-235
- Jáchym Kolár, Yang Liu, Elizabeth Shriberg:
Speaker adaptation of language and prosodic models for automatic dialog act segmentation of speech. 236-245
- Véronique Boulenger, Michel Hoen, Emmanuel Ferragne, François Pellegrino, Fanny Meunier:
Real-time lexical competitions during speech-in-speech comprehension. 246-253
- Juan Pablo Arias, Néstor Becerra Yoma, Hiram Vivanco:
Automatic intonation assessment for computer aided language learning. 254-267
Volume 52, Number 4, April 2010
- Bruce Denby, Tanja Schultz, Kiyoshi Honda:
Guest Editorial. 269
- Bruce Denby, Tanja Schultz, Kiyoshi Honda, Thomas Hueber, J. M. Gilbert, Jonathan S. Brumberg:
Silent speech interfaces. 270-287
- Thomas Hueber, Elie-Laurent Benaroya, Gérard Chollet, Bruce Denby, Gérard Dreyfus, Maureen Stone:
Development of a silent speech interface driven by ultrasound and optical images of the tongue and lips. 288-300
- Tatsuya Hirahara, Makoto Otani, Shota Shimizu, Tomoki Toda, Keigo Nakamura, Yoshitaka Nakajima, Kiyohiro Shikano:
Silent-speech enhancement using body-conducted vocal-tract resonance signals. 301-313
- Viet-Anh Tran, Gérard Bailly, Hélène Loevenbruck, Tomoki Toda:
Improvement to a NAM-captured whisper-to-speech system. 314-326
- Sanjay A. Patil, John H. L. Hansen:
The physiological microphone (PMIC): A competitive alternative for speaker assessment in stress detection and speaker verification. 327-340
- Tanja Schultz, Michael Wand:
Modeling coarticulation in EMG-based continuous speech recognition. 341-353
- Charles Jorgensen, Sorin Dusan:
Speech interfaces based upon surface electromyography. 354-366
- Jonathan S. Brumberg, Alfonso Nieto-Castañón, Philip R. Kennedy, Frank H. Guenther:
Brain-computer interfaces for speech communication. 367-379
Volume 52, Number 5, May 2010
- Teddy Surya Gunawan, Eliathamby Ambikairajah, Julien Epps:
Perceptual speech enhancement exploiting temporal masking properties of human auditory system. 381-393
- Roberto Barra-Chicote, Junichi Yamagishi, Simon King, Juan Manuel Montero, Javier Macías Guarasa:
Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech. 394-404
- Young Han Lee, Hong Kook Kim:
Entropy coding of compressed feature parameters for distributed speech recognition. 405-412
- Klára Vicsi, György Szaszák:
Using prosody to improve automatic speech recognition. 413-426
- Jacob Benesty, Jingdong Chen, Yiteng Huang:
On widely linear Wiener and tradeoff filters for noise reduction. 427-439
- Jesús Vicente-Peña, Fernando Díaz-de-María:
Uncertainty decoding on Frequency Filtered parameters for robust ASR. 440-449
- Kuldip K. Paliwal, Kamil K. Wójcicki, Belinda Schwerin:
Single-channel speech enhancement using spectral subtraction in the short-time modulation domain. 450-475
Volume 52, Number 6, June 2010
- Marion Dohen, Jean-Luc Schwartz, Gérard Bailly:
Speech and face-to-face communication - An introduction. 477-480
- Benjamin Weiss, Christine Kühnel, Ina Wechsung, Sascha Fagel, Sebastian Möller:
Quality of talking heads in different interaction and media contexts. 481-492
- Pierre Badin, Yuliya Tarabalka, Frédéric Elisei, Gérard Bailly:
Can you 'read' tongue movements? Evaluation of the contribution of tongue display to speech understanding. 493-503
- Panikos Heracleous, Denis Beautemps, Noureddine Aboutabit:
Cued Speech automatic recognition in normal-hearing and deaf subjects. 504-512
- Emilie Troille, Marie-Agnès Cathiard, Christian Abry:
Speech face perception is locked to anticipation in speech production. 513-524
- Mathilde Fort, Elsa Spinelli, Christophe Savariaux, Sonia Kandel:
The word superiority effect in audiovisual speech perception. 525-532
- Marc Sato, Giovanni Buccino, Maurizio Gentilucci, Luigi Cattaneo:
On the tip of the tongue: Modulation of the primary motor cortex during audiovisual speech perception. 533-541
- María L. Flecha-García:
Eyebrow raises in dialogue and their relation to discourse structure, utterance function and pitch accents in English. 542-554
- Erin Cvejic, Jeesun Kim, Chris Davis:
Prosody off the top of the head: Prosodic contrasts can be discriminated by head motion. 555-564
- Jean-Marc Colletta, Catherine Pellenq, Michèle Guidetti:
Age-related changes in co-speech gesture and narrative: Evidence from French children and adults. 565-576
- Vincent Aubanel, Noël Nguyen:
Automatic recognition of regional phonological variation in conversational interaction. 577-586
- Stefan Kopp:
Social resonance and embodied coordination in face-to-face conversation with artificial interlocutors. 587-597
- Gérard Bailly, Stephan Raidt, Frédéric Elisei:
Gaze, conversational agents and face-to-face communication. 598-612
Volume 52, Numbers 7-8, July - August 2010
- Dmitri Bitouk, Ragini Verma, Ani Nenkova:
Class-level spectral features for emotion recognition. 613-625
- Murray J. Munro, Tracey M. Derwing, Clifford S. Burgess:
Detection of nonnative speaker status from content-masked speech. 626-637
- Ulrich Reubold, Jonathan Harrington, Felicitas Kleber:
Vocal aging effects on F0 and the first formant: A longitudinal analysis in adult speakers. 638-651
- Kai Yu, Mark J. F. Gales, Lan Wang, Philip C. Woodland:
Unsupervised training and directed manual transcription for LVCSR. 652-663
- J. M. Górriz, Javier Ramírez, Elmar Wolfgang Lang, Carlos García Puntonet, Ignacio Turias:
Improved likelihood ratio test based voice activity detector applied to speech recognition. 664-677
- Claus Christiansen, Michael Syskind Pedersen, Torsten Dau:
Prediction of speech intelligibility based on an auditory preprocessing model. 678-692
Volume 52, Number 9, September 2010
- Sandipan Chakroborty, Goutam Saha:
Feature selection using singular value decomposition and QR factorization with column pivoting for text-independent speaker identification. 693-709
- Changchun Bao, Hao Xu, Bingyin Xia, Zhangyu Liu, Jianwei Qiu:
An efficient transcoding algorithm between AMR-NB and G.729ab. 710-724
- Ayyoob Jafari, Farshad Almasganj:
Using Laplacian eigenmaps latent variable model and manifold learning to improve speech recognition accuracy. 725-735
- Andrew Hines, Naomi Harte:
Speech intelligibility from image processing. 736-752
- Mohaddeseh Nosratighods, Eliathamby Ambikairajah, Julien Epps, Michael J. Carey:
A segment selection technique for speaker verification. 753-761
- Brady Laska, Miodrag Bolic, Rafik A. Goubran:
Discrete cosine transform particle filter speech enhancement. 762-775
Volume 52, Number 10, October 2010
- John H. L. Hansen, Sharmistha S. Gray, Wooil Kim:
Automatic voice onset time detection for unvoiced stops (/p/, /t/, /k/) with application to accent classification. 777-789
- Fabio Valente:
Hierarchical and parallel processing of auditory and modulation frequencies for automatic speech recognition. 790-800
- Korbinian Riedhammer, Benoît Favre, Dilek Hakkani-Tür:
Long story short - Global unsupervised models for keyphrase based meeting summarization. 801-815
- Klaus-Peter Engelbrecht, Sebastian Möller:
Sequential classifiers for the prediction of user judgments about spoken dialog systems. 816-833
- Zhen-Hua Ling, Korin Richmond, Junichi Yamagishi:
An Analysis of HMM-based prediction of articulatory movements. 834-846
- Josafá de Jesus Aguiar Pontes, Sadaoki Furui:
Predicting the phonetic realizations of word-final consonants in context - A challenge for French grapheme-to-phoneme converters. 847-862
Volume 52, Numbers 11-12, November - December 2010
- Anne Cutler, Martin Cooke, María Luisa García Lecumberri:
Preface. 863
- María Luisa García Lecumberri, Martin Cooke, Anne Cutler:
Non-native speech perception in adverse conditions: A review. 864-886
- Sven L. Mattys, Lucy M. Carroll, Carrie K. W. Li, Sonia L. Y. Chan:
Effects of energetic and informational masking on speech segmentation by native and non-native speakers. 887-899
- Chao-Yang Lee, Liang Tao, Zinny S. Bond:
Identification of multi-speaker Mandarin tones in noise by native and non-native listeners. 900-910
- Hugo Quené, L. E. van Delft:
Non-native durational patterns decrease speech intelligibility. 911-918
- Payam Ezzatian, Meital Avivi, Bruce A. Schneider:
Do nonnative listeners benefit as much as native listeners from spatial cues that release speech from masking? 919-929
- Ann R. Bradlow, Cynthia G. Clopper, Rajka Smiljanic, Mary Ann Walter:
A perceptual phonetic similarity space for languages: Evidence from five native language listener groups. 930-942
- Kristin J. Van Engen:
Similarity and familiarity: Second language sentence recognition in first- and second-language multi-talker babble. 943-953
- Martin Cooke, María Luisa García Lecumberri, Odette Scharenborg, Wim A. van Dommelen:
Language-independent processing in speech perception: Identification of English intervocalic consonants by speakers of eight European languages. 954-967
- Wim A. van Dommelen, Valérie Hazan:
Perception of English consonants in noise by native and Norwegian listeners. 968-979
- Mirjam Broersma, Odette Scharenborg:
Native and non-native listeners' perception of English consonants in different types of noise. 980-995
- Valérie Hazan, Jeesun Kim, Yuchun Chen:
Audiovisual perception in adverse conditions: Language, speaker and listener effects. 996-1009
- Jan Volín, Radek Skarnitzl:
The strength of foreign accent in Czech English under adverse listening conditions. 1010-1021
- Charlotte Gooskens, Vincent J. van Heuven, Renée van Bezooijen, Jos J. A. Pacilly:
Is spoken Danish less intelligible than Swedish? 1022-1037
- Antje Heinrich, Yvonne Flory, Sarah Hawkins:
Influence of English r-resonances on intelligibility of speech in noise for native English and German listeners. 1038-1055