IEEE Transactions on Audio, Speech & Language Processing, Volume 16
Volume 16, Number 1, January 2008
- Julio Vargas, Steve McLaughlin:
Cascade Prediction Filters With Adaptive Zeros to Track the Time-Varying Resonances of the Vocal Tract. 1-7 - Joseph Tepperman, Shrikanth S. Narayanan:
Using Articulatory Representations to Detect Segmental Errors in Nonnative Pronunciation. 8-22 - Ian McLoughlin:
Subjective Intelligibility Testing of Chinese Speech. 23-33 - Nicolas Malyska, Thomas F. Quatieri:
Spectral Representations of Nonmodal Phonation. 34-46 - Carlos Toshinori Ishi, Ken-Ichi Sakakibara, Hiroshi Ishiguro, Norihiro Hagita:
A Method for Automatic Detection of Vocal Fry. 47-56 - Volodya Grancharov, Jan H. Plasberg, Jonas Samuelsson, W. Bastiaan Kleijn:
Generalized Postfilter for Speech Quality Enhancement. 57-64 - L. Anders Ekman, W. Bastiaan Kleijn, Manohar N. Murthi:
Regularized Linear Prediction of Speech. 65-73 - Jerome R. Bellegarda:
Unit-Centric Feature Mapping for Inventory Pruning in Unit Selection Text-to-Speech Synthesis. 74-82 - Gerard Hotho, Lars F. Villemoes, Jeroen Breebaart:
A Backward-Compatible Multichannel Audio Codec. 83-93 - Te Li, Susanto Rahardja, Soo Ngee Koh:
Frequency Region-Based Prioritized Bit-Plane Coding for Scalable Audio. 94-105 - S. Grofit, Yizhar Lavner:
Time-Scale Modification of Audio Signals Using Enhanced WSOLA With Management of Transients. 106-115 - Pierre Leveau, Emmanuel Vincent, Gaël Richard, Laurent Daudet:
Instrument-Specific Harmonic Atoms for Mid-Level Music Representation. 116-128 - Charles D. Creusere, K. D. Kallakuri, Rahul Vanam:
An Objective Metric of Human Subjective Audio Quality Optimized for a Wide Range of Audio Fidelities. 129-136 - Wei Chu, Benoît Champagne:
A Noise-Robust FFT-Based Auditory Spectrum With Application in Audio Classification. 137-150 - Heidi Christensen, Yoshihiko Gotoh, Steve Renals:
A Cascaded Broadcast News Highlighter. 151-161 - Yekutiel Avargel, Israel Cohen:
Adaptive System Identification in the Short-Time Fourier Transform Domain Using Cross-Multiplicative Transfer Function Approximation. 162-173 - Cédric Févotte, Bruno Torrésani, Laurent Daudet, Simon J. Godsill:
Sparse Linear Regression With Structured Priors and Application to Denoising of Musical Audio. 174-185 - A. S. Park, James R. Glass:
Unsupervised Pattern Discovery in Speech. 186-197 - Jen-Tzung Chien, Meng-Sung Wu:
Adaptive Bayesian Latent Semantic Analysis. 198-207 - Imed Zitouni:
Constrained Minimization and Discriminative Training for Natural Language Call Routing. 208-215 - Sankaranarayanan Ananthakrishnan, Shrikanth S. Narayanan:
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence. 216-228 - Yi Hu, Philipos C. Loizou:
Evaluation of Objective Quality Measures for Speech Enhancement. 229-238 - Jen-Tzung Chien, Chuan-Wei Ting:
Factor Analyzed Subspace Modeling and Selection. 239-248
Volume 16, Number 2, February 2008
- Anssi Klapuri:
Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model. 255-266 - Mark R. Every:
Discriminating Between Pitched Sources in Music Audio. 267-277 - Mathieu Lagrange, Luis Gustavo Martins, Jennifer Murdoch, George Tzanetakis:
Normalized Cuts for Predominant Melodic Source Separation. 278-290 - Kyogu Lee, Malcolm Slaney:
Acoustic Chord Transcription and Key Extraction From Audio Using Key-Dependent HMMs Trained on Synthesized Audio. 291-301 - Peter Jan O. Doets, Reginald L. Lagendijk:
Distortion Estimation in Compressed Music Using Only Audio Fingerprints. 302-317 - Mark Levy, Mark B. Sandler:
Structural Segmentation of Musical Audio by Constrained Clustering. 318-326 - Shlomo Dubnov:
Unified View of Prediction and Repetition Structure in Audio Signals With Application to Interest Point Detection. 327-337 - Min-Yen Kan, Ye Wang, Denny Iskandar, Tin Lay Nwe, Arun Shenoy:
LyricAlly: Automatic Synchronization of Textual Lyrics to Acoustic Music Signals. 338-349 - Jyh-Shing Roger Jang, Hong-Ru Lee:
A General Framework of Progressive Filtering and Its Application to Query by Singing/Humming. 350-358 - Erdem Unal, Elaine Chew, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Challenging Uncertainty in Query by Humming Systems: A Fingerprinting Approach. 359-371 - Iman S. H. Suyoto, Alexandra L. Uitdenbogerd, Falk Scholer:
Searching Musical Audio Using Symbolic Queries. 372-381 - Frank Kurth, Meinard Müller:
Efficient Index-Based Audio Matching. 382-395 - Akihiro Kimura, Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase:
A Quick Search Method for Audio Signals Based on a Piecewise Linear Representation of Feature Trajectories. 396-407 - Elias Pampalk, Perfecto Herrera, Masataka Goto:
Computational Models of Similarity for Drum Samples. 408-423 - Andre Holzapfel, Yannis Stylianou:
Musical Genre Classification Using Nonnegative Matrix Factorization-Based Features. 424-434 - Kazuyoshi Yoshii, Masataka Goto, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
An Efficient Hybrid Music Recommender System Using an Incrementally Trainable Probabilistic Generative Model. 435-447 - Yi-Hsuan Yang, Yu-Ching Lin, Ya-Fan Su, Homer H. Chen:
A Regression Approach to Music Emotion Recognition. 448-457 - Luca Mion, Giovanni De Poli:
Score-Independent Audio Features for Description of Music Expression. 458-466 - Douglas Turnbull, Luke Barrington, David A. Torres, Gert R. G. Lanckriet:
Semantic Annotation and Retrieval of Music and Sound Effects. 467-476
Volume 16, Number 3, March 2008
- Jingdong Chen, Jacob Benesty, Yiteng Huang:
A Minimum Distortion Noise Reduction Algorithm With Multiple Microphones. 481-493 - Yonggang Deng, William J. Byrne:
HMM Word and Phrase Alignment for Statistical Machine Translation. 494-507 - Giulia Garau, Steve Renals:
Combining Spectral Representations for Large-Vocabulary Continuous Speech Recognition. 508-518 - Jian Xue, Yunxin Zhao:
Random Forests of Phonetic Decision Trees for Acoustic Modeling in Conversational Speech Recognition. 519-528 - Olivier Gillet, Gaël Richard:
Transcription and Separation of Drum Signals From Polyphonic Music. 529-540 - Richard C. Hendriks, Jesper Jensen, Richard Heusdens:
Noise Tracking Using DFT Domain Subspace Decompositions. 541-553 - Haibin Huang, Pasi Fränti, Dong-Yan Huang, Susanto Rahardja:
Cascaded RLS-LMS Prediction in MPEG-4 Lossless Audio Coding. 554-562 - Jeih-Weih Hung, Wei-Yi Tsai:
Constructing Modulation Frequency Domain-Based Features for Robust Speech Recognition. 563-577 - Antonio Miguel, Eduardo Lleida, Richard C. Rose, Luis Buera, Oscar Saz, Alfonso Ortega:
Capturing Local Variability for Speaker Normalization in Speech Recognition. 578-593 - Norman Poh, Josef Kittler:
Incorporating Model-Specific Score Distribution in Speaker Verification Systems. 594-606 - Yun Tang, Richard C. Rose:
Rapid Speaker Adaptation Using Clustered Maximum-Likelihood Linear Basis With Sparse Training Data. 607-616 - Jeremy Morris, Eric Fosler-Lussier:
Conditional Random Fields for Integrating Local Discriminative Classifiers. 617-628 - Oscal T.-C. Chen, Wen-Chih Wu:
Highly Robust, Secure, and Perceptual-Quality Echo Hiding Scheme. 629-638 - Shoichiro Saito, Hirokazu Kameoka, Keigo Takahashi, Takuya Nishimoto, Shigeki Sagayama:
Specmurt Analysis of Polyphonic Music Signals. 639-650 - Simon Shelley, Damian T. Murphy:
The Modeling of Diffuse Boundaries in the 2-D Digital Waveguide Mesh. 651-665 - Iain McCowan, Mike Lincoln, Ivan Himawan:
Microphone Array Shape Calibration in Diffuse Noise Fields. 666-670 - Bob L. Sturm, John J. Shynk, Laurent Daudet, Curtis Roads:
Dark Energy in Sparse Atomic Estimations. 671-676
Volume 16, Number 4, May 2008
- Chi-Min Liu, Han-Wen Hsu, Wen-Chieh Lee:
Compression Artifacts in Perceptual Audio Coding. 681-695 - Masahiro Yukawa, Rodrigo C. de Lamare, Raimundo Sampaio Neto:
Efficient Acoustic Echo Cancellation With Reduced-Rank Adaptive Filtering Based on Selective Decimation and Adaptive Interpolation. 696-710 - Gal Reuven, Sharon Gannot, Israel Cohen:
Dual-Source Transfer-Function Generalized Sidelobe Canceller. 711-727 - Nicoleta Roman, DeLiang Wang:
Binaural Tracking of Multiple Moving Sources. 728-739 - Boaz Rafaely:
The Spherical-Shell Microphone Array. 740-747 - Banu Gunel, Hüseyin Hacihabiboglu, Ahmet M. Kondoz:
Acoustic Source Separation of Convolutive Mixtures Based on Intensity Vector Statistics. 748-756 - Jacob Benesty, Jingdong Chen, Yiteng Huang:
On the Importance of the Pearson Correlation Coefficient in Noise Reduction. 757-765 - Zhiyao Duan, Yungang Zhang, Changshui Zhang, Zhenwei Shi:
Unsupervised Single-Channel Music Source Separation by Average Harmonic Structure Modeling. 766-778 - Sibel Yaman, Chin-Hui Lee:
A Flexible Classifier Design Framework Based on Multiobjective Programming. 779-789 - Simon Tucker, Steve Whittaker:
Temporal Compression Of Speech: An Evaluation. 790-796 - Vivek Kumar Rangarajan Sridhar, Srinivas Bangalore, Shrikanth S. Narayanan:
Exploiting Acoustic and Syntactic Features for Automatic Prosody Labeling in a Maximum Entropy Framework. 797-811 - Fabio Antonacci, Marco Foco, Augusto Sarti, Stefano Tubaro:
Fast Tracing of Acoustic Beams and Paths Through Visibility Lookup. 812-824 - Tim Fingscheidt, Suhadi Suhadi, Sorel Stan:
Environment-Optimized Speech Enhancement. 825-834 - David Yuheng Zhao, W. Bastiaan Kleijn, Alexander Ypma, Bert de Vries:
Online Noise Estimation Using Stochastic-Gain HMM for Speech Enhancement. 835-846 - John Grothendieck, Allen L. Gorin:
Towards Link Characterization From Content: Recovering Distributions From Classifier Output. 847-858 - Chia-Yu Wan, Lin-Shan Lee:
Histogram-Based Quantization for Robust and/or Distributed Speech Recognition. 859-873
Volume 16, Number 5, July 2008
- Norman H. Adams, Gregory H. Wakefield:
State-Space Synthesis of Virtual Auditory Space. 881-890 - Jianping Deng, Martin Bouchard, Tet Hin Yeap:
Feature Enhancement for Noisy Speech Recognition With a Time-Variant Linear Predictive HMM Structure. 891-899 - Peng Liu, Cong Liu, Hui Jiang, Frank K. Soong, Ren-Hua Wang:
A Constrained Line Search Optimization Method for Discriminative Training of HMMs. 900-909 - Timo Gerkmann, Colin Breithaupt, Rainer Martin:
Improved A Posteriori Speech Presence Probability Estimation Based on a Likelihood Ratio With Fixed Priors. 910-919 - Margarita Kotti, Emmanouil Benetos, Costas Kotropoulos:
Computationally Efficient and Robust BIC-Based Speaker Segmentation. 920-933 - Hüseyin Hacihabiboglu, Banu Gunel, Ahmet M. Kondoz:
Time-Domain Simulation of Directive Sources in 3-D Digital Waveguide Mesh-Based Acoustical Models. 934-946 - Matti Karjalainen:
Efficient Realization of Wave Digital Components for Physical Modeling and Sound Synthesis. 947-956 - Yiteng Huang, Jacob Benesty, Jingdong Chen:
Analysis and Comparison of Multichannel Noise Reduction Methods in a Common Framework. 957-968 - Srivatsan Kandadai, Charles D. Creusere:
Scalable Audio Compression at Low Bitrates. 969-979 - Patrick Kenny, Pierre Ouellet, Najim Dehak, Vishwa Gupta, Pierre Dumouchel:
A Study of Interspeaker Variability in Speaker Verification. 980-988 - J. Paschedag, Boris Lohmann:
Error Convergence of the Filtered-X LMS Algorithm for Multiple Harmonic Excitation. 989-999 - Yegui Xiao, Akira Ikuta, Liying Ma, Khashayar Khorasani:
Stochastic Analysis of the FXLMS-Based Narrowband Active Noise Control System. 1000-1014 - Michael A. Casey, Christophe Rhodes, Malcolm Slaney:
Analysis of Minimum Distances in High-Dimensional Musical Spaces. 1015-1028 - Khe Chai Sim, Haizhou Li:
On Acoustic Diversification Front-End for Spoken Language Identification. 1029-1037 - Rasool Tahmasbi, Sadegh Rezaei:
Change Point Detection in GARCH Models for Voice Activity Detection. 1038-1046 - Valentin Ion, Reinhold Haeb-Umbach:
A Novel Uncertainty Decoding Rule With Applications to Transmission Error Robust Speech Recognition. 1047-1060 - Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, Alex Acero:
Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor. 1061-1070
Volume 16, Number 6, August 2008
- Jia-Li You, Yining Chen, Min Chu, Frank K. Soong, Jin-Lin Wang:
Identifying Language Origin of Named Entity With Multiple Information Sources. 1077-1086 - K. I. Nordstrom, George Tzanetakis, Peter F. Driessen:
Transforming Perceived Vocal Effort and Breathiness Using Adaptive Pre-Emphasis Linear Prediction. 1087-1096 - Marco Grimaldi, Fred Cummins:
Speaker Identification Using Instantaneous Frequencies. 1097-1111 - Jan S. Erkelens, Richard Heusdens:
Tracking of Nonstationary Noise Based on Data-Driven Recursive Noise Power Estimation. 1112-1123 - Hannu Pulakka, Laura Laaksonen, Martti Vainio, Jouni Pohjalainen, Paavo Alku:
Evaluation of an Artificial Speech Bandwidth Extension Method in Three Languages. 1124-1137 - Joan Serrà, Emilia Gómez, Perfecto Herrera, Xavier Serra:
Chroma Binary Similarity and Local Alignment Applied to Cover Song Identification. 1138-1151 - Ioannis Karydis, Alexandros Nanopoulos, Apostolos N. Papadopoulos, Dimitrios Katsaros, Yannis Manolopoulos:
Music Retrieval Over Wireless Ad-Hoc Networks. 1152-1162 - Kees van den Doel, Uri M. Ascher:
Real-Time Numerical Solution of Webster's Equation on A Nonuniform Grid. 1163-1172 - T. Scott Brandes:
Feature Vector Selection and Use With Hidden Markov Models to Identify Frequency-Modulated Bioacoustic Signals Amidst Noise. 1173-1180 - Uttachai Manmontri, Patrick A. Naylor:
A Class of Frobenius Norm-Based Algorithms Using Penalty Term and Natural Gradient for Blind Signal Separation. 1181-1193 - Manolis Perakakis, Alexandros Potamianos:
A Study in Efficiency and Modality Usage in Multimodal Form Filling Systems. 1194-1206 - Sibel Yaman, Li Deng, Dong Yu, Ye-Yi Wang, Alex Acero:
An Integrative and Discriminative Technique for Spoken Utterance Classification. 1207-1214
Volume 16, Number 7, September 2008
- Evgeny Matusov, Gregor Leusch, Rafael E. Banchs, Nicola Bertoldi, Daniel Déchelotte, Marcello Federico, Muntsin Kolss, Young-Suk Lee, José B. Mariño, Matthias Paulik, Salim Roukos, Holger Schwenk, Hermann Ney:
System Combination for Machine Translation of Spoken and Written Language. 1222-1237 - Mike Dowman, Virginia Savova, Thomas L. Griffiths, Konrad P. Körding, Joshua B. Tenenbaum, Matthew Purver:
A Probabilistic Model of Meetings That Combines Words and Discourse Features. 1238-1248 - Srinivas Bangalore, Giuseppe Di Fabbrizio, Amanda Stent:
Learning the Structure of Task-Driven Human-Human Dialogs. 1249-1259 - Hany Hassan, Khalil Sima'an, Andy Way:
Syntactically Lexicalized Phrase-Based SMT. 1260-1273 - Christoph Tillmann, Tong Zhang:
An Online Relevant Set Algorithm for Statistical Machine Translation. 1274-1286 - Minwoo Jeong, Gary Geunbae Lee:
Triangular-Chain Conditional Random Fields. 1287-1302 - Alfred Dielmann, Steve Renals:
Recognition of Dialogue Acts in Multiparty Meetings Using a Switching DBN. 1303-1314 - Min Zhang, Wanxiang Che, Guodong Zhou, AiTi Aw, Chew Lim Tan, Ting Liu, Sheng Li:
Semantic Role Labeling Using a Grammar-Driven Convolution Tree Kernel. 1315-1329 - Ruhi Sarikaya, Mohamed Afify, Yonggang Deng, Hakan Erdogan, Yuqing Gao:
Joint Morphological-Lexical Language Modeling for Processing Morphologically Rich Languages With Application to Dialectal Arabic. 1330-1339 - Francesc Alías, Xavier Sevillano, Joan Claudi Socoró, Xavi Gonzalvo:
Towards High-Quality Next-Generation Text-to-Speech Synthesis: A Multidomain Approach by Automatic Domain Classification. 1340-1354
Volume 16, Number 8, November 2008
- Emmanuel Ravelli, Gaël Richard, Laurent Daudet:
Union of MDCT Bases for Audio Coding. 1361-1372 - Olivier Derrien, Gaël Richard:
A New Model-Based Algorithm for Optimizing the MPEG-AAC in MS-Stereo. 1373-1382 - Alberto Carini, Silvia Malatini:
Optimal Variable Step-Size NLMS Algorithms With Auxiliary Noise Power Scheduling for Feedforward Active Noise Control. 1383-1395 - Miguel Ferrer, Alberto González, Maria de Diego, Gema Pinero:
Fast Affine Projection Algorithms for Filtered-x Multichannel Active Noise Control. 1396-1408 - Ming Wu, Guoyue Chen, Xiaojun Qiu:
An Improved Active Noise Control Algorithm Without Secondary Path Identification Based on the Frequency-Domain Subband Architecture. 1409-1419 - Jian-Wu Xu, José Carlos Príncipe:
A Pitch Detector Based on a Generalized Correlation Function. 1420-1432 - Emanuël A. P. Habets, Sharon Gannot, Israel Cohen, P. Sommen:
Joint Dereverberation and Residual Echo Suppression of Speech Signals in Noisy Environments. 1433-1451 - Jacob H. Gunther, Gerald Wilson:
Mean-Squared Error Analysis of Adaptive Subband-Based System Identification. 1452-1465 - Constantin Paleologu, Jacob Benesty, Silviu Ciochina:
A Variable Step-Size Affine Projection Algorithm Designed for Acoustic Echo Cancellation. 1466-1478 - Jan Scheuing, Bin Yang:
Disambiguation of TDOA Estimation for Multiple Sources in Reverberant Environments. 1479-1489 - Jacek Dmochowski, Jacob Benesty, Sofiène Affes:
Linearly Constrained Minimum Variance Source Localization and Spectral Estimation. 1490-1502 - Jeroen Breebaart, Erik Schuijers:
Phantom Materialization: A Novel Method to Enhance Stereo Audio Reproduction on Headphones. 1503-1511 - Tomohiro Nakatani, Biing-Hwang Juang, Takuya Yoshioka, Keisuke Kinoshita, Marc Delcroix, Masato Miyoshi:
Speech Dereverberation Based on Maximum-Likelihood Estimation With Time-Varying Gaussian Source Model. 1512-1527 - Ari Abramson, Israel Cohen:
Single-Sensor Audio Source Separation Using Classification and Estimation Approach and GARCH Modeling. 1528-1540 - Chang-Hsing Lee, Chin-Chuan Han, Ching-Chien Chuang:
Automatic Classification of Bird Species From Their Sounds Using Two-Dimensional Cepstral Coefficients. 1541-1550 - Shahram Khadivi, Hermann Ney:
Integration of Speech Recognition and Machine Translation in Computer-Assisted Translation. 1551-1564 - Juan Manuel Górriz, Javier Ramírez, Elmar Wolfgang Lang, Carlos García Puntonet:
Jointly Gaussian PDF-Based Likelihood Ratio Test for Voice Activity Detection. 1565-1578 - Tiago H. Falk, Wai-Yip Chan:
Hybrid Signal-and-Link-Parametric Speech Quality Measurement for VoIP Communications. 1579-1589 - Kyu Jeong Han, Samuel Kim, Shrikanth S. Narayanan:
Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarization. 1590-1601 - K. Sri Rama Murty, B. Yegnanarayana:
Epoch Extraction From Speech Signals. 1602-1613 - Eric Plourde, Benoît Champagne:
Auditory-Based Spectral Amplitude Estimators for Speech Enhancement. 1614-1623 - Benny Sallberg, Nedelko Grbic, Ingvar Claesson:
Complex-Valued Independent Component Analysis for Online Blind Speech Extraction. 1624-1632 - Hai Huyen Dam, Hai Quang Dam, Sven Nordholm:
Noise Statistics Update Adaptive Beamformer With PSD Estimation for Speech Extraction in Noisy Environment. 1633-1641 - Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee:
Optimizing the Performance of Spoken Language Recognition With Discriminative Training. 1642-1653 - Kevin M. Indrebo, Richard J. Povinelli, Michael T. Johnson:
Minimum Mean-Squared Error Estimation of Mel-Frequency Cepstral Coefficients Using a Novel Distortion Model. 1654-1661 - Xiong Xiao, Chng Eng Siong, Haizhou Li:
Normalization of the Speech Modulation Spectra for Robust Speech Recognition. 1662-1674 - Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang, Ruei-Chuan Chang:
Using Kernel Discriminant Analysis to Improve the Characterization of the Alternative Hypothesis for Speaker Verification. 1675-1684 - Ruohua Zhou, Marco Mattavelli, Giorgio Zoia:
Music Onset Detection Based on Resonator Time Frequency Image. 1685-1695 - Nicola Bertoldi, Richard Zens, Marcello Federico, Wade Shen:
Efficient Speech Translation Through Confusion Network Decoding. 1696-1705 - Ming Wu, Xiaojun Qiu, Guoyue Chen:
An Overlap-Save Frequency-Domain Implementation of the Delayless Subband ANC Algorithm. 1706-1710