default search action
SLT 2014: South Lake Tahoe, NV, USA
- 2014 IEEE Spoken Language Technology Workshop, SLT 2014, South Lake Tahoe, NV, USA, December 7-10, 2014. IEEE 2014, ISBN 978-1-4799-7129-9
- Ali Orkan Bayer, Giuseppe Riccardi:
Semantic language models for Automatic Speech Recognition. 7-12 - Anna Schmidt, Youssef Oualil, Oliver Ohneiser, Matthias Kleinert, Marc Schulder, Arif Khan, Hartmut Helmke, Dietrich Klakow:
Context-based recognition network adaptation for improving on-line ASR in Air Traffic Control. 13-18 - Seyed Hamidreza Mohammadi, Alexander Kain:
Voice conversion using deep neural networks with speaker-independent pre-training. 19-23 - Masahiro Saiko, Hitoshi Yamamoto, Ryosuke Isotani, Chiori Hori:
Efficient multi-lingual unsupervised acoustic model training under mismatch conditions. 24-29 - Vincent Renkens, Steven Janssens, Bart Ons, Jort F. Gemmeke, Hugo Van hamme:
Acquisition of ordinal words using weakly supervised NMF. 30-35 - Basil Abraham, Neethu Mariam Joy, Navneeth K. S. Umesh:
A data-driven phoneme mapping technique using interpolation vectors of phone-cluster adaptive training. 36-41 - Md. Akmal Haidar, Douglas D. O'Shaughnessy:
Document-based Dirichlet class language model for speech recognition using document-based n-gram events. 42-47 - Frantisek Grézl, Ekaterina Egorova, Martin Karafiát:
Further investigation into multilingual training and adaptation of stacked bottle-neck neural network structure. 48-53 - Weiran Wang, Raman Arora, Karen Livescu:
Reconstruction of articulatory measurements with smoothed low-rank matrix completion. 54-59 - Hiroaki Sugiyama, Toyomi Meguro, Ryuichiro Higashinaka, Yasuhiro Minami:
Open-domain utterance generation using phrase pairs based on dependency relations. 60-65 - Bing Zhao, Yik-Cheung Tam:
Bilingual Recurrent Neural Networks for improved statistical machine translation. 66-70 - Maryam Siahbani, Ramtin Mehdizadeh Seraj, Baskaran Sankaran, Anoop Sarkar:
Incremental translation using hierarchichal phrase-based translation system. 71-76 - Alan Wisler, Visar Berisha, Julie Liss, Andreas Spanias:
Domain invariant speech features using a new divergence measure. 77-82 - Zhiyang He, Ji Wu, Ping Lv:
Label correlation mixture model for multi-label text categorization. 83-88 - Jose Sousa, Fabiola Araujo, Aldebaro Klautau:
Utterance copy for Klatt's speech synthesizer using genetic algorithm. 89-94 - Lara J. Martin, Matthew Stone, Florian Metze, Jack Mostow:
A methodology for using crowdsourced data to measure uncertainty in natural speech. 95-99 - Herman Kamper, Aren Jansen, Simon King, Sharon Goldwater:
Unsupervised lexical clustering of speech segments using fixed-dimensional acoustic embeddings. 100-105 - Gabriel Synnaeve, Thomas Schatz, Emmanuel Dupoux:
Phonetics embedding learning with side information. 106-111 - Heriberto Cuayáhuitl, Nina Dethlefs, Helen F. Hastie, Xingkun Liu:
Training a statistical surface realiser from automatic slot labelling. 112-117 - Oscar Saz, Mortaza Doulaty, Thomas Hain:
Background-tracking acoustic features for genre identification of broadcast shows. 118-123 - Steven J. Rennie, Vaibhava Goel, Samuel Thomas:
Deep Order Statistic Networks. 124-128 - Murali Karthick B, Srinivasan Umesh:
Improving deep neural networks using state projection vectors of subspace Gaussian mixture model as features. 129-134 - Romain Serizel, Diego Giuliani:
Vocal tract length normalisation approaches to DNN-based children's and adults' speech recognition. 135-140 - Pengyuan Zhang, Yulan Liu, Thomas Hain:
Semi-supervised DNN training in meeting recognition. 141-146 - Jen-Tzung Chien, Tsai-Wei Lu:
Tikhonov regularization for deep neural network acoustic modeling. 147-152 - Ryan Price, Ken-ichi Iso, Koichi Shinoda:
Speaker adaptation of deep neural networks using a hierarchy of output layers. 153-158 - Steven J. Rennie, Vaibhava Goel, Samuel Thomas:
Annealed dropout training of deep networks. 159-164 - Yajie Miao, Lu Jiang, Hao Zhang, Florian Metze:
Improvements to speaker adaptive training of deep neural networks. 165-170 - Pawel Swietojanski, Steve Renals:
Learning hidden unit contributions for unsupervised speaker adaptation of neural network acoustic models. 171-176 - Yuzong Liu, Katrin Kirchhoff:
Graph-based semi-supervised acoustic modeling in DNN-based speech recognition. 177-182 - George Saon:
A distributed architecture for fast SGD sequence discriminative training of DNN acoustic models. 183-188 - Kaisheng Yao, Baolin Peng, Yu Zhang, Dong Yu, Geoffrey Zweig, Yangyang Shi:
Spoken language understanding using long short-term memory neural networks. 189-194 - Xiaohu Liu, Ruhi Sarikaya:
A discriminative model based entity dictionary weighting approach for spoken language understanding. 195-199 - Kai Hong, Pengjun Pei, Ye-Yi Wang, Dilek Hakkani-Tür:
Entity ranking for descriptive queries. 200-205 - Jen-Tzung Chien, Yuan-Chu Ku:
Bayesian recurrent neural network language model. 206-211 - Mickael Rouvier, Benoît Favre, Frédéric Béchet:
Joint decoding of complementary utterances. 212-217 - Mohamed Morchid, Richard Dufour, Mohamed Bouallegue, Georges Linarès:
Author-topic based representation of call-center conversations. 218-223 - Xiang Li, Gökhan Tür, Dilek Hakkani-Tür, Qi Li:
Personal knowledge graph population from user utterances in conversational understanding. 224-229 - Ji He, Alex Marin, Mari Ostendorf:
Effective data-driven feature learning for detecting name errors in automatic speech recognition. 230-235 - Gina-Anne Levow, Valerie Freeman, Alena Hrynkevich, Mari Ostendorf, Richard A. Wright, Julian Chan, Yi Luan, Trang Tran:
Recognition of stance strength and polarity in spontaneous speech. 236-241 - Yun-Nung Chen, Dilek Hakkani-Tür, Gökhan Tür:
Deriving local relational surface forms from dependency-based entity embeddings for unsupervised spoken language understanding. 242-247 - Jort F. Gemmeke, Siddharth Sehgal, Stuart P. Cunningham, Hugo Van hamme:
Dysarthric vocal interfaces with minimal training data. 248-253 - Heidi Christensen, I. Casanueva, Stuart P. Cunningham, Phil D. Green, Thomas Hain:
Automatic selection of speakers for improved acoustic modelling: recognition of disordered speech with sparse data. 254-259 - Xiaodan Zhuang, Viktor Rozgic, Michael Crystal, Brian Marx:
Improving speech-based PTSD detection via multi-view learning. 260-265 - Emily Prud'hommeaux, Eric Morley, Masoud Rouhizadeh, Laura Silverman, Jan P. H. van Santen, Brian Roark, Richard Sproat, Sarah Kauper, Rachel DeLaHunta:
Computational analysis of trajectories of linguistic development in autism. 266-271 - Mahsa Sadat Elyasi Langarani, Jan P. H. van Santen:
Modeling fundamental frequency dynamics in hypokinetic dysarthria. 272-276 - Verena Venek, Stefan Scherer, Louis-Philippe Morency, Albert A. Rizzo, John Pestian:
Adolescent suicidal risk assessment in clinician-patient interaction: A study of verbal and acoustic behaviors. 277-282 - Kyusong Lee, Seonghan Ryu, Hongsuck Seo, Seokhwan Kim, Gary Geunbae Lee:
Grammatical error correction based on learner comprehension model in oral conversation. 283-287 - Nichola Lubold, Heather Pon-Barry:
A comparison of acoustic-prosodic entrainment in face-to-face and remote collaborative learning dialogues. 288-293 - Jidong Tao, Keelan Evanini, Xinhao Wang:
The influence of automatic speech recognition accuracy on the performance of an automated speech assessment system. 294-299 - Xuesong Yang, Anastassia Loukina, Keelan Evanini:
Machine learning approaches to improving pronunciation error detection on an imbalanced corpus. 300-305 - Lasguido Nio, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura:
Improving the robustness of example-based dialog retrieval using recursive neural network paraphrase identification. 306-311 - Lihong Li, He He, Jason D. Williams:
Temporal supervised learning for inferring a dialog policy from example conversations. 312-317 - Yi Ma, Eric Fosler-Lussier:
A discriminative sequence model for dialog state tracking using user goal change detection. 318-323 - Matthew Henderson, Blaise Thomson, Jason D. Williams:
The third Dialog State Tracking Challenge. 324-329 - Kai Sun, Lu Chen, Su Zhu, Kai Yu:
A generalized rule based tracker for dialogue state tracking. 330-335 - Su Zhu, Lu Chen, Kai Sun, Da Zheng, Kai Yu:
Semantic parser enhancement for dialogue domain extension with little data. 336-341 - Hang Ren, Weiqun Xu, Yonghong Yan:
Markovian discriminative modeling for cross-domain dialog state tracking. 342-347 - Rudolf Kadlec, Miroslav Vodolán, Jindrich Libovický, Jan Macek, Jan Kleindienst:
Knowledge-based Dialog State Tracking. 348-353 - Dongho Kim, Matthew Henderson, Milica Gasic, Pirros Tsiakoulis, Steve J. Young:
The use of discriminative belief tracking in POMDP-based dialogue systems. 354-359 - Matthew Henderson, Blaise Thomson, Steve J. Young:
Robust dialog state tracking using delexicalised recurrent neural networks and unsupervised adaptation. 360-365 - Sebastian Schuster, Stephanie Pancoast, Milind Ganjoo, Michael C. Frank, Dan Jurafsky:
Speaker-independent detection of child-directed speech. 366-371 - Abhinav Misra, John H. L. Hansen:
Spoken language mismatch in speaker verification: An investigation with NIST-SRE and CRSS Bi-Ling corpora. 372-377 - Daniel Garcia-Romero, Xiaohui Zhang, Alan McCree, Daniel Povey:
Improving speaker recognition performance in the domain adaptation challenge using deep neural networks. 378-383 - Qian Zhang, John H. L. Hansen:
Training candidate selection for effective rejection in open-set language identification. 384-389 - Xavier Bost, Georges Linarès:
Constrained speaker diarization of TV series based on visual patterns. 390-395 - Maria Joana Correia, Alberto Abad, Isabel Trancoso:
Exploiting magnitude and phase spectral information for converted speech detection. 396-401 - Sree Harsha Yella, Andreas Stolcke, Malcolm Slaney:
Artificial neural network features for speaker diarization. 402-406 - Brian Thompson:
Discrimination between singing and speech in real-world audio. 407-412 - Gregory Sell, Daniel Garcia-Romero:
Speaker diarization with plda i-vector scoring and unsupervised calibration. 413-417 - Gang Liu, Chengzhu Yu, Navid Shokouhi, Abhinav Misra, Hua Xing, John H. L. Hansen:
Utilization of unlabeled development data for speaker verification. 418-423 - Di Xu, Yun Wang, Florian Metze:
EM-based phoneme confusion matrix generation for low-resource spoken term detection. 424-429 - Van Tung Pham, Nancy F. Chen, Sunil Sivadas, Haihua Xu, I-Fan Chen, Chongjia Ni, Engsiong Chng, Haizhou Li:
System and keyword dependent fusion for spoken term detection. 430-435 - Shi-wook Lee, Kazuyo Tanaka, Yoshiaki Itoh:
Effective combination of heterogeneous subword-based spoken term detection systems. 436-441 - Jonathan Wintrode, Sanjeev Khudanpur:
Combining local and broad topic context to improve term detection. 442-447 - Khe Chai Sim:
A multimodal stroke-based predictive input for efficient Chinese text entry on mobile devices. 448-453 - Yuan Liang, Koji Iwano, Koichi Shinoda:
An efficient error correction interface for speech recognition on mobile touchscreen devices. 454-459 - Matthias Sperber, Graham Neubig, Satoshi Nakamura, Alex Waibel:
On-the-fly user modeling for cost-sensitive correction of speech transcripts. 460-465 - Nurul Lubis, Dessi Puji Lestari, Ayu Purwarianti, Sakriani Sakti, Satoshi Nakamura:
Emotion recognition on Indonesian television talk shows. 466-471 - Mohammed Abdel-Wahab, Carlos Busso:
Evaluation of syllable rate estimation in expressive speech and its contribution to emotion recognition. 472-477 - Mostafa Ali Shahin, Beena Ahmed, Kirrie J. Ballard:
Classification of lexical stress patterns using deep neural network architecture. 478-482 - Zhipeng Chen, Teng Zhang, Ji Wu:
Subword scheme for keyword search. 483-488 - Hang Su, James Hieronymus, Yanzhang He, Eric Fosler-Lussier, Steven Wegmann:
Syllable based keyword search: Transducing syllable lattices to word lattices. 489-494 - Matti Varjokallio, Mikko Kurimo:
A word-level token-passing decoder for subword n-gram LVCSR. 495-500 - Martin Karafiát, Karel Veselý, Igor Szöke, Lukás Burget, Frantisek Grézl, Mirko Hannemann, Jan Cernocký:
But ASR system for BABEL Surprise evaluation 2014. 501-506 - Seyedmahdad Mirsamadi, John H. L. Hansen:
Multichannel feature enhancement in distributed microphone arrays for robust distant speech recognition in smart rooms. 507-512 - Christos Koniaris, Saikat Chatterjee:
A sparsity based preprocessing for noise robust speech recognition. 513-518 - Deepak Baby, Tuomas Virtanen, Jort F. Gemmeke, Tom Barker, Hugo Van hamme:
Exemplar-based noise robust automatic speech recognition using modulation spectrogram features. 519-524 - Ahmed Ali, Yifan Zhang, Patrick Cardinal, Najim Dehak, Stephan Vogel, James R. Glass:
A complete KALDI recipe for building Arabic speech recognition systems. 525-529 - Jan Trmal, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur, Pegah Ghahremani, Xiaohui Zhang, Vimal Manohar, Chunxi Liu, Aren Jansen, Dietrich Klakow, David Yarowsky, Florian Metze:
A keyword search system using open source software. 530-535 - Pallavi Baljekar, Jill Fain Lehman, Rita Singh:
Online word-spotting in continuous speech with recurrent neural networks. 536-541 - Rui Zhao, Jinyu Li, Yifan Gong:
Variable-activation and variable-input deep neural network for robust speech recognition. 542-547 - Vikramjit Mitra, Wen Wang, Horacio Franco:
Deep convolutional nets and robust features for reverberation-robust speech recognition. 548-553 - Zhaohan Daniel Guo, Gökhan Tür, Wen-tau Yih, Geoffrey Zweig:
Joint semantic utterance classification and slot filling with recursive neural networks. 554-559 - Mandy Korpusik, Nicole Schmidt, Jennifer Drexler, Scott Cyphers, James R. Glass:
Data collection and language understanding of food descriptions. 560-565 - Qi Li, Gökhan Tür, Dilek Hakkani-Tür, Xiang Li, Tim Paek, Asela Gunawardana, Chris Quirk:
Distributed open-domain conversational understanding framework with domain independent extractors. 566-571 - Anna Prokofieva, Dilek Hakkani-Tür, Malcolm Slaney:
Eye gaze for understanding conversational speech. 572-577 - Agustín Gravano, Stefan Benus, Rivka Levitan, Julia Hirschberg:
Three ToBI-based measures of prosodic entrainment and their correlations with speaker engagement. 578-583 - Yun-Nung Chen, William Yang Wang, Alexander I. Rudnicky:
Leveraging frame semantics and distributional semantics for unsupervised semantic slot induction in spoken dialogue systems. 584-589 - Yun-Nung Chen, Alexander I. Rudnicky:
Dynamically supporting unexplored domains in conversational interactions by enriching semantics with neural word embeddings. 590-595 - Georgia Athanasopoulou, Ioannis Klasinas, Spiros Georgiladakis, Elias Iosif, Alexandros Potamianos:
Using lexical, syntactic and semantic features for non-terminal grammar rule induction in Spoken Dialogue Systems. 596-601 - Deepak Ramachandran, Peter Z. Yeh, William Jarrold, Benjamin Douglas, Adwait Ratnaparkhi, Ronald Provine, Jeremy Mendel, Adam Emfield:
An end-to-end dialog system for TV program discovery. 602-607 - Nobal B. Niraula, Amanda Stent, Hyuckchul Jung, Giuseppe Di Fabbrizio, I. Dan Melamed, Vasile Rus:
Forms2Dialog: Automatic dialog generation for Web tasks. 608-613
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.