Duc Le
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
2020 – today
- 2024
- [c48]Duc Le, Bac Le:
Multi-scale and Multi-level Attention Based on External Knowledge in EHRs. ACIIDS (Companion 1) 2024: 113-125 - [c47]Trang Le, Daniel Lazar, Suyoun Kim, Shan Jiang, Duc Le, Adithya Sagar, Aleksandr Livshits, Ahmed Aly, Akshat Shrivastava:
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding. EMNLP (Findings) 2024: 14027-14038 - [c46]Julian D. Parker, Janne Spijkervet, Katerina Kosta, Furkan Yesiler, Boris Kuznetsov, Ju-Chiang Wang, Matt Avent, Jitong Chen, Duc Le:
STEMGEN: A Music Generation Model That Listens. ICASSP 2024: 1116-1120 - [c45]Minz Won, Yun-Ning Hung, Duc Le:
A Foundation Model for Music Informatics. ICASSP 2024: 1226-1230 - [i37]Trang Le, Daniel Lazar, Suyoun Kim, Shan Jiang, Duc Le, Adithya Sagar, Aleksandr Livshits, Ahmed Aly, Akshat Shrivastava:
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding. CoRR abs/2406.07823 (2024) - [i36]Ye Bai, Haonan Chen, Jitong Chen, Zhuo Chen, Yi Deng, Xiaohong Dong, Lamtharn Hantrakul, Weituo Hao, Qingqing Huang, Zhongyi Huang, Dongya Jia, Feihu La, Duc Le, Bochen Li, Chumin Li, Hui Li, Xingxing Li, Shouda Liu, Wei-Tsung Lu, Yiqing Lu, Andrew Shaw, Janne Spijkervet, Yakun Sun, Bo Wang, Ju-Chiang Wang, Yuping Wang, Yuxuan Wang, Ling Xu, Yifeng Yang, Chao Yao, Shuo Zhang, Yang Zhang, Yilin Zhang, Hang Zhao, Ziyi Zhao, Dejian Zhong, Shicen Zhou, Pei Zou:
Seed-Music: A Unified Framework for High Quality and Controlled Music Generation. CoRR abs/2409.09214 (2024) - 2023
- [j6]Duc Le, Sang Truong, Brijesh Patel, Donald A. Adjeroh, Ngan Le:
sCL-ST: Supervised Contrastive Learning With Semantic Transformations for Multiple Lead ECG Arrhythmia Classification. IEEE J. Biomed. Health Informatics 27(6): 2818-2828 (2023) - [c44]Duc Le, Frank Seide, Yuhao Wang, Yang Li, Kjell Schubert, Ozlem Kalinli, Michael L. Seltzer:
Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers. ICASSP 2023: 1-5 - [c43]Ke Li, Jay Mahadeokar, Jinxi Guo, Yangyang Shi, Gil Keren, Ozlem Kalinli, Michael L. Seltzer, Duc Le:
Improving fast-slow Encoder based Transducer with Streaming Deliberation. ICASSP 2023: 1-5 - [c42]Akshat Shrivastava, Suyoun Kim, Paden Tomasello, Ali Elkahky, Daniel Lazar, Trang Le, Shan Jiang, Duc Le, Aleksandr Livshits, Ahmed Aly:
ICASSP 2023 Spoken Language Understanding Grand Challenge. ICASSP 2023: 1-2 - [c41]Andros Tjandra, Nayan Singhal, David Zhang, Ozlem Kalinli, Abdelrahman Mohamed, Duc Le, Michael L. Seltzer:
Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities. ICASSP 2023: 1-5 - [c40]Mu Yang, Andros Tjandra, Chunxi Liu, David Zhang, Duc Le, Ozlem Kalinli:
Learning ASR Pathways: A Sparse Multilingual ASR Model. ICASSP 2023: 1-5 - [c39]Suyoun Kim, Akshat Shrivastava, Duc Le, Ju Lin, Ozlem Kalinli, Michael L. Seltzer:
Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding. INTERSPEECH 2023: 1119-1123 - [i35]Zhuangqun Huang, Gil Keren, Ziran Jiang, Shashank Jain, David Goss-Grubbs, Nelson Cheng, Farnaz Abtahi, Duc Le, David Zhang, Antony D'Avirro, Ethan Campbell-Taylor, Jessie Salas, Irina-Elena Veliche, Xi Chen:
Text Generation with Speech Synthesis for ASR Data Augmentation. CoRR abs/2305.16333 (2023) - [i34]Suyoun Kim, Akshat Shrivastava, Duc Le, Ju Lin, Ozlem Kalinli, Michael L. Seltzer:
Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding. CoRR abs/2307.12134 (2023) - [i33]Yun-Ning Hung, Ju-Chiang Wang, Minz Won, Duc Le:
Scaling Up Music Information Retrieval Training with Semi-Supervised Learning. CoRR abs/2310.01353 (2023) - [i32]Minz Won, Yun-Ning Hung, Duc Le:
A Foundation Model for Music Informatics. CoRR abs/2311.03318 (2023) - [i31]Julian D. Parker, Janne Spijkervet, Katerina Kosta, Furkan Yesiler, Boris Kuznetsov, Ju-Chiang Wang, Matt Avent, Jitong Chen, Duc Le:
StemGen: A music generation model that listens. CoRR abs/2312.08723 (2023) - [i30]Matthew Perez, Duc Le, Amrit Romana, Elise Jones, Keli Licata, Emily Mower Provost:
Seq2seq for Automatic Paraphasia Detection in Aphasic Speech. CoRR abs/2312.10518 (2023) - 2022
- [c38]Thinh Phan, Duc Le, Brijesh Patel, Donald A. Adjeroh, Jingxian Wu, Morten Olgaard Jensen, Ngan Le:
Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning. BHI 2022: 1-4 - [c37]Suyoun Kim, Ke Li, Lucas Kabela, Ron Huang, Jiedan Zhu, Ozlem Kalinli, Duc Le:
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition. EMNLP (Findings) 2022: 5717-5722 - [c36]Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer:
Neural-FST Class Language Model for End-to-End Speech Recognition. ICASSP 2022: 6107-6111 - [c35]Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer:
Streaming parallel transducer beam search with fast slow cascaded encoders. INTERSPEECH 2022: 2083-2087 - [c34]Duc Le, Akshat Shrivastava, Paden D. Tomasello, Suyoun Kim, Aleksandr Livshits, Ozlem Kalinli, Michael L. Seltzer:
Deliberation Model for On-Device Spoken Language Understanding. INTERSPEECH 2022: 3468-3472 - [c33]Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. INTERSPEECH 2022: 3978-3982 - [c32]Weiyi Zheng, Alex Xiao, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. INTERSPEECH 2022: 5135-5139 - [c31]Niko Moritz, Frank Seide, Duc Le, Jay Mahadeokar, Christian Fuegen:
An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition. SLT 2022: 324-330 - [c30]Paden Tomasello, Akshat Shrivastava, Daniel Lazar, Po-Chun Hsu, Duc Le, Adithya Sagar, Ali Elkahky, Jade Copet, Wei-Ning Hsu, Yossi Adi, Robin Algayres, Tu Anh Nguyen, Emmanuel Dupoux, Luke Zettlemoyer, Abdelrahman Mohamed:
Stop: A Dataset for Spoken Task Oriented Semantic Parsing. SLT 2022: 991-998 - [i29]Antoine Bruguier, Duc Le, Rohit Prabhavalkar, Dangna Li, Zhe Liu, Bo Wang, Eun Chang, Fuchun Peng, Ozlem Kalinli, Michael L. Seltzer:
Neural-FST Class Language Model for End-to-End Speech Recognition. CoRR abs/2201.11867 (2022) - [i28]Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L. Seltzer:
Streaming parallel transducer beam search with fast-slow cascaded encoders. CoRR abs/2203.15773 (2022) - [i27]Duc Le, Akshat Shrivastava, Paden Tomasello, Suyoun Kim, Aleksandr Livshits, Ozlem Kalinli, Michael L. Seltzer:
Deliberation Model for On-Device Spoken Language Understanding. CoRR abs/2204.01893 (2022) - [i26]Niko Moritz, Frank Seide, Duc Le, Jay Mahadeokar, Christian Fuegen:
An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition. CoRR abs/2204.08858 (2022) - [i25]Paden Tomasello, Akshat Shrivastava, Daniel Lazar, Po-Chun Hsu, Duc Le, Adithya Sagar, Ali Elkahky, Jade Copet, Wei-Ning Hsu, Yossef Mordechay, Robin Algayres, Tu Anh Nguyen, Emmanuel Dupoux, Luke Zettlemoyer, Abdelrahman Mohamed:
STOP: A dataset for Spoken Task Oriented Semantic Parsing. CoRR abs/2207.10643 (2022) - [i24]Mu Yang, Andros Tjandra, Chunxi Liu, David Zhang, Duc Le, John H. L. Hansen, Ozlem Kalinli:
Learning ASR pathways: A sparse multilingual ASR model. CoRR abs/2209.05735 (2022) - [i23]Thinh Phan, Duc Le, Brijesh Patel, Donald A. Adjeroh, Jingxian Wu, Morten Olgaard Jensen, Ngan Le:
Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning. CoRR abs/2210.06297 (2022) - [i22]Suyoun Kim, Ke Li, Lucas Kabela, Rongqing Huang, Jiedan Zhu, Ozlem Kalinli, Duc Le:
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition. CoRR abs/2211.00174 (2022) - [i21]Duc Le, Frank Seide, Yuhao Wang, Yang Li, Kjell Schubert, Ozlem Kalinli, Michael L. Seltzer:
Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers. CoRR abs/2211.00896 (2022) - [i20]Andros Tjandra, Nayan Singhal, David Zhang, Ozlem Kalinli, Abdelrahman Mohamed, Duc Le, Michael L. Seltzer:
Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities. CoRR abs/2211.05756 (2022) - 2021
- [j5]Shahbaz Ahmed, Duc Le, Lauri Sydänheimo, Leena Ukkonen, Toni Björninen:
Wearable Metasurface-Enabled Quasi-Yagi Antenna for UHF RFID Reader With End-Fire Radiation Along the Forearm. IEEE Access 9: 77229-77238 (2021) - [c29]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Chan, Frank Zhang, Duc Le, Mike Seltzer:
Emformer: Efficient Memory Transformer Based Acoustic Model for Low Latency Streaming Speech Recognition. ICASSP 2021: 6783-6787 - [c28]Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le:
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer. ICASSP 2021: 7333-7337 - [c27]Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. Interspeech 2021: 1772-1776 - [c26]Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding. Interspeech 2021: 1977-1981 - [c25]Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency. Interspeech 2021: 2042-2046 - [c24]Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. Interspeech 2021: 2107-2111 - [c23]Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. Interspeech 2021: 4553-4557 - [c22]Jay Mahadeokar, Yuan Shangguan, Duc Le, Gil Keren, Hang Su, Thong Le, Ching-Feng Yeh, Christian Fuegen, Michael L. Seltzer:
Alignment Restricted Streaming Recurrent Neural Network Transducer. SLT 2021: 52-59 - [c21]Chunxi Liu, Frank Zhang, Duc Le, Suyoun Kim, Yatharth Saraf, Geoffrey Zweig:
Improving RNN Transducer Based ASR with Auxiliary Tasks. SLT 2021: 172-179 - [c20]Duc Le, Gil Keren, Julian Chan, Jay Mahadeokar, Christian Fuegen, Michael L. Seltzer:
Deep Shallow Fusion for RNN-T Personalization. SLT 2021: 251-257 - [i19]Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding. CoRR abs/2104.02138 (2021) - [i18]Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency. CoRR abs/2104.02176 (2021) - [i17]Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. CoRR abs/2104.02194 (2021) - [i16]Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. CoRR abs/2104.02207 (2021) - [i15]Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. CoRR abs/2104.02232 (2021) - [i14]Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. CoRR abs/2110.05376 (2021) - [i13]Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. CoRR abs/2111.05948 (2021) - 2020
- [c19]Karine Ip, Abhijith Asok, Yijia Xu, Duc Le, Natalie Mionis, Roman Batoukov:
ML-Assisted Monitoring and Characterization of IoT Sensor Networks. EAIS 2020: 1-8 - [c18]Duc Le, Thilo Köhler, Christian Fuegen, Michael L. Seltzer:
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR. ICASSP 2020: 6869-6873 - [c17]Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-Based Acoustic Modeling for Hybrid Speech Recognition. ICASSP 2020: 6874-6878 - [c16]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer:
Weak-Attention Suppression for Transformer Based Speech Recognition. INTERSPEECH 2020: 4996-5000 - [i12]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer:
Weak-Attention Suppression For Transformer Based Speech Recognition. CoRR abs/2005.09137 (2020) - [i11]Matthew Perez, Wenyu Jin, Duc Le, Noelle Carlozzi, Praveen Dayalu, Angela Roberts, Emily Mower Provost:
Classification of Huntington Disease using Acoustic and Lexical Features. CoRR abs/2008.03367 (2020) - [i10]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Chan, Frank Zhang, Duc Le, Michael L. Seltzer:
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition. CoRR abs/2010.10759 (2020) - [i9]Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le:
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer. CoRR abs/2010.13878 (2020) - [i8]Jay Mahadeokar, Yuan Shangguan, Duc Le, Gil Keren, Hang Su, Thong Le, Ching-Feng Yeh, Christian Fuegen, Michael L. Seltzer:
Alignment Restricted Streaming Recurrent Neural Network Transducer. CoRR abs/2011.03072 (2020) - [i7]Chunxi Liu, Frank Zhang, Duc Le, Suyoun Kim, Yatharth Saraf, Geoffrey Zweig:
Improving RNN Transducer Based ASR with Auxiliary Tasks. CoRR abs/2011.03109 (2020) - [i6]Duc Le, Gil Keren, Julian Chan, Jay Mahadeokar, Christian Fuegen, Michael L. Seltzer:
Deep Shallow Fusion for RNN-T Personalization. CoRR abs/2011.07754 (2020)
2010 – 2019
- 2019
- [j4]Abigail J. Heithoff, Steven A. Totusek, Duc Le, Lucas Barwick, Gary Gensler, Donald R. Franklin, Allison C. Dye, Sanjit Pandey, Seth Sherman, Chittibabu Guda, Howard S. Fox:
The integrated National NeuroAIDS Tissue Consortium database: a rich platform for neuroHIV research. Database J. Biol. Databases Curation 2019: bay134 (2019) - [j3]Duc Le, Wenrui Huang, Elijah Johnson:
Neural network modeling of monthly salinity variations in oyster reef in Apalachicola Bay in response to freshwater inflow and winds. Neural Comput. Appl. 31(10): 6249-6259 (2019) - [c15]Duc Le, Xiaohui Zhang, Weiyi Zheng, Christian Fügen, Geoffrey Zweig, Michael L. Seltzer:
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition. ASRU 2019: 457-464 - [i5]Duc Le, Xiaohui Zhang, Weiyi Zheng, Christian Fügen, Geoffrey Zweig, Michael L. Seltzer:
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition. CoRR abs/1910.01493 (2019) - [i4]Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-based Acoustic Modeling for Hybrid Speech Recognition. CoRR abs/1910.09799 (2019) - [i3]Duc Le, Thilo Köhler, Christian Fuegen, Michael L. Seltzer:
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR. CoRR abs/1910.12612 (2019) - [i2]Ching-Feng Yeh, Jay Mahadeokar, Kaustubh Kalgaonkar, Yongqiang Wang, Duc Le, Mahaveer Jain, Kjell Schubert, Christian Fuegen, Michael L. Seltzer:
Transformer-Transducer: End-to-End Speech Recognition with Self-Attention. CoRR abs/1910.12977 (2019) - 2018
- [j2]Duc Le, Keli Licata, Emily Mower Provost:
Automatic quantitative analysis of spontaneous aphasic speech. Speech Commun. 100: 1-12 (2018) - [c14]Matthew Perez, Wenyu Jin, Duc Le, Noelle Carlozzi, Praveen Dayalu, Angela Roberts, Emily Mower Provost:
Classification of Huntington Disease Using Acoustic and Lexical Features. INTERSPEECH 2018: 1898-1902 - [i1]Duc Le:
Real-time Air Pollution prediction model based on Spatiotemporal Big data. CoRR abs/1805.00432 (2018) - 2017
- [b1]Duc Le:
Towards Automatic Speech-Language Assessment for Aphasia Rehabilitation. University of Michigan, USA, 2017 - [c13]Duc Le, Keli Licata, Emily Mower Provost:
Automatic Paraphasia Detection from Aphasic Speech: A Preliminary Study. INTERSPEECH 2017: 294-298 - [c12]Duc Le, Zakaria Aldeneh, Emily Mower Provost:
Discretized Continuous Speech Emotion Recognition with Multi-Task Deep Recurrent Neural Network. INTERSPEECH 2017: 1108-1112 - 2016
- [j1]Duc Le, Keli Licata, Carol Persad, Emily Mower Provost:
Automatic Assessment of Speech Intelligibility for Individuals With Aphasia. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 2187-2199 (2016) - [c11]John Gideon, Biqiao Zhang, Zakaria Aldeneh, Yelin Kim, Soheil Khorram, Duc Le, Emily Mower Provost:
Wild wild emotion: a multimodal ensemble approach. ICMI 2016: 501-505 - [c10]Duc Le, Emily Mower Provost:
Improving Automatic Recognition of Aphasic Speech with AphasiaBank. INTERSPEECH 2016: 2681-2685 - 2015
- [c9]Duc Le, Emily Mower Provost:
Data selection for acoustic emotion recognition: Analyzing and comparing utterance and sub-utterance selection strategies. ACII 2015: 146-152 - 2014
- [c8]Duc Le, Keli Licata, Elizabeth Mercado, Carol Persad, Emily Mower Provost:
Automatic analysis of speech quality for aphasia treatment. ICASSP 2014: 4853-4857 - [c7]Duc Le, Emily Mower Provost:
Modeling pronunciation, rhythm, and intonation for automatic assessment of speech quality in aphasia rehabilitation. INTERSPEECH 2014: 1563-1567 - [c6]Duc Le, Mohammad Amin Alipour, Rahul Gopinath, Alex Groce:
MuCheck: an extensible tool for mutation testing of haskell programs. ISSTA 2014: 429-432 - 2013
- [c5]Duc Le, Emily Mower Provost:
Emotion recognition from spontaneous speech using Hidden Markov models with deep belief networks. ASRU 2013: 216-221 - [c4]Je Hun Jeon, Duc Le, Rui Xia, Yang Liu:
A preliminary study of cross-lingual emotion recognition from speech: automatic classification versus human perception. INTERSPEECH 2013: 2837-2840 - 2011
- [c3]Duc Le, Eric Walkingshaw, Martin Erwig:
#ifdef confirmed harmful: Promoting understandable software variation. VL/HCC 2011: 143-150 - [c2]Duc Le:
Support for software variation editing. VL/HCC 2011: 233-234
2000 – 2009
- 2005
- [c1]Louis Luh, Willie Ng, Joseph F. Jensen, Duc Le, David L. Persechini, Stephen Thomas III, Charles H. Fields, James Lin:
A 10.24GSPS photonic sampled bandpass ΔΣ modulator direct-sampling at 12GHz. CICC 2005: 387-390
last updated on 2024-12-12 20:59 CET by the dblp team
all metadata released as open data under CC0 1.0 license