Ian McGraw
2020 – today
- 2023
- [c36] Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw:
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models. ICASSP 2023: 1-5
- [c35] Tom O'Malley, Shaojin Ding, Arun Narayanan, Quan Wang, Rajeev Rikhye, Qiao Liang, Yanzhang He, Ian McGraw:
Conditional Conformer: Improving Speaker Modulation For Single And Multi-User Speech Enhancement. ICASSP 2023: 1-5
- [c34] Weiran Wang, Ding Zhao, Shaojin Ding, Hao Zhang, Shuo-Yiin Chang, David Rybach, Tara N. Sainath, Yanzhang He, Ian McGraw, Shankar Kumar:
Multi-Output RNN-T Joint Networks for Multi-Task Learning of ASR and Auxiliary Tasks. ICASSP 2023: 1-5
- [i18] Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw:
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models. CoRR abs/2303.08343 (2023)
- 2022
- [c33] Tara N. Sainath, Yanzhang He, Arun Narayanan, Rami Botros, Weiran Wang, David Qiu, Chung-Cheng Chiu, Rohit Prabhavalkar, Alexander Gruenstein, Anmol Gulati, Bo Li, David Rybach, Emmanuel Guzman, Ian McGraw, James Qin, Krzysztof Choromanski, Qiao Liang, Robert David, Ruoming Pang, Shuo-Yiin Chang, Trevor Strohman, W. Ronny Huang, Wei Han, Yonghui Wu, Yu Zhang:
Improving The Latency And Quality Of Cascaded Encoders. ICASSP 2022: 8112-8116
- [c32] Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang, Rina Panigrahy, Qiao Liang, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman:
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes. INTERSPEECH 2022: 1706-1710
- [c31] Shaojin Ding, Rajeev Rikhye, Qiao Liang, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw:
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition. INTERSPEECH 2022: 3744-3748
- [c30] Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw:
Closing the Gap Between Single-User and Multi-User VoiceFilter-Lite. Odyssey 2022: 294-300
- [i17] Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw:
Closing the Gap between Single-User and Multi-User VoiceFilter-Lite. CoRR abs/2202.12169 (2022)
- [i16] Shaojin Ding, Rajeev Rikhye, Qiao Liang, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw:
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition. CoRR abs/2204.03793 (2022)
- [i15] Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang, Rina Panigrahy, Qiao Liang, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman:
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes. CoRR abs/2204.06164 (2022)
- 2021
- [c29] Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw:
Multi-User Voicefilter-Lite via Attentive Speaker Embedding. ASRU 2021: 275-282
- [c28] David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw:
Learning Word-Level Confidence for Subword End-To-End ASR. ICASSP 2021: 6393-6397
- [c27] David Qiu, Yanzhang He, Qiujia Li, Yu Zhang, Liangliang Cao, Ian McGraw:
Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction. Interspeech 2021: 4074-4078
- [c26] Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ding Zhao, Yiteng Huang, Arun Narayanan, Ian McGraw:
Personalized Keyphrase Detection Using Speaker and Environment Information. Interspeech 2021: 4204-4208
- [i14] David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw:
Learning Word-Level Confidence For Subword End-to-End ASR. CoRR abs/2103.06716 (2021)
- [i13] David Qiu, Yanzhang He, Qiujia Li, Yu Zhang, Liangliang Cao, Ian McGraw:
Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction. CoRR abs/2104.12870 (2021)
- [i12] Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ding Zhao, Yiteng Huang, Arun Narayanan, Ian McGraw:
Personalized Keyphrase Detection using Speaker and Environment Information. CoRR abs/2104.13970 (2021)
- [i11] Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw:
Multi-user VoiceFilter-Lite via Attentive Speaker Embedding. CoRR abs/2107.01201 (2021)
- 2020
- [c25] Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-Yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alexander Gruenstein, Ke Hu, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirkó Visontai, Yonghui Wu, Yu Zhang, Ding Zhao:
A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency. ICASSP 2020: 6059-6063
- [c24] Yuan Shangguan, Kate Knister, Yanzhang He, Ian McGraw, Françoise Beaufays:
Analyzing the Quality and Stability of a Streaming End-to-End On-Device Speech Recognizer. INTERSPEECH 2020: 591-595
- [i10] Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-Yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alexander Gruenstein, Ke Hu, Minho Jin, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirkó Visontai, Yonghui Wu, Yu Zhang, Ding Zhao:
A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency. CoRR abs/2003.12710 (2020)
- [i9] Yuan Shangguan, Kate Knister, Yanzhang He, Ian McGraw, Françoise Beaufays:
Analyzing the Quality and Stability of a Streaming End-to-End On-Device Speech Recognizer. CoRR abs/2006.01416 (2020)
2010 – 2019
- 2019
- [c23] Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition for Mobile Devices. ICASSP 2019: 6381-6385
- [c22] Tara N. Sainath, Ruoming Pang, David Rybach, Yanzhang He, Rohit Prabhavalkar, Wei Li, Mirkó Visontai, Qiao Liang, Trevor Strohman, Yonghui Wu, Ian McGraw, Chung-Cheng Chiu:
Two-Pass End-to-End Speech Recognition. INTERSPEECH 2019: 2773-2777
- [i8] Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia Xu Chen, Ye Jia, Anjuli Kannan, Tara N. Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George F. Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel Bacchiani, Thomas B. Jablin, Robert Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon:
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling. CoRR abs/1902.08295 (2019)
- [i7] Tara N. Sainath, Ruoming Pang, David Rybach, Yanzhang He, Rohit Prabhavalkar, Wei Li, Mirkó Visontai, Qiao Liang, Trevor Strohman, Yonghui Wu, Ian McGraw, Chung-Cheng Chiu:
Two-Pass End-to-End Speech Recognition. CoRR abs/1908.10992 (2019)
- [i6] Yuan Shangguan, Jian Li, Liang Qiao, Raziel Alvarez, Ian McGraw:
Optimizing Speech Recognition For The Edge. CoRR abs/1909.12408 (2019)
- 2018
- [i5] Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition For Mobile Devices. CoRR abs/1811.06621 (2018)
- 2017
- [c21] Yanzhang He, Rohit Prabhavalkar, Kanishka Rao, Wei Li, Anton Bakhtin, Ian McGraw:
Streaming small-footprint keyword spotting using sequence-to-sequence models. ASRU 2017: 474-481
- [i4] Yanzhang He, Rohit Prabhavalkar, Kanishka Rao, Wei Li, Anton Bakhtin, Ian McGraw:
Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models. CoRR abs/1710.09617 (2017)
- 2016
- [c20] Ian McGraw, Rohit Prabhavalkar, Raziel Alvarez, Montse Gonzalez Arenas, Kanishka Rao, David Rybach, Ouais Alsharif, Hasim Sak, Alexander Gruenstein, Françoise Beaufays, Carolina Parada:
Personalized speech recognition on mobile devices. ICASSP 2016: 5955-5959
- [c19] Rohit Prabhavalkar, Ouais Alsharif, Antoine Bruguier, Ian McGraw:
On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition. ICASSP 2016: 5970-5974
- [i3] Ian McGraw, Rohit Prabhavalkar, Raziel Alvarez, Montse Gonzalez Arenas, Kanishka Rao, David Rybach, Ouais Alsharif, Hasim Sak, Alexander Gruenstein, Françoise Beaufays, Carolina Parada:
Personalized Speech recognition on mobile devices. CoRR abs/1603.03185 (2016)
- [i2] Rohit Prabhavalkar, Ouais Alsharif, Antoine Bruguier, Ian McGraw:
On the Compression of Recurrent Neural Networks with an Application to LVCSR acoustic modeling for Embedded Speech Recognition. CoRR abs/1603.08042 (2016)
- 2015
- [c18] Christophe Van Gysel, Leonid Velikovich, Ian McGraw, Françoise Beaufays:
Garbage modeling for on-device speech recognition. INTERSPEECH 2015: 2127-2131
- 2014
- [c17] David Harwath, Alexander Gruenstein, Ian McGraw:
Choosing useful word alternates for automatic speech recognition correction interfaces. INTERSPEECH 2014: 949-953
- 2013
- [j3] Ian McGraw, Ibrahim Badr, James R. Glass:
Learning Lexicons From Speech Using a Pronunciation Mixture Model. IEEE Trans. Speech Audio Process. 21(2): 357-366 (2013)
- [c16] Aren Jansen, Emmanuel Dupoux, Sharon Goldwater, Mark Johnson, Sanjeev Khudanpur, Kenneth Church, Naomi Feldman, Hynek Hermansky, Florian Metze, Richard C. Rose, Mike Seltzer, Pascal Clark, Ian McGraw, Balakrishnan Varadarajan, Erin Bennett, Benjamin Börschinger, Justin T. Chiu, Ewan Dunbar, Abdellah Fourtassi, David Harwath, Chia-ying Lee, Keith D. Levin, Atta Norouzian, Vijayaditya Peddinti, Rachael Richardson, Thomas Schatz, Samuel Thomas:
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition. ICASSP 2013: 8111-8115
- 2012
- [b1] Ian McGraw:
Crowd-supervised training of spoken language systems. Massachusetts Institute of Technology, Cambridge, MA, USA, 2012
- [c15] Ian McGraw, Alexander Gruenstein:
Estimating Word-Stability During Incremental Speech Recognition. INTERSPEECH 2012: 1019-1022
- [c14] Jingjing Liu, Scott Cyphers, Panupong Pasupat, Ian McGraw, James R. Glass:
A Conversational Movie Search System Based on Conditional Random Fields. INTERSPEECH 2012: 2454-2457
- [c13] Ian McGraw, Scott Cyphers, Panupong Pasupat, Jingjing Liu, James R. Glass:
Automating Crowd-supervised Learning for Spoken Language Systems. INTERSPEECH 2012: 2474-2477
- [i1] Gal Elidan, Ian McGraw, Daphne Koller:
Residual Belief Propagation: Informed Scheduling for Asynchronous Message Passing. CoRR abs/1206.6837 (2012)
- 2011
- [c12] Ibrahim Badr, Ian McGraw, James R. Glass:
Pronunciation Learning from Continuous Speech. INTERSPEECH 2011: 549-552
- [c11] Ian McGraw, James R. Glass, Stephanie Seneff:
Growing a Spoken Language Interface on Amazon Mechanical Turk. INTERSPEECH 2011: 3057-3060
- 2010
- [j2] Ariel Jaimovich, Ofer Meshi, Ian McGraw, Gal Elidan:
FastInf: An Efficient Approximate Inference Library. J. Mach. Learn. Res. 11: 1733-1736 (2010)
- [c10] Ibrahim Badr, Ian McGraw, James R. Glass:
Learning new word pronunciations from spoken examples. INTERSPEECH 2010: 2294-2297
- [c9] Ian McGraw, Chia-ying Lee, I. Lee Hetherington, Stephanie Seneff, James R. Glass:
Collecting Voices from the Cloud. LREC 2010
2000 – 2009
- 2009
- [j1] Ian McGraw, Brandon Yoshimoto, Stephanie Seneff:
Speech-enabled card games for incidental vocabulary acquisition in a foreign language. Speech Commun. 51(10): 1006-1023 (2009)
- [c8] Ian McGraw, Alexander Gruenstein, Andrew M. Sutherland:
A self-labeling speech corpus: collecting spoken words with an online educational game. INTERSPEECH 2009: 3031-3034
- [c7] Alexander Gruenstein, Ian McGraw, Andrew M. Sutherland:
A self-transcribing speech corpus: collecting continuous speech with an online educational game. SLaTE 2009: 109-112
- [c6] Alexander Gruenstein, Ian McGraw, Andrew M. Sutherland:
Voice race and voice scatter: online educational games for collecting orthographically-labeled speech data. SLaTE 2009
- [c5] Brandon Yoshimoto, Ian McGraw, Stephanie Seneff:
Rainbow rummy: a web-based game for vocabulary acquisition using computer-directed speech. SLaTE 2009: 5-8
- 2008
- [c4] Ian McGraw, Stephanie Seneff:
Speech-enabled Card Games for Language Learners. AAAI 2008: 778-783
- [c3] Alexander Gruenstein, Ian McGraw, Ibrahim Badr:
The WAMI toolkit for developing, deploying, and evaluating web-accessible multimodal interfaces. ICMI 2008: 141-148
- 2007
- [c2] Ian McGraw, Stephanie Seneff:
Immersive second language acquisition in narrow domains: a prototype ISLAND dialogue system. SLaTE 2007: 84-87
- 2006
- [c1] Gal Elidan, Ian McGraw, Daphne Koller:
Residual Belief Propagation: Informed Scheduling for Asynchronous Message Passing. UAI 2006
last updated on 2024-08-05 20:24 CEST by the dblp team
all metadata released as open data under CC0 1.0 license