default search action

combined dblp search
author search
venue search
publication search

ask others

Bo Li 0028

> Home > Persons

Person information

affiliation: Google Inc., USA
affiliation (former): National University of Singapore, Singapore

Other persons with the same name

see FAQ

Bo Li — disambiguation page
Bo Li 0001 — Hong Kong University of Science and Technology, Department of Computer Science and Engineering, Hong Kong (and 4 more)
Bo Li 0002 — Wuhan University of Science and Technology, School of Computer Science and Technology, Wuhan, China (and 5 more)
Bo Li 0003 — Northeastern University, School of Information Science and Engineering, Shenyang, China
Bo Li 0004 — Northwestern Polytechnical University, School of Electronics and Information, Xi'an, China (and 2 more)
Bo Li 0005 — Beihang University, Beijing Advanced Innovation Center for Big Data and Brain Computing and State Key Laboratory of Software Development Environment, Beijing, China (and 1 more)
Bo Li 0006 — Beihang University, School of Computer Science and Engineering, Beijing Key Laboratory of Digital Media and State Key Laboratory of Virtual Reality Technology and Systems, Beijing, China (and 1 more)
Bo Li 0007 — University of California San Diego, Department of Mathematics and Center for Theoretical Biological Physics, San Diego, CA, USA (and 1 more)
Bo Li 0008 — Qingdao University of Science and Technology
Bo Li 0009 — Chinese Academy of Sciences, Institute of Computing Technology, National Research Center for Intelligent Computing Systems, Beijing, China

Bo Li 0010 — University of Essex
Bo Li 0011 — University of Florida, Computer and Information Science and Engineering Department, Gainesville, FL, USA
Bo Li 0012 — Université Joseph Fourier, Grenoble
Bo Li 0013 — University of Southern Mississippi, School of Computing, Long Beach, USA (and 2 more)
Bo Li 0014 — Samsung R&D, Mountain View, CA, USA (and 1 more)
Bo Li 0015 — Harvard Medical School, Boston, MA, USA (and 2 more)
Bo Li 0016 — Beijing Normal University, Faculty of Geographical Science, College of Resources Science and Technology, State Key Laboratory of Earth Surface Processes and Resource Ecology, Beijing, China
Bo Li 0017 — Beijing Jiao Tong University, State Key Lab. of Rail Traffic Control & Safety, Beijing, China
Bo Li 0018 — Baidu Inc., Institute of Deep Learning, Beijing, China (and 1 more)
Bo Li 0019 — Purdue University, Department of Statistics, West Lafayette, IN, USA
Bo Li 0020 — Washington University, St. Louis, MO, USA
Bo Li 0021 — Auburn University
Bo Li 0022 — Sun Yet-Sen University, Zhongshan School of Medicine, China (and 1 more)
Bo Li 0023 — Central China Normal University, School of Educational Information Technology, Wuhan, China (and 2 more)
Bo Li 0024 — Ningbo Supply Chain Innovation Institute China, China
Bo Li 0025 — Yunnan University, School of Information Science and Engineering, Kunming, China
Bo Li 0026 — University of Chicago, Department of Computer Science, IL, USA (and 4 more)
Bo Li 0027 — Qualcomm, San Diego, CA, USA (and 1 more)
Bo Li 0029 — CAS, Institute of Automation, State Key Laboratory of Management and Control for Complex Systems, Beijing, China
Bo Li 0030 — Xi'an Hi-Tech Research Institute, Xi'an, China (and 1 more)
Bo Li 0031 — Beijing Institute of Technology, Beijing Lab of Intelligent Information Technology, Beijing, China
Bo Li 0032 — Virginia Tech, Blacksburg, VA, USA
Bo Li 0033 — University of Maryland, Electrical and Computer Engineering Department, College Park, MD, USA
Bo Li 0034 — Harbin Institute of Technology, School of Information and Electrical Engineering, Weihai, China
Bo Li 0036 — Clemson University, SC, USA
Bo Li 0037 — Hong Kong Polytechnic University, Department of Computing, Hong Kong (and 4 more)
Bo Li 0038 — Nanjing University, School of Electronic Science and Engineering, China
Bo Li 0039 — Chinese Academy of Sciences, Key Laboratory of Mathematics Mechanization, Beijing, China (and 1 more)
Bo Li 0040 — Lanzhou Jiaotong University, School of Automation & Electrical Engineering, China
Bo Li 0041 — Northeastern University, Shenyang, China
Bo Li 0042 — China University of Petroleum, Department of Software Engineering, Qingdao, China
Bo Li 0043 — Peking University, School of Software and Microelectronics, Beijing, China
Bo Li 0045 — Chongqing University, School of Electrical Engineering, State Key Laboratory of Power Transmission Equipment & System Security and New Technology, China
Bo Li 0046 — Shanghai Jiao Tong University, School of Medicine, Shanghai Ninth People's Hospital, China
Bo Li 0047 — Loughborough University, UK
Bo Li 0048 — Florida Atlantic University, Boca Raton, FL, USA
Bo Li 0050 — Nanjing University of Finance and Economics, School of Applied Mathematics, China (and 1 more)
Bo Li 0051 — Chinese Academy of Sciences, Institute of Microelectronics, Beijing, China (and 1 more)
Bo Li 0052 — Zhejiang University, College of Information Science and Electronic Engineering, Hangzhou, China
Bo Li 0053 — Southwest University, College of Electronic and Information Engineering, Chongqing, China (and 1 more)
Bo Li 0054 — Southwest Jiaotong University, School of Information Science and Technology, Chengdu, China (and 1 more)
Bo Li 0055 — Chinese Academy of Sciences, Institute of Computer Application, Chengdu, China
Bo Li 0056 — Shanghai Jiao Tong University, School of Electronic, Information, and Electrical Engineering, Department of Micro/Nano Electronics, China
Bo Li 0057 — Teesside University, School of Science Engineering and Design, Middlesbrough, UK
Bo Li 0058 — University of Georgia, Athens, GA, USA
Bo Li 0059 — Liaoning University of Technology, School of Electronics and Information Engineering, Jinzhou, China (and 1 more)
Bo Li 0060 — Guangdong University of Technology, School of Automation, Guangzhou, China
Bo Li 0061 — Nanjing University, State Key Laboratory for Novel Software Technology, Nanjing, China
Bo Li 0062 — Nankai University, College of Computer and Control Engineering, Tianjin, China
Bo Li 0063 — Chinese Academy of Sciences, Institute of Information Engineering, Beijing, China (and 1 more)
Bo Li 0064 — Tsinghua University, School of Economics and Management, Beijing, China
Bo Li 0065 — Southwest University of Science and Technology, Mianyang, China (and 1 more)
Bo Li 0066 — Xidian University, School of Mathematics and Statistics, Xian, China
Bo Li 0067 — Shandong University of Technology, School of Transportation and Vehicle Engineering, Zibo, China
Bo Li 0068 — Harbin Institute of Technology, Reliability Institute for Electric Apparatus and Electronics, China
Bo Li 0069 — Shanghai Maritime University, Institute of Logistics Science and Engineering, China
Bo Li 0070 — Nanjing Agricultural University, College of Engineering, China
Bo Li 0071 — Zhejiang University, School of Aeronautics and Astronautics, Hangzhou, China
Bo Li 0072 — University of Electronic Science and Technology of China, School of Astronautics and Aeronautics, Chengdu, China
Bo Li 0073 — China NARI Group Corporation, State Grid Electronic Power Research Institute, Nanjing, China
Bo Li 0074 — Sichuan University, College of Electronics and Information Engineering, Chengdu, China
Bo Li 0075 — Glodon Technology Inc., Xian, China
Bo Li 0076 — Changchun University of Science and Technology, School of Computer Science and Technology, China
Bo Li 0077 — Dalian University of Technology, School of Control Science and Engineering, China
Bo Li 0078 — University of California, San Diego, Department of Mathematics, USA
Bo Li 0079 — National University of Singapore, Singapore (and 1 more)
Bo Li 0080 — University of California, Berkeley, CA, USA
Bo Li 0081 — Xidian University, Xi'an, China
Bo Li 0082 — Aston University, Birmingham, UK (and 2 more)
Bo Li 0084 — Sichuan University, Institute for Disaster Management and Reconstruction, Chengdu, China
Bo Li 0085 — Tianjin University, College of Management and Economics, China
Bo Li 0086 — Harbin Institute of Technology, School of Computer Science and Technology, Harbin, China
Bo Li 0087 — Shanghai University of Sport, School of Physical Education and training, Shanghai, China
Bo Li 0088 — Erasmus MC, Department of Radiology and Nuclear Medicine, Rotterdam, Netherlands (and 1 more)
Bo Li 0089 — Northwestern Polytechnical University, School of Electronics and Information, Xi'an, China (and 2 more)
Bo Li 0090 — Northwestern Polytechnical University, School of Electronics and Information, Xi'an, China
Bo Li 0091 — Northwestern Polytechnical University, School of Mechanical Engineering, Xi'an, China
Bo Li 0092 — Tianjin University, School of Electrical and Information Engineering, Tianjin, China
Bo Li 0093 — Chongqing University, School of Pharmaceutical Sciences and Collaborative Innovation Center for Brain Science, Innovative Drug Research and Bioinformatics Group, Chongqing, China
Bo Li 0094 — Chongqing University, School of Resources and Safety Engineering, State Key Laboratory of Coal Mine Disaster Dynamics and Control, Chongqing, China
Bo Li 0095 — Tsinghua University, Department of Engineering Mechanics, Institute of Biomechanics and Medical Engineering, Beijing, China
Bo Li 0096 — Central China Normal University, School of Mathematics and Statistics, Wuhan, China
Bo Li 0097 — Chongqing Normal University, College of Life Sciences, Chongqing, China
Bo Li 0098 — Beihang University, Sino-German Joint Software Institute, Beijing, China (and 2 more)
Bo Li 0099 — Peking University, National Engineering Research Center for Software Engineering, Beijing, China (and 1 more)
Bo Li 0100 — University of Texas Southwestern Medical Center, Department of Bioinformatics, Dallas, TX, USA
Bo Li 0102 — Xidian University, School of Telecommunications Engineering, State Key Laboratory of Integrated Services Networks, Xi'an, China
Bo Li 0103 — Swinburne University of Technology, School of Software and Electrical Engineering, Melbourne, Australia
Bo Li 0104 — Communication University of China, School of Information and Communication Engineering, Beijing, China
Bo Li 0105 — Shandong Technology and Business University, School of Computer Science and Technology and School of Statistics, Yantai, China
Bo Li 0106 — Dalian Polytechnic University, School of Information Science and Engineering, Dalian, China
Bo Li 0107 — Guizhou University, Key Laboratory of Karst Georesources and Environment, Ministry of Education, Guiyang, China
Bo Li 0108 — Xi'an Jiaotong University, School of Mechanical Engineering, State Key Laboratory for Mechanical Manufacturing Systems Engineering and Shaanxi Key Lab of Intelligent Robots, Xi'an, China
Bo Li 0109 — Singapore University of Technology and Design, Department of Engineering Product Development, Singapore
Bo Li 0110 — Xi'an University of Posts and Telecommunications, School of Communication and Information Engineering, Xi'an, China
Bo Li 0111 — South China University of Technology, School of Electronic and Information Engineering, Guangzhou, China (and 1 more)
Bo Li 0112 — Jiangsu University of Technology, School of Electrical and Information Engineering, Changzhou, China
Bo Li 0113 — Jiangxi University of Science and Technology, Software School, Nanchang, China
Bo Li 0114 — SenseTime Group Limited, Beijing, China
Bo Li 0115 — Tencent, Youtu Lab, Shanghai, China
Bo Li 0116 — Wuhan University of Technology, Institute of Intelligent Manufacturing and Control, Wuhan, China
Bo Li 0117 — Sun Yat-sen University, Guangdong Key Laboratory of Big Data Analysis and Processing, Guangzhou, China
Bo Li 0118 — Cerence Inc., Burlington, MA, USA (and 4 more)
Bo Li 0119 — Technical University of Denmark
Bo Li 0120 — Guangdong Ocean University, Naval Architecture and Shipping College, Zhanjiang, Guangdong, China (and 1 more)
Bo Li 0121 — Alibaba Group Inc., Machine Intelligence Technology Lab, Hangzhou, China
Bo Li 0122 — Nanjing University of Information Science and Technology, School of Computer and Software, China
Bo Li 0123 — Northwestern Polytechnical University, School of Electronics and Information, Xi'an, China
Bo Li 0124 — Anhui University of Finance and Economics, School of Finance, Bengbu, China (and 1 more)
Bo Li 0125 — Nanyang Technological University, S-Lab, Singapore
Bo Li 0126 — Tongji University, Shanghai, China
Bo Li 0127 — ShangHai DianJi University, School of Electronic Infomation Engineering, China (and 1 more)

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c75]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/WuLZCLBSW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/WuLZCLBSW24
Wen Wu, Bo Li, Chao Zhang, Chung-Cheng Chiu, Qiujia Li, Junwen Bai, Tara N. Sainath, Philip C. Woodland:
Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation. ACL (1) 2024: 2078-2093
[c74]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SimHMSSM0S24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SimHMSSM0S24
Khe Chai Sim, Zhouyuan Huo, Tsendsuren Munkhdalai, Nikhil Siddhartha, Adam Stooke, Zhong Meng, Bo Li, Tara N. Sainath:
A Comparison of Parameter-Efficient ASR Domain Adaptation Methods for Universal Speech and Language Models. ICASSP 2024: 6900-6904
[c73]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DingQRHRLPWSHLY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DingQRHRLPWSHLY24
Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Zhonglin Han, Jian Li, Amir Yazdanbakhsh, Shivani Agrawal:
USM-Lite: Quantization and Sparsity Aware Fine-Tuning for Speech Recognition with Universal Speech Models. ICASSP 2024: 10756-10760
[c72]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Bai0LSS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Bai0LSS24
Junwen Bai, Bo Li, Qiujia Li, Tara N. Sainath, Trevor Strohman:
Efficient Adapter Finetuning for Tail Languages in Streaming Multilingual ASR. ICASSP 2024: 10841-10845
[c71]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/WangPSMHLS0QCSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/WangPSMHLS0QCSZ24
Weiran Wang, Rohit Prabhavalkar, Haozhe Shan, Zhong Meng, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Chengjian Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Speech Recognition Models with Time Reduction. NAACL-HLT 2024: 6206-6217
[i46]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-08992
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-08992
Junwen Bai, Bo Li, Qiujia Li, Tara N. Sainath, Trevor Strohman:
Efficient Adapter Finetuning for Tail Languages in Streaming Multilingual ASR. CoRR abs/2401.08992 (2024)
[i45]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-12862
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-12862
Wen Wu, Bo Li, Chao Zhang, Chung-Cheng Chiu, Qiujia Li, Junwen Bai, Tara N. Sainath, Philip C. Woodland:
Handling Ambiguity in Emotion: From Out-of-Domain Detection to Distribution Estimation. CoRR abs/2402.12862 (2024)
2023
[c70]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HuSLZCWZL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HuSLZCWZL23
Ke Hu, Tara N. Sainath, Bo Li, Yu Zhang, Yong Cheng, Tao Wang, Yujing Zhang, Frederick Liu:
Improving Multilingual and Code-Switching ASR Using Large Language Model Generated Text. ASRU 2023: 1-7
[c69]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChangZSLS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChangZSLS23
Shuo-Yiin Chang, Chao Zhang, Tara N. Sainath, Bo Li, Trevor Strohman:
Context-Aware end-to-end ASR Using Self-Attentive Embedding and Tensor Fusion. ICASSP 2023: 1-5
[c68]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuSLDHDZCCS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuSLDHDZCCS23
Ke Hu, Tara N. Sainath, Bo Li, Nan Du, Yanping Huang, Andrew M. Dai, Yu Zhang, Rodrigo Cabrera, Zhifeng Chen, Trevor Strohman:
Massively Multilingual Shallow Fusion with Large Language Models. ICASSP 2023: 1-5
[c67]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuoSLHSS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuoSLHSS23
Zhouyuan Huo, Khe Chai Sim, Bo Li, Dongseong Hwang, Tara N. Sainath, Trevor Strohman:
Resource-Efficient Transfer Learning from Speech Foundation Model Using Hierarchical Feature Fusion. ICASSP 2023: 1-5
[c66]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiHHBPSSZHSB23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiHHBPSSZHSB23
Bo Li, Dongseong Hwang, Zhouyuan Huo, Junwen Bai, Guru Prakash, Tara N. Sainath, Khe Chai Sim, Yu Zhang, Wei Han, Trevor Strohman, Françoise Beaufays:
Efficient Domain Adaptation for Speech Foundation Models. ICASSP 2023: 1-5
[c65]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengWPSCVZLRR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengWPSCVZLRR23
Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. ICASSP 2023: 1-5
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangLZCPSS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangLZCPSS23
Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Rohit Prabhavalkar, Tara N. Sainath, Trevor Strohman:
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition. ICASSP 2023: 1-5
[c63]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangLZCSSL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangLZCSSL23
Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. ICASSP 2023: 1-5
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangLSSC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangLSSC23
Chao Zhang, Bo Li, Tara N. Sainath, Trevor Strohman, Shuo-Yiin Chang:
UML: A Universal Monolingual Output Layer For Multilingual Asr. ICASSP 2023: 1-5
[c61]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenY00CCPLS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenY00CCPLS23
Zih-Ching Chen, Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Shuo-Yiin Chang, Rohit Prabhavalkar, Hung-yi Lee, Tara N. Sainath:
How to Estimate Model Transferability of Pre-Trained Speech Models? INTERSPEECH 2023: 456-460
[c60]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Hu0S0B23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Hu0S0B23
Ke Hu, Bo Li, Tara N. Sainath, Yu Zhang, Françoise Beaufays:
Mixture-of-Expert Conformer for Streaming Multilingual ASR. INTERSPEECH 2023: 3327-3331
[c59]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Li0HSM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Li0HSM23
Qiujia Li, Bo Li, Dongseong Hwang, Tara N. Sainath, Pedro Moreno Mengibar:
Modular Domain Adaptation for Conformer-Based Streaming ASR. INTERSPEECH 2023: 3357-3361
[c58]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/LeiBBALZ0ZWLZC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LeiBBALZ0ZWLZC23
Tao Lei, Junwen Bai, Siddhartha Brahma, Joshua Ainslie, Kenton Lee, Yanqi Zhou, Nan Du, Vincent Y. Zhao, Yuexin Wu, Bo Li, Yu Zhang, Ming-Wei Chang:
Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference. NeurIPS 2023
[i44]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-07851
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-07851
Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Rohit Prabhavalkar, Tara N. Sainath, Trevor Strohman:
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition. CoRR abs/2301.07851 (2023)
[i43]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-01496
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-01496
Bo Li, Dongseong Hwang, Zhouyuan Huo, Junwen Bai, Guru Prakash, Tara N. Sainath, Khe Chai Sim, Yu Zhang, Wei Han, Trevor Strohman, Françoise Beaufays:
Efficient Domain Adaptation for Speech Foundation Models. CoRR abs/2302.01496 (2023)
[i42]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-08583
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08583
Zhong Meng, Weiran Wang, Rohit Prabhavalkar, Tara N. Sainath, Tongzhou Chen, Ehsan Variani, Yu Zhang, Bo Li, Andrew Rosenberg, Bhuvana Ramabhadran:
JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. CoRR abs/2302.08583 (2023)
[i41]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-08917
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08917
Ke Hu, Tara N. Sainath, Bo Li, Nan Du, Yanping Huang, Andrew M. Dai, Yu Zhang, Rodrigo Cabrera, Zhifeng Chen, Trevor Strohman:
Massively Multilingual Shallow Fusion with Large Language Models. CoRR abs/2302.08917 (2023)
[i40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-11186
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-11186
Chao Zhang, Bo Li, Tara N. Sainath, Trevor Strohman, Shuo-Yiin Chang:
UML: A Universal Monolingual Output Layer for Multilingual ASR. CoRR abs/2302.11186 (2023)
[i39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-01037
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-01037
Yu Zhang, Wei Han, James Qin, Yongqiang Wang, Ankur Bapna, Zhehuai Chen, Nanxin Chen, Bo Li, Vera Axelrod, Gary Wang, Zhong Meng, Ke Hu, Andrew Rosenberg, Rohit Prabhavalkar, Daniel S. Park, Parisa Haghani, Jason Riesa, Ginger Perng, Hagen Soltau, Trevor Strohman, Bhuvana Ramabhadran, Tara N. Sainath, Pedro J. Moreno, Chung-Cheng Chiu, Johan Schalkwyk, Françoise Beaufays, Yonghui Wu:
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages. CoRR abs/2303.01037 (2023)
[i38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-04947
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-04947
Tao Lei, Junwen Bai, Siddhartha Brahma, Joshua Ainslie, Kenton Lee, Yanqi Zhou, Nan Du, Vincent Y. Zhao, Yuexin Wu, Bo Li, Yu Zhang, Ming-Wei Chang:
Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference. CoRR abs/2304.04947 (2023)
[i37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13408
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13408
Qiujia Li, Bo Li, Dongseong Hwang, Tara N. Sainath, Pedro Moreno Mengibar:
Modular Domain Adaptation for Conformer-Based Streaming ASR. CoRR abs/2305.13408 (2023)
[i36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-15663
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-15663
Ke Hu, Bo Li, Tara N. Sainath, Yu Zhang, Françoise Beaufays:
Mixture-of-Expert Conformer for Streaming Multilingual ASR. CoRR abs/2305.15663 (2023)
[i35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-01015
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-01015
Zih-Ching Chen, Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Shuo-Yiin Chang, Rohit Prabhavalkar, Hung-yi Lee, Tara N. Sainath:
How to Estimate Model Transferability of Pre-Trained Speech Models? CoRR abs/2306.01015 (2023)
[i34]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-12963
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-12963
Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Models for Short Search Queries. CoRR abs/2309.12963 (2023)
[i33]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-08553
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-08553
Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Shivani Agrawal, Zhonglin Han, Jian Li, Amir Yazdanbakhsh:
USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models. CoRR abs/2312.08553 (2023)
2022
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/ZhangPHQGSJXHWZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/ZhangPHQGSJXHWZ22
Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu:
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition. IEEE J. Sel. Top. Signal Process. 16(6): 1519-1532 (2022)
[c57]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiPZSSHZFGP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiPZSSHZFGP22
Bo Li, Ruoming Pang, Yu Zhang, Tara N. Sainath, Trevor Strohman, Parisa Haghani, Yun Zhu, Brian Farris, Neeraj Gaur, Manasa Prasad:
Massively Multilingual ASR: A Lifelong Learning Solution. ICASSP 2022: 6397-6401
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BaiLZBSSS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BaiLZBSSS22
Junwen Bai, Bo Li, Yu Zhang, Ankur Bapna, Nikhil Siddhartha, Khe Chai Sim, Tara N. Sainath:
Joint Unsupervised and Supervised Training for Multilingual ASR. ICASSP 2022: 6402-6406
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SainathHNBWQCPG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathHNBWQCPG22
Tara N. Sainath, Yanzhang He, Arun Narayanan, Rami Botros, Weiran Wang, David Qiu, Chung-Cheng Chiu, Rohit Prabhavalkar, Alexander Gruenstein, Anmol Gulati, Bo Li, David Rybach, Emmanuel Guzman, Ian McGraw, James Qin, Krzysztof Choromanski, Qiao Liang, Robert David, Ruoming Pang, Shuo-Yiin Chang, Trevor Strohman, W. Ronny Huang, Wei Han, Yonghui Wu, Yu Zhang:
Improving The Latency And Quality Of Cascaded Encoders. ICASSP 2022: 8112-8116
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangLLSC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangLLSC22
Chao Zhang, Bo Li, Zhiyun Lu, Tara N. Sainath, Shuo-Yiin Chang:
Improving the Fusion of Acoustic and Text Representations in RNN-T. ICASSP 2022: 8117-8121
[c53]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChangLSZSLH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChangLSZSLH22
Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Trevor Strohman, Qiao Liang, Yanzhang He:
Turn-Taking Prediction for Natural Conversational Speech. INTERSPEECH 2022: 1821-1825
[c52]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChangPWS0LSUFS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChangPWS0LSUFS22
Shuo-Yiin Chang, Guru Prakash, Zelin Wu, Tara N. Sainath, Bo Li, Qiao Liang, Adam Stambler, Shyam Upadhyay, Manaal Faruqui, Trevor Strohman:
Streaming Intended Query Detection using E2E Modeling for Continued Conversation. INTERSPEECH 2022: 1826-1830
[c51]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiSPCXSCLLHHB22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiSPCXSCLLHHB22
Bo Li, Tara N. Sainath, Ruoming Pang, Shuo-Yiin Chang, Qiumin Xu, Trevor Strohman, Vince Chen, Qiao Liang, Heguang Liu, Yanzhang He, Parisa Haghani, Sameer Bidichandani:
A Language Agnostic Multilingual Streaming On-Device ASR System. INTERSPEECH 2022: 3188-3192
[c50]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangLSSMCH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangLSSMCH22
Chao Zhang, Bo Li, Tara N. Sainath, Trevor Strohman, Sepand Mavandadi, Shuo-Yiin Chang, Parisa Haghani:
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification. INTERSPEECH 2022: 3223-3227
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SainathPBZHCLWS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SainathPBZHCLWS22
Tara N. Sainath, Rohit Prabhavalkar, Ankur Bapna, Yu Zhang, Zhouyuan Huo, Zhehuai Chen, Bo Li, Weiran Wang, Trevor Strohman:
JOIST: A Joint Speech and Text Streaming Model for ASR. SLT 2022: 52-59
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/BijwadiaCLSZH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/BijwadiaCLSZH22
Shaan Bijwadia, Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Yanzhang He:
Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems. SLT 2022: 310-316
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/HuLS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/HuLS22
Ke Hu, Bo Li, Tara N. Sainath:
Scaling Up Deliberation For Multilingual ASR. SLT 2022: 771-776
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/MavandadiLZFSS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/MavandadiLZFSS22
Sepand Mavandadi, Bo Li, Chao Zhang, Brian Farris, Tara N. Sainath, Trevor Strohman:
A Truly Multilingual First Pass and Monolingual Second Pass Streaming on-Device ASR System. SLT 2022: 838-845
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2201-10240
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-10240
Chao Zhang, Bo Li, Zhiyun Lu, Tara N. Sainath, Shuo-Yiin Chang:
Improving the fusion of acoustic and text representations in RNN-T. CoRR abs/2201.10240 (2022)
[i31]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-03067
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-03067
Sandy Ritchie, You-Chi Cheng, Mingqing Chen, Rajiv Mathews, Daan van Esch, Bo Li, Khe Chai Sim:
Large vocabulary speech recognition for languages of Africa: multilingual modeling and self-supervised learning. CoRR abs/2208.03067 (2022)
[i30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-13321
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-13321
Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Trevor Strohman, Qiao Liang, Yanzhang He:
Turn-Taking Prediction for Natural Conversational Speech. CoRR abs/2208.13321 (2022)
[i29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-13322
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-13322
Shuo-Yiin Chang, Guru Prakash, Zelin Wu, Qiao Liang, Tara N. Sainath, Bo Li, Adam Stambler, Shyam Upadhyay, Manaal Faruqui, Trevor Strohman:
Streaming Intended Query Detection using E2E Modeling for Continued Conversation. CoRR abs/2208.13322 (2022)
[i28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-13916
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-13916
Bo Li, Tara N. Sainath, Ruoming Pang, Shuo-Yiin Chang, Qiumin Xu, Trevor Strohman, Vince Chen, Qiao Liang, Heguang Liu, Yanzhang He, Parisa Haghani, Sameer Bidichandani:
A Language Agnostic Multilingual Streaming On-Device ASR System. CoRR abs/2208.13916 (2022)
[i27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-06058
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-06058
Chao Zhang, Bo Li, Tara N. Sainath, Trevor Strohman, Sepand Mavandadi, Shuo-Yiin Chang, Parisa Haghani:
Streaming End-to-End Multilingual Speech Recognition with Joint Language Identification. CoRR abs/2209.06058 (2022)
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-05785
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-05785
Ke Hu, Bo Li, Tara N. Sainath:
Scaling Up Deliberation for Multilingual ASR. CoRR abs/2210.05785 (2022)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-07353
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-07353
Tara N. Sainath, Rohit Prabhavalkar, Ankur Bapna, Yu Zhang, Zhouyuan Huo, Zhehuai Chen, Bo Li, Weiran Wang, Trevor Strohman:
JOIST: A Joint Speech and Text Streaming Model For ASR. CoRR abs/2210.07353 (2022)
[i24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-00786
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-00786
Shaan Bijwadia, Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Chao Zhang, Yanzhang He:
Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems. CoRR abs/2211.00786 (2022)
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-01263
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-01263
Chao-Han Huck Yang, Bo Li, Yu Zhang, Nanxin Chen, Tara N. Sainath, Sabato Marco Siniscalchi, Chin-Hui Lee:
A Quantum Kernel Learning Approach to Acoustic Modeling for Spoken Command Recognition. CoRR abs/2211.01263 (2022)
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-02712
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-02712
Zhouyuan Huo, Khe Chai Sim, Bo Li, Dongseong Hwang, Tara N. Sainath, Trevor Strohman:
Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion. CoRR abs/2211.02712 (2022)
2021
[c45]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiPSGZQHHMB21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiPSGZQHHMB21
Bo Li, Ruoming Pang, Tara N. Sainath, Anmol Gulati, Yu Zhang, James Qin, Parisa Haghani, W. Ronny Huang, Min Ma, Junwen Bai:
Scaling End-to-End Models for Large-Scale Multilingual ASR. ASRU 2021: 1011-1018
[c44]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiGYSCNCPHQ0LZS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiGYSCNCPHQ0LZS21
Bo Li, Anmol Gulati, Jiahui Yu, Tara N. Sainath, Chung-Cheng Chiu, Arun Narayanan, Shuo-Yiin Chang, Ruoming Pang, Yanzhang He, James Qin, Wei Han, Qiao Liang, Yu Zhang, Trevor Strohman, Yonghui Wu:
A Better and Faster end-to-end Model for Streaming ASR. ICASSP 2021: 5634-5638
[c43]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuCLCSHNHGWP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuCLCSHNHGWP21
Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, Arun Narayanan, Wei Han, Anmol Gulati, Yonghui Wu, Ruoming Pang:
FastEmit: Low-Latency Streaming ASR with Sequence-Level Emission Regularization. ICASSP 2021: 6004-6008
[c42]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiQZLHWCS21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiQZLHWCS21
Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman:
Confidence Estimation for Attention-Based Sequence-to-Sequence Models for Speech Recognition. ICASSP 2021: 6388-6392
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/QiuLHZLCPBLHSM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/QiuLHZLCPBLHSM21
David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw:
Learning Word-Level Confidence for Subword End-To-End ASR. ICASSP 2021: 6393-6397
[c40]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/YuHGCLSWP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/YuHGCLSWP21
Jiahui Yu, Wei Han, Anmol Gulati, Chung-Cheng Chiu, Bo Li, Tara N. Sainath, Yonghui Wu, Ruoming Pang:
Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling. ICLR 2021
[c39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SainathHNBPRAVQ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathHNBPRAVQ21
Tara N. Sainath, Yanzhang He, Arun Narayanan, Rami Botros, Ruoming Pang, David Rybach, Cyril Allauzen, Ehsan Variani, James Qin, Quoc-Nam Le-The, Shuo-Yiin Chang, Bo Li, Anmol Gulati, Jiahui Yu, Chung-Cheng Chiu, Diamantino Caseiro, Wei Li, Qiao Liang, Pat Rondon:
An Efficient Streaming Non-Recurrent On-Device End-to-End Model with Improvements to Rare-Word Modeling. Interspeech 2021: 1777-1781
[c38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiZLCW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiZLCW21
Qiujia Li, Yu Zhang, Bo Li, Liangliang Cao, Philip C. Woodland:
Residual Energy-Based Models for End-to-End Speech Recognition. Interspeech 2021: 4069-4073
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2103-06716
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-06716
David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara N. Sainath, Ian McGraw:
Learning Word-Level Confidence For Subword End-to-End ASR. CoRR abs/2103.06716 (2021)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2103-14152
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-14152
Qiujia Li, Yu Zhang, Bo Li, Liangliang Cao, Philip C. Woodland:
Residual Energy-Based Models for End-to-End Speech Recognition. CoRR abs/2103.14152 (2021)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-14830
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-14830
Bo Li, Ruoming Pang, Tara N. Sainath, Anmol Gulati, Yu Zhang, James Qin, Parisa Haghani, W. Ronny Huang, Min Ma:
Scaling End-to-End Models for Large-Scale Multilingual ASR. CoRR abs/2104.14830 (2021)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2109-13226
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-13226
Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu:
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition. CoRR abs/2109.13226 (2021)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-08137
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-08137
Junwen Bai, Bo Li, Yu Zhang, Ankur Bapna, Nikhil Siddhartha, Khe Chai Sim, Tara N. Sainath:
Joint Unsupervised and Supervised Training for Multilingual ASR. CoRR abs/2111.08137 (2021)
2020
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SainathHLNPBCLA20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathHLNPBCLA20
Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-Yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alexander Gruenstein, Ke Hu, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirkó Visontai, Yonghui Wu, Yu Zhang, Ding Zhao:
A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency. ICASSP 2020: 6059-6063
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiCSPHSW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiCSPHSW20
Bo Li, Shuo-Yiin Chang, Tara N. Sainath, Ruoming Pang, Yanzhang He, Trevor Strohman, Yonghui Wu:
Towards Fast and Accurate Streaming End-To-End ASR. ICASSP 2020: 6069-6073
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ParkZCCLCLW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ParkZCCLCLW20
Daniel S. Park, Yu Zhang, Chung-Cheng Chiu, Youzheng Chen, Bo Li, William Chan, Quoc V. Le, Yonghui Wu:
Specaugment on Large Scale Datasets. ICASSP 2020: 6879-6883
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuLZAS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuLZAS20
Zelin Wu, Bo Li, Yu Zhang, Petar S. Aleksic, Tara N. Sainath:
Multistate Encoding with End-To-End Speech RNN Transducer Network. ICASSP 2020: 7819-7823
[c33]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Chang0RHLSS20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Chang0RHLSS20
Shuo-Yiin Chang, Bo Li, David Rybach, Yanzhang He, Wei Li, Tara N. Sainath, Trevor Strohman:
Low Latency Speech Recognition Using End-to-End Prefetching. INTERSPEECH 2020: 1962-1966
[c32]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ParkZJHCLWL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ParkZJHCLWL20
Daniel S. Park, Yu Zhang, Ye Jia, Wei Han, Chung-Cheng Chiu, Bo Li, Yonghui Wu, Quoc V. Le:
Improved Noisy Student Training for Automatic Speech Recognition. INTERSPEECH 2020: 2817-2821
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2003-12710
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-12710
Tara N. Sainath, Yanzhang He, Bo Li, Arun Narayanan, Ruoming Pang, Antoine Bruguier, Shuo-Yiin Chang, Wei Li, Raziel Alvarez, Zhifeng Chen, Chung-Cheng Chiu, David Garcia, Alexander Gruenstein, Ke Hu, Minho Jin, Anjuli Kannan, Qiao Liang, Ian McGraw, Cal Peyser, Rohit Prabhavalkar, Golan Pundak, David Rybach, Yuan Shangguan, Yash Sheth, Trevor Strohman, Mirkó Visontai, Yonghui Wu, Yu Zhang, Ding Zhao:
A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency. CoRR abs/2003.12710 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-09629
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-09629
Daniel S. Park, Yu Zhang, Ye Jia, Wei Han, Chung-Cheng Chiu, Bo Li, Yonghui Wu, Quoc V. Le:
Improved Noisy Student Training for Automatic Speech Recognition. CoRR abs/2005.09629 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-06030
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-06030
Jiahui Yu, Wei Han, Anmol Gulati, Chung-Cheng Chiu, Bo Li, Tara N. Sainath, Yonghui Wu, Ruoming Pang:
Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling. CoRR abs/2010.06030 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11148
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11148
Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-Yiin Chang, Tara N. Sainath, Yanzhang He, Arun Narayanan, Wei Han, Anmol Gulati, Yonghui Wu, Ruoming Pang:
FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization. CoRR abs/2010.11148 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11428
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11428
Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Philip C. Woodland, Liangliang Cao, Trevor Strohman:
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition. CoRR abs/2010.11428 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-10798
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-10798
Bo Li, Anmol Gulati, Jiahui Yu, Tara N. Sainath, Chung-Cheng Chiu, Arun Narayanan, Shuo-Yiin Chang, Ruoming Pang, Yanzhang He, James Qin, Wei Han, Qiao Liang, Yu Zhang, Trevor Strohman, Yonghui Wu:
A Better and Faster End-to-End Model for Streaming ASR. CoRR abs/2011.10798 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/PurwinsSLNA19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/PurwinsSLNA19
Hendrik Purwins, Bob L. Sturm, Bo Li, Juhan Nam, Abeer Alwan:
Introduction to the Issue on Data Science: Machine Learning for Audio Signal Processing. IEEE J. Sel. Top. Signal Process. 13(2): 203-205 (2019)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/PurwinsLVSCS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/PurwinsLVSCS19
Hendrik Purwins, Bo Li, Tuomas Virtanen, Jan Schlüter, Shuo-Yiin Chang, Tara N. Sainath:
Deep Learning for Audio Signal Processing. IEEE J. Sel. Top. Signal Process. 13(2): 206-219 (2019)
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ChangLS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ChangLS19
Shuo-Yiin Chang, Bo Li, Gabor Simko:
A Unified Endpointer Using Multitask and Multidomain Training. ASRU 2019: 100-106
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0028SPW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0028SPW19
Bo Li, Tara N. Sainath, Ruoming Pang, Zelin Wu:
Semi-supervised Training for End-to-end Models via Weak Distillation. ICASSP 2019: 2837-2841
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiZSWC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiZSWC19
Bo Li, Yu Zhang, Tara N. Sainath, Yonghui Wu, William Chan:
Bytes Are All You Need: End-to-end Multilingual Speech Recognition and Synthesis with Bytes. ICASSP 2019: 5621-5625
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeymannS019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeymannS019
Jahn Heymann, Khe Chai Sim, Bo Li:
Improving CTC Using Stimulated Learning for Sequence Modeling. ICASSP 2019: 5701-5705
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HeSPMAZRKWPLBSL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HeSPMAZRKWPLBSL19
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition for Mobile Devices. ICASSP 2019: 6381-6385
[c26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhaoSRRBLP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhaoSRRBLP19
Ding Zhao, Tara N. Sainath, David Rybach, Pat Rondon, Deepti Bhatia, Bo Li, Ruoming Pang:
Shallow-Fusion End-to-End Contextual Biasing. INTERSPEECH 2019: 1418-1422
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1902-08295
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-08295
Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia Xu Chen, Ye Jia, Anjuli Kannan, Tara N. Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George F. Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel Bacchiani, Thomas B. Jablin, Robert Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon:
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling. CoRR abs/1902.08295 (2019)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1905-00078
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-00078
Hendrik Purwins, Bo Li, Tuomas Virtanen, Jan Schlüter, Shuo-Yiin Chang, Tara N. Sainath:
Deep Learning for Audio Signal Processing. CoRR abs/1905.00078 (2019)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1912-05533
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-05533
Daniel S. Park, Yu Zhang, Chung-Cheng Chiu, Youzheng Chen, Bo Li, William Chan, Quoc V. Le, Yonghui Wu:
SpecAugment on Large Scale Datasets. CoRR abs/1912.05533 (2019)
2018
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiSSBWNCWR18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiSSBWNCWR18
Bo Li, Tara N. Sainath, Khe Chai Sim, Michiel Bacchiani, Eugene Weinstein, Patrick Nguyen, Zhifeng Chen, Yanghui Wu, Kanishka Rao:
Multi-Dialect Speech Recognition with a Single Sequence-to-Sequence Model. ICASSP 2018: 4749-4753
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChiuSWPNCKWRGJL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChiuSWPNCKWRGJL18
Chung-Cheng Chiu, Tara N. Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J. Weiss, Kanishka Rao, Ekaterina Gonina, Navdeep Jaitly, Bo Li, Jan Chorowski, Michiel Bacchiani:
State-of-the-Art Speech Recognition with Sequence-to-Sequence Models. ICASSP 2018: 4774-4778
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ToshniwalSWLMWR18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ToshniwalSWLMWR18
Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro J. Moreno, Eugene Weinstein, Kanishka Rao:
Multilingual Speech Recognition with a Single End-to-End Model. ICASSP 2018: 4904-4908
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DonahueLP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DonahueLP18
Chris Donahue, Bo Li, Rohit Prabhavalkar:
Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition. ICASSP 2018: 5024-5028
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChangLSSTOV18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChangLSSTOV18
Shuo-Yiin Chang, Bo Li, Gabor Simko, Tara N. Sainath, Anshuman Tripathi, Aäron van den Oord, Oriol Vinyals:
Temporal Modeling Using Dilated Convolution and Gating for Voice-Activity-Detection. ICASSP 2018: 5549-5553
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SainathPKLKRSNL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathPKLKRSNL18
Tara N. Sainath, Rohit Prabhavalkar, Shankar Kumar, Seungji Lee, Anjuli Kannan, David Rybach, Vlad Schogol, Patrick Nguyen, Bo Li, Yonghui Wu, Zhifeng Chen, Chung-Cheng Chiu:
No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models. ICASSP 2018: 5859-5863
[c19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SimNMTPSHLB18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SimNMTPSHLB18
Khe Chai Sim, Arun Narayanan, Ananya Misra, Anshuman Tripathi, Golan Pundak, Tara N. Sainath, Parisa Haghani, Bo Li, Michiel Bacchiani:
Domain Adaptation Using Factorized Hidden Layer for Robust Automatic Speech Recognition. INTERSPEECH 2018: 892-896
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-06621
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-06621
Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition For Mobile Devices. CoRR abs/1811.06621 (2018)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-09021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-09021
Bo Li, Yu Zhang, Tara N. Sainath, Yonghui Wu, William Chan:
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes. CoRR abs/1811.09021 (2018)
2017
[j3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/jaihc/XieHL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jaihc/XieHL17
Lei Xie, Janne Heikkilä, Bo Li:
Media computing and applications for immersive communications: recent advances. J. Ambient Intell. Humaniz. Comput. 8(6): 827-828 (2017)
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SainathWWLNVBSS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SainathWWLNVBSS17
Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Bo Li, Arun Narayanan, Ehsan Variani, Michiel Bacchiani, Izhak Shafran, Andrew W. Senior, Kean K. Chin, Ananya Misra, Chanwoo Kim:
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 965-979 (2017)
[c18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiSNCBMSSPCSWWV17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiSNCBMSSPCSWWV17
Bo Li, Tara N. Sainath, Arun Narayanan, Joe Caroselli, Michiel Bacchiani, Ananya Misra, Izhak Shafran, Hasim Sak, Golan Pundak, Kean K. Chin, Khe Chai Sim, Ron J. Weiss, Kevin W. Wilson, Ehsan Variani, Chanwoo Kim, Olivier Siohan, Mitchel Weintraub, Erik McDermott, Richard Rose, Matt Shannon:
Acoustic Modeling for Google Home. INTERSPEECH 2017: 399-403
[c17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PrabhavalkarRSL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PrabhavalkarRSL17
Rohit Prabhavalkar, Kanishka Rao, Tara N. Sainath, Bo Li, Leif Johnson, Navdeep Jaitly:
A Comparison of Sequence-to-Sequence Models for Speech Recognition. INTERSPEECH 2017: 939-943
[c16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiS17
Bo Li, Tara N. Sainath:
Reducing the Computational Complexity of Two-Dimensional LSTMs. INTERSPEECH 2017: 964-968
[c15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PrabhavalkarSLR17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PrabhavalkarSLR17
Rohit Prabhavalkar, Tara N. Sainath, Bo Li, Kanishka Rao, Navdeep Jaitly:
An Analysis of "Attention" in Sequence-to-Sequence Models. INTERSPEECH 2017: 3702-3706
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChangLSSP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChangLSSP17
Shuo-Yiin Chang, Bo Li, Tara N. Sainath, Gabor Simko, Carolina Parada:
Endpoint Detection Using Grid Long Short-Term Memory Networks for Streaming Speech Recognition. INTERSPEECH 2017: 3812-3816
[p1]
- view
  authority control:
- export record
  dblp key:
  - books/sp/17/SainathWWNBLVSSCMK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/sp/17/SainathWWNBLVSSCMK17
Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Arun Narayanan, Michiel Bacchiani, Bo Li, Ehsan Variani, Izhak Shafran, Andrew W. Senior, Kean K. Chin, Ananya Misra, Chanwoo Kim:
Raw Multichannel Processing Using Deep Neural Networks. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 105-133
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1711-01694
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-01694
Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro J. Moreno, Eugene Weinstein, Kanishka Rao:
Multilingual Speech Recognition With A Single End-To-End Model. CoRR abs/1711.01694 (2017)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1711-05747
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-05747
Chris Donahue, Bo Li, Rohit Prabhavalkar:
Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition. CoRR abs/1711.05747 (2017)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1712-01541
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-01541
Bo Li, Tara N. Sainath, Khe Chai Sim, Michiel Bacchiani, Eugene Weinstein, Patrick Nguyen, Zhifeng Chen, Yonghui Wu, Kanishka Rao:
Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model. CoRR abs/1712.01541 (2017)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1712-01769
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-01769
Chung-Cheng Chiu, Tara N. Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J. Weiss, Kanishka Rao, Katya Gonina, Navdeep Jaitly, Bo Li, Jan Chorowski, Michiel Bacchiani:
State-of-the-art Speech Recognition With Sequence-to-Sequence Models. CoRR abs/1712.01769 (2017)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1712-01864
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-01864
Tara N. Sainath, Rohit Prabhavalkar, Shankar Kumar, Seungji Lee, Anjuli Kannan, David Rybach, Vlad Schogol, Patrick Nguyen, Bo Li, Yonghui Wu, Zhifeng Chen, Chung-Cheng Chiu:
No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models. CoRR abs/1712.01864 (2017)
2016
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SainathL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SainathL16
Tara N. Sainath, Bo Li:
Modeling Time-Frequency Patterns with LSTM vs. Convolutional Architectures for LVCSR Tasks. INTERSPEECH 2016: 813-817
[c12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiSWWB16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiSWWB16
Bo Li, Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Michiel Bacchiani:
Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition. INTERSPEECH 2016: 1976-1980
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiZ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiZ16
Bo Li, Heiga Zen:
Multi-Language Multi-Speaker Acoustic Modeling for LSTM-RNN Based Statistical Parametric Speech Synthesis. INTERSPEECH 2016: 2468-2472
2014
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiS14
Bo Li, Khe Chai Sim:
A Spectral Masking Approach to Noise-Robust Speech Recognition Using Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 22(8): 1296-1305 (2014)
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiS14
Bo Li, Khe Chai Sim:
An ideal hidden-activation mask for deep neural networks based noise-robust speech recognition. ICASSP 2014: 200-204
[c9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiS14
Bo Li, Khe Chai Sim:
Modeling long temporal contexts for robust DNN-based speech recognition. INTERSPEECH 2014: 353-357
2013
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/DuanFLSW13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/DuanFLSW13
Zhiyan Duan, Haotian Fang, Bo Li, Khe Chai Sim, Ye Wang:
The NUS sung and spoken lyrics corpus: A quantitative comparison of singing and speech. APSIPA 2013: 1-9
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiS13
Bo Li, Khe Chai Sim:
Improving robustness of deep neural networks via spectral masking for automatic speech recognition. ASRU 2013: 279-284
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiS13a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiS13a
Bo Li, Khe Chai Sim:
Noise adaptive front-end normalization based on Vector Taylor Series for Deep Neural Networks in robust speech recognition. ICASSP 2013: 7408-7412
[c5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiTS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiTS13
Bo Li, Yu Tsao, Khe Chai Sim:
An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition. INTERSPEECH 2013: 3002-3006
2012
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icmi/WangLLWWS12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmi/WangLLWWS12
Guangsen Wang, Bo Li, Shilin Liu, Xuancong Wang, Xiaoxuan Wang, Khe Chai Sim:
Improving mandarin predictive text input by augmenting pinyin initials with speech and tonal information. ICMI 2012: 545-550
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiS12a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiS12a
Bo Li, Khe Chai Sim:
A Two-stage Speaker Adaptation Approach for Subspace Gaussian Mixture Model based Nonnative Speech Recognition. INTERSPEECH 2012: 1772-1775
2010
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiS10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiS10
Bo Li, Khe Chai Sim:
Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems. INTERSPEECH 2010: 526-529
[c1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiS10a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiS10a
Bo Li, Khe Chai Sim:
Hidden logistic linear regression for support vector machine based phone verification. INTERSPEECH 2010: 2614-2617

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.