Cheng-I Lai

Cheng-I Jeff Lai

2020 – today

2024
[j1]
Shu-Wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee:
A Large-Scale Evaluation of Speech Foundation Models. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2884-2899 (2024)
[i20]
Shu-Wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee:
A Large-Scale Evaluation of Speech Foundation Models. CoRR abs/2404.09385 (2024)
2023
[c19]
Cheng-I Jeff Lai, Freda Shi, Puyuan Peng, Yoon Kim, Kevin Gimpel, Shiyu Chang, Yung-Sung Chuang, Saurabhchand Bhati, David D. Cox, David Harwath, Yang Zhang, Karen Livescu, James R. Glass:
Audio-Visual Neural Syntax Acquisition. ASRU 2023: 1-8
[c18]
Yuan Tseng, Cheng-I Jeff Lai, Hung-Yi Lee:
Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences. ICASSP 2023: 1-5
[i19]
Yuan Tseng, Cheng-I Lai, Hung-yi Lee:
Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences. CoRR abs/2303.08809 (2023)
[i18]
Cheng-I Jeff Lai, Zhiyun Lu, Liangliang Cao, Ruoming Pang:
Instruction-Following Speech Recognition. CoRR abs/2309.09843 (2023)
[i17]
Cheng-I Jeff Lai, Freda Shi, Puyuan Peng, Yoon Kim, Kevin Gimpel, Shiyu Chang, Yung-Sung Chuang, Saurabhchand Bhati, David D. Cox, David Harwath, Yang Zhang, Karen Livescu, James R. Glass:
Audio-Visual Neural Syntax Acquisition. CoRR abs/2310.07654 (2023)
2022
[c17]
Yuan Gong, Cheng-I Lai, Yu-An Chung, James R. Glass:
SSAST: Self-Supervised Audio Spectrogram Transformer. AAAI 2022: 10699-10709
[c16]
Alexander H. Liu, SouYoung Jin, Cheng-I Lai, Andrew Rouditchenko, Aude Oliva, James R. Glass:
Cross-Modal Discrete Representation Learning. ACL (1) 2022: 3013-3035
[c15]
Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-Wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities. ACL (1) 2022: 8479-8492
[c14]
Cheng-I Jeff Lai, Erica Cooper, Yang Zhang, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David D. Cox, James R. Glass:
On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis. ICASSP 2022: 8447-8451
[c13]
Kaizhi Qian, Yang Zhang, Heting Gao, Junrui Ni, Cheng-I Lai, David D. Cox, Mark Hasegawa-Johnson, Shiyu Chang:
ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers. ICML 2022: 18003-18017
[c12]
Alexander H. Liu, Cheng-I Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James R. Glass:
Simple and Effective Unsupervised Speech Synthesis. INTERSPEECH 2022: 843-847
[c11]
Yonggan Fu, Yang Zhang, Kaizhi Qian, Zhifan Ye, Zhongzhi Yu, Cheng-I Jeff Lai, Celine Lin:
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing. NeurIPS 2022
[i16]
Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-Wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Jeff Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities. CoRR abs/2203.06849 (2022)
[i15]
Alexander H. Liu, Cheng-I Jeff Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James R. Glass:
Simple and Effective Unsupervised Speech Synthesis. CoRR abs/2204.02524 (2022)
[i14]
Kaizhi Qian, Yang Zhang, Heting Gao, Junrui Ni, Cheng-I Lai, David D. Cox, Mark Hasegawa-Johnson, Shiyu Chang:
Improving Self-Supervised Speech Representations by Disentangling Speakers. CoRR abs/2204.09224 (2022)
[i13]
Yonggan Fu, Yang Zhang, Kaizhi Qian, Zhifan Ye, Zhongzhi Yu, Cheng-I Lai, Yingyan Lin:
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing. CoRR abs/2211.01522 (2022)
2021
[c10]
Cheng-I Lai, Yung-Sung Chuang, Hung-Yi Lee, Shang-Wen Li, James R. Glass:
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining. ICASSP 2021: 7468-7472
[c9]
Shu-Wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB: Speech Processing Universal PERformance Benchmark. Interspeech 2021: 1194-1198
[c8]
Cheng-I Jeff Lai, Yang Zhang, Alexander H. Liu, Shiyu Chang, Yi-Lun Liao, Yung-Sung Chuang, Kaizhi Qian, Sameer Khurana, David D. Cox, James R. Glass:
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition. NeurIPS 2021: 21256-21272
[i12]
Shu-Wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB: Speech processing Universal PERformance Benchmark. CoRR abs/2105.01051 (2021)
[i11]
Alexander H. Liu, SouYoung Jin, Cheng-I Jeff Lai, Andrew Rouditchenko, Aude Oliva, James R. Glass:
Cross-Modal Discrete Representation Learning. CoRR abs/2106.05438 (2021)
[i10]
Cheng-I Jeff Lai, Yang Zhang, Alexander H. Liu, Shiyu Chang, Yi-Lun Liao, Yung-Sung Chuang, Kaizhi Qian, Sameer Khurana, David D. Cox, James R. Glass:
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition. CoRR abs/2106.05933 (2021)
[i9]
Cheng-I Jeff Lai, Erica Cooper, Yang Zhang, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David D. Cox, James R. Glass:
On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis. CoRR abs/2110.01147 (2021)
[i8]
Yuan Gong, Cheng-I Jeff Lai, Yu-An Chung, James R. Glass:
SSAST: Self-Supervised Audio Spectrogram Transformer. CoRR abs/2110.09784 (2021)
2020
[c7]
Erica Cooper, Cheng-I Lai, Yusuke Yasuda, Fuming Fang, Xin Wang, Nanxin Chen, Junichi Yamagishi:
Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings. ICASSP 2020: 6184-6188
[c6]
Erica Cooper, Cheng-I Lai, Yusuke Yasuda, Junichi Yamagishi:
Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS? INTERSPEECH 2020: 3979-3983
[c5]
Yi Zhao, Haoyu Li, Cheng-I Lai, Jennifer Williams, Erica Cooper, Junichi Yamagishi:
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction. INTERSPEECH 2020: 4417-4421
[i7]
Yi Zhao, Haoyu Li, Cheng-I Lai, Jennifer Williams, Erica Cooper, Junichi Yamagishi:
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction. CoRR abs/2005.07884 (2020)
[i6]
Cheng-I Lai, Yung-Sung Chuang, Hung-yi Lee, Shang-wen Li, James R. Glass:
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining. CoRR abs/2010.13826 (2020)
[i5]
Cheng-I Lai, Jin Cao, Sravan Bodapati, Shang-Wen Li:
Towards Semi-Supervised Semantics Understanding from Speech. CoRR abs/2011.06195 (2020)
[i4]
Fan-Keng Sun, Cheng-I Lai:
Conditioned Natural Language Generation using only Unconditioned Language Model: An Exploration. CoRR abs/2011.07347 (2020)

2010 – 2019

2019
[c4]
Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King:
Attentive Filtering Networks for Audio Replay Attack Detection. ICASSP 2019: 6316-6320
[c3]
Cheng-I Lai, Nanxin Chen, Jesús Villalba, Najim Dehak:
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual Networks. INTERSPEECH 2019: 1013-1017
[c2]
Kelly Marchisio, Jialiang Guo, Cheng-I Lai, Philipp Koehn:
Controlling the Reading Level of Machine Translation Output. MTSummit (1) 2019: 193-203
[i3]
Cheng-I Lai, Nanxin Chen, Jesús Villalba, Najim Dehak:
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual neTworks. CoRR abs/1904.01120 (2019)
[i2]
Cheng-I Lai:
Contrastive Predictive Coding Based Feature for Automatic Speaker Verification. CoRR abs/1904.01575 (2019)
2018
[c1]
Phani Sankar Nidadavolu, Cheng-I Lai, Jesús Villalba, Najim Dehak:
Investigation on Bandwidth Extension for Speaker Recognition. INTERSPEECH 2018: 1111-1115
[i1]
Cheng-I Lai, Alberto Abad, Korin Richmond, Junichi Yamagishi, Najim Dehak, Simon King:
Attentive Filtering Networks for Audio Replay Attack Detection. CoRR abs/1810.13048 (2018)
