default search action
Erica Cooper
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j7]Aidan Pine, Erica Cooper, David Guzmán, Eric Joanis, Anna Kazantseva, Ross Krekoski, Roland Kuhn, Samuel Larkin, Patrick Littell, Delaney Lothian, Akwiratékha' Martin, Korin Richmond, Marc Tessier, Cassia Valentini-Botinhao, Dan Wells, Junichi Yamagishi:
Speech Generation for Indigenous Language Education. Comput. Speech Lang. 90: 101723 (2025) - 2024
- [j6]Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi:
Joint speaker encoder and neural back-end model for fully end-to-end automatic speaker verification with multiple enrollment utterances. Comput. Speech Lang. 86: 101619 (2024) - [j5]Cheng Gong, Xin Wang, Erica Cooper, Dan Wells, Longbiao Wang, Jianwu Dang, Korin Richmond, Junichi Yamagishi:
ZMM-TTS: Zero-Shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-Supervised Discrete Speech Representations. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4036-4051 (2024) - [c38]Aditya Ravuri, Erica Cooper, Junichi Yamagishi:
Uncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction. ICASSP Workshops 2024: 580-584 - [c37]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Nicholas W. D. Evans, Massimiliano Todisco, Jean-François Bonastre, Mickael Rouvier:
Synvox2: Towards A Privacy-Friendly Voxceleb2 Dataset. ICASSP 2024: 11421-11425 - [i34]Lin Zhang, Xin Wang, Erica Cooper, Mireia Díez, Federico Landini, Nicholas W. D. Evans, Junichi Yamagishi:
Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio. CoRR abs/2406.07816 (2024) - [i33]Zhengyang Chen, Xuechen Liu, Erica Cooper, Junichi Yamagishi, Yanmin Qian:
Generating Speakers by Prompting Listener Impressions for Pre-trained Multi-Speaker Text-to-Speech Systems. CoRR abs/2406.08812 (2024) - [i32]Cheng Gong, Erica Cooper, Xin Wang, Chunyu Qiang, Mengzhe Geng, Dan Wells, Longbiao Wang, Jianwu Dang, Marc Tessier, Aidan Pine, Korin Richmond, Junichi Yamagishi:
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios. CoRR abs/2406.08911 (2024) - [i31]Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi:
Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches. CoRR abs/2409.06327 (2024) - [i30]Wen-Chin Huang, Szu-Wei Fu, Erica Cooper, Ryandhimas E. Zezario, Tomoki Toda, Hsin-Min Wang, Junichi Yamagishi, Yu Tsao:
The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction. CoRR abs/2409.07001 (2024) - 2023
- [j4]Lin Zhang, Xin Wang, Erica Cooper, Nicholas W. D. Evans, Junichi Yamagishi:
The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance. IEEE ACM Trans. Audio Speech Lang. Process. 31: 813-825 (2023) - [j3]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko:
Speaker Anonymization Using Orthogonal Householder Neural Network. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3681-3695 (2023) - [c36]Lifan Zhong, Erica Cooper, Junichi Yamagishi, Nobuaki Minematsu:
Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music. APSIPA ASC 2023: 2312-2319 - [c35]Erica Cooper, Wen-Chin Huang, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi:
The Voicemos Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains. ASRU 2023: 1-7 - [c34]Hemant Yadav, Erica Cooper, Junichi Yamagishi, Sunayana Sitaram, Rajiv Ratn Shah:
Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-Supervised Setting. ASRU 2023: 1-7 - [c33]Xuan Shi, Erica Cooper, Xin Wang, Junichi Yamagishi, Shrikanth Narayanan:
Can Knowledge of End-to-End Text-to-Speech Models Improve Neural Midi-to-Audio Synthesis Systems? ICASSP 2023: 1-5 - [c32]Erica Cooper, Junichi Yamagishi:
Investigating Range-Equalizing Bias in Mean Opinion Score Ratings of Synthesized Speech. INTERSPEECH 2023: 1104-1108 - [c31]Chang Zeng, Xin Wang, Xiaoxiao Miao, Erica Cooper, Junichi Yamagishi:
Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms. INTERSPEECH 2023: 1998-2002 - [c30]Lin Zhang, Xin Wang, Erica Cooper, Nicholas W. D. Evans, Junichi Yamagishi:
Range-Based Equal Error Rate for Spoof Localization. INTERSPEECH 2023: 3212-3216 - [c29]Orian Sharoni, Roee Shenberg, Erica Cooper:
SASPEECH: A Hebrew Single Speaker Dataset for Text To Speech and Voice Conversion. INTERSPEECH 2023: 5566-5570 - [i29]Lin Zhang, Xin Wang, Erica Cooper, Nicholas W. D. Evans, Junichi Yamagishi:
Range-Based Equal Error Rate for Spoof Localization. CoRR abs/2305.17739 (2023) - [i28]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko:
Language-independent speaker anonymization using orthogonal Householder neural network. CoRR abs/2305.18823 (2023) - [i27]Lifan Zhong, Erica Cooper, Junichi Yamagishi, Nobuaki Minematsu:
Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music. CoRR abs/2306.08850 (2023) - [i26]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Nicholas W. D. Evans, Massimiliano Todisco, Jean-François Bonastre, Mickael Rouvier:
SynVox2: Towards a privacy-friendly VoxCeleb2 dataset. CoRR abs/2309.06141 (2023) - [i25]Nicolas Jonason, Xin Wang, Erica Cooper, Lauri Juvela, Bob L. T. Sturm, Junichi Yamagishi:
DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input. CoRR abs/2309.07658 (2023) - [i24]Hemant Yadav, Erica Cooper, Junichi Yamagishi, Sunayana Sitaram, Rajiv Ratn Shah:
Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting. CoRR abs/2310.05078 (2023) - [i23]Xuechen Liu, Xin Wang, Erica Cooper, Xiaoxiao Miao, Junichi Yamagishi:
Speaker-Text Retrieval via Contrastive Learning. CoRR abs/2312.06055 (2023) - [i22]Cheng Gong, Xin Wang, Erica Cooper, Dan Wells, Longbiao Wang, Jianwu Dang, Korin Richmond, Junichi Yamagishi:
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations. CoRR abs/2312.14398 (2023) - [i21]Aditya Ravuri, Erica Cooper, Junichi Yamagishi:
Uncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction. CoRR abs/2312.15616 (2023) - 2022
- [j2]Xuan Shi, Erica Cooper, Junichi Yamagishi:
Use of Speaker Recognition Approaches for Learning and Evaluating Embedding Representations of Musical Instrument Sounds. IEEE ACM Trans. Audio Speech Lang. Process. 30: 367-377 (2022) - [c28]Wen-Chin Huang, Erica Cooper, Junichi Yamagishi, Tomoki Toda:
LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech. ICASSP 2022: 896-900 - [c27]Chang Zeng, Xin Wang, Erica Cooper, Xiaoxiao Miao, Junichi Yamagishi:
Attention Back-End for Automatic Speaker Verification with Multiple Enrollment Utterances. ICASSP 2022: 6717-6721 - [c26]Erica Cooper, Wen-Chin Huang, Tomoki Toda, Junichi Yamagishi:
Generalization Ability of MOS Prediction Networks. ICASSP 2022: 8442-8446 - [c25]Cheng-I Jeff Lai, Erica Cooper, Yang Zhang, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David D. Cox, James R. Glass:
On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis. ICASSP 2022: 8447-8451 - [c24]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko:
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions. INTERSPEECH 2022: 4426-4430 - [c23]Wen-Chin Huang, Erica Cooper, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi:
The VoiceMOS Challenge 2022. INTERSPEECH 2022: 4536-4540 - [c22]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko:
Language-Independent Speaker Anonymization Approach Using Self-Supervised Pre-Trained Models. Odyssey 2022: 279-286 - [i20]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko:
Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models. CoRR abs/2202.13097 (2022) - [i19]Wen-Chin Huang, Erica Cooper, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi:
The VoiceMOS Challenge 2022. CoRR abs/2203.11389 (2022) - [i18]Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi, Natalia A. Tomashenko:
Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions. CoRR abs/2203.14834 (2022) - [i17]Lin Zhang, Xin Wang, Erica Cooper, Nicholas W. D. Evans, Junichi Yamagishi:
The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance. CoRR abs/2204.05177 (2022) - [i16]Chang Zeng, Xiaoxiao Miao, Xin Wang, Erica Cooper, Junichi Yamagishi:
Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances. CoRR abs/2209.00485 (2022) - [i15]Xuan Shi, Erica Cooper, Xin Wang, Junichi Yamagishi, Shrikanth Narayanan:
Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems? CoRR abs/2211.13868 (2022) - 2021
- [c21]Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Junichi Yamagishi:
How Similar or Different is Rakugo Speech Synthesizer to Professional Performers? ICASSP 2021: 6488-6492 - [c20]Jennifer Williams, Yi Zhao, Erica Cooper, Junichi Yamagishi:
Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm. ICASSP 2021: 7053-7057 - [c19]Lin Zhang, Xin Wang, Erica Cooper, Junichi Yamagishi, Jose Patino, Nicholas W. D. Evans:
An Initial Investigation for Detecting Partially Spoofed Audio. Interspeech 2021: 4264-4268 - [c18]Jennifer Williams, Jason Fong, Erica Cooper, Junichi Yamagishi:
Exploring Disentanglement with Multilingual and Monolingual VQ-VAE. SSW 2021: 124-129 - [c17]Erica Cooper, Xin Wang, Junichi Yamagishi:
Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis. SSW 2021: 130-135 - [c16]Erica Cooper, Junichi Yamagishi:
How do Voices from Past Speech Synthesis Challenges Compare Today? SSW 2021: 183-188 - [i14]Chang Zeng, Xin Wang, Erica Cooper, Junichi Yamagishi:
Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances. CoRR abs/2104.01541 (2021) - [i13]Lin Zhang, Xin Wang, Erica Cooper, Junichi Yamagishi, Jose Patino, Nicholas W. D. Evans:
An Initial Investigation for Detecting Partially Spoofed Audio. CoRR abs/2104.02518 (2021) - [i12]Erica Cooper, Xin Wang, Junichi Yamagishi:
Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis. CoRR abs/2104.12292 (2021) - [i11]Jennifer Williams, Jason Fong, Erica Cooper, Junichi Yamagishi:
Exploring Disentanglement with Multilingual and Monolingual VQ-VAE. CoRR abs/2105.01573 (2021) - [i10]Erica Cooper, Junichi Yamagishi:
How do Voices from Past Speech Synthesis Challenges Compare Today? CoRR abs/2105.02373 (2021) - [i9]Xuan Shi, Erica Cooper, Junichi Yamagishi:
Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms. CoRR abs/2107.11506 (2021) - [i8]Lin Zhang, Xin Wang, Erica Cooper, Junichi Yamagishi:
Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection. CoRR abs/2107.14132 (2021) - [i7]Cheng-I Jeff Lai, Erica Cooper, Yang Zhang, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David D. Cox, James R. Glass:
On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis. CoRR abs/2110.01147 (2021) - [i6]Wen-Chin Huang, Erica Cooper, Junichi Yamagishi, Tomoki Toda:
LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech. CoRR abs/2110.09103 (2021) - 2020
- [j1]Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Shinji Takaki, Junichi Yamagishi:
Modeling of Rakugo Speech and Its Limitations: Toward Speech Synthesis That Entertains Audiences. IEEE Access 8: 138149-138161 (2020) - [c15]Erica Cooper, Cheng-I Lai, Yusuke Yasuda, Fuming Fang, Xin Wang, Nanxin Chen, Junichi Yamagishi:
Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings. ICASSP 2020: 6184-6188 - [c14]Erica Cooper, Cheng-I Lai, Yusuke Yasuda, Junichi Yamagishi:
Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS? INTERSPEECH 2020: 3979-3983 - [c13]Yi Zhao, Haoyu Li, Cheng-I Lai, Jennifer Williams, Erica Cooper, Junichi Yamagishi:
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction. INTERSPEECH 2020: 4417-4421 - [i5]Yi Zhao, Haoyu Li, Cheng-I Lai, Jennifer Williams, Erica Cooper, Junichi Yamagishi:
Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction. CoRR abs/2005.07884 (2020) - [i4]Antoine Perquin, Erica Cooper, Junichi Yamagishi:
Grapheme or phoneme? An Analysis of Tacotron's Embedded Representations. CoRR abs/2010.10694 (2020) - [i3]Jennifer Williams, Yi Zhao, Erica Cooper, Junichi Yamagishi:
Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm. CoRR abs/2010.10727 (2020) - [i2]Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Junichi Yamagishi:
How Similar or Different Is Rakugo Speech Synthesizer to Professional Performers? CoRR abs/2010.11549 (2020) - [i1]Erica Cooper, Xin Wang, Yi Zhao, Yusuke Yasuda, Junichi Yamagishi:
Pretraining Strategies, Waveform Model Choice, and Acoustic Configurations for Multi-Speaker End-to-End Speech Synthesis. CoRR abs/2011.04839 (2020)
2010 – 2019
- 2019
- [b1]Erica Cooper:
Text-to-Speech Synthesis Using Found Data for Low-Resource Languages. Columbia University, USA, 2019 - [c12]Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Shinji Takaki, Junichi Yamagishi:
Rakugo speech synthesis using segment-to-segment neural transduction and style tokens - toward speech synthesis for entertaining audiences. SSW 2019: 111-116 - [c11]Elshadai Tesfaye Biru, Yishak Tofik Mohammed, David Tofu, Erica Cooper, Julia Hirschberg:
Subset Selection, Adaptation, Gemination and Prosody Prediction for Amharic Text-to-Speech Synthesis. SSW 2019: 205-210 - 2018
- [c10]Kai-Zhan Lee, Erica Cooper, Julia Hirschberg:
A Comparison of Speaker-based and Utterance-based Data Selection for Text-to-Speech Synthesis. INTERSPEECH 2018: 2873-2877 - 2017
- [c9]Erica Cooper, Xinyue Wang, Alison Chang, Yocheved Levitan, Julia Hirschberg:
Utterance Selection for Optimizing Intelligibility of TTS Voices Trained on ASR Data. INTERSPEECH 2017: 3971-3975 - 2016
- [c8]Gideon Mendels, Erica Cooper, Julia Hirschberg:
Babler - Data Collection from the Web to Support Speech Recognition and Keyword Search. WAC@ACL 2016: 72-81 - [c7]Erica Cooper, Alison Chang, Yocheved Levitan, Julia Hirschberg:
Data Selection and Adaptation for Naturalness in HMM-Based Speech Synthesis. INTERSPEECH 2016: 357-361 - 2015
- [c6]Gideon Mendels, Erica Cooper, Victor Soto, Julia Hirschberg, Mark J. F. Gales, Kate M. Knill, Anton Ragni, Haipeng Wang:
Improving speech recognition and keyword search for low resource languages using web data. INTERSPEECH 2015: 829-833 - 2014
- [c5]Victor Soto, Erica Cooper, Lidia Mangu, Andrew Rosenberg, Julia Hirschberg:
Rescoring Confusion Networks for Keyword Search. ICASSP 2014: 7088-7092 - 2013
- [c4]Victor Soto, Erica Cooper, Andrew Rosenberg, Julia Hirschberg:
Cross-language phrase boundary detection. ICASSP 2013: 8460-8464
2000 – 2009
- 2009
- [c3]Dogan Can, Erica Cooper, Abhinav Sethy, Christopher M. White, Bhuvana Ramabhadran, Murat Saraclar:
Effect of pronounciations on OOV queries in spoken term detection. ICASSP 2009: 3957-3960 - [c2]Christopher M. White, Abhinav Sethy, Bhuvana Ramabhadran, Patrick J. Wolfe, Erica Cooper, Murat Saraclar, James K. Baker:
Unsupervised pronunciation validation. ICASSP 2009: 4301-4304 - [c1]Dogan Can, Erica Cooper, Arnab Ghoshal, Martin Jansche, Sanjeev Khudanpur, Bhuvana Ramabhadran, Michael Riley, Murat Saraclar, Abhinav Sethy, Morgan Ulinski, Christopher M. White:
Web derived pronunciations for spoken term detection. SIGIR 2009: 83-90
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-07 21:29 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint