default search action

combined dblp search
author search
venue search
publication search

ask others

Mingyang Zhang 0003

> Home > Persons

Person information

affiliation: National University of Singapore, Singapore

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhouZZWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhouZZWL24
Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Accented Text-to-Speech Synthesis With Limited Data. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1699-1711 (2024)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangZRZYL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangZRZYL24
Mingyang Zhang, Yi Zhou, Yi Ren, Chen Zhang, Xiang Yin, Haizhou Li:
RefXVC: Cross-Lingual Voice Conversion With Enhanced Reference Leveraging. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4146-4156 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10844
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10844
Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhiwu Li, Haizhou Li:
Multi-Scale Accent Modeling with Disentangling for Multi-Speaker Multi-Accent TTS Synthesis. CoRR abs/2406.10844 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-05004
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-05004
Zhengyang Chen, Shuai Wang, Mingyang Zhang, Xuechen Liu, Junichi Yamagishi, Yanmin Qian:
Disentangling the Prosody and Semantic Information with Pre-trained Model for In-Context Learning based Zero-Shot Voice Conversion. CoRR abs/2409.05004 (2024)
2023
[j5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/spl/ZhouWZTL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ZhouWZTL23
Yi Zhou, Zhizheng Wu, Mingyang Zhang, Xiaohai Tian, Haizhou Li:
TTS-Guided Training for Accent Conversion Without Parallel Data. IEEE Signal Process. Lett. 30: 533-537 (2023)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/ZhangZWL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ZhangZWL23
Mingyang Zhang, Xuehao Zhou, Zhizheng Wu, Haizhou Li:
Towards Zero-Shot Multi-Speaker Multi-Accent Text-to-Speech Synthesis. IEEE Signal Process. Lett. 30: 947-951 (2023)
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ZhangZWL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ZhangZWL23
Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Zero-shot multi-speaker accent TTS with limited accent data. APSIPA ASC 2023: 1931-1936
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuS0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuS0023
Junchen Lu, Berrak Sisman, Mingyang Zhang, Haizhou Li:
High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units. INTERSPEECH 2023: 5536-5540
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-04816
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-04816
Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Accented Text-to-Speech Synthesis with Limited Data. CoRR abs/2305.04816 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-17005
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-17005
Junchen Lu, Berrak Sisman, Mingyang Zhang, Haizhou Li:
High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units. CoRR abs/2306.17005 (2023)
2022
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuSLZL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuSLZL22
Junchen Lu, Berrak Sisman, Rui Liu, Mingyang Zhang, Haizhou Li:
Visualtts: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over. ICASSP 2022: 8032-8036
2021
[j3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangZZL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangZZL21
Mingyang Zhang, Yi Zhou, Li Zhao, Haizhou Li:
Transfer Learning From Speech Synthesis to Voice Conversion With Non-Parallel Training Data. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1290-1302 (2021)
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/NikonorovSZL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/NikonorovSZL21
Sergey Nikonorov, Berrak Sisman, Mingyang Zhang, Haizhou Li:
DEEPA: A Deep Neural Analyzer for Speech and Singing Vocoding. ASRU 2021: 618-625
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/0003ZZ0LS021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/0003ZZ0LS021
Mingyang Zhang, Xuehao Zhou, Kun Zhou, Rui Liu, Perry Lam, Berrak Sisman, Haizhou Li:
SUTD-NUS System for Blizzard Challenge 2021. Blizzard Challenge 2021
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-03342
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-03342
Junchen Lu, Berrak Sisman, Rui Liu, Mingyang Zhang, Haizhou Li:
VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over. CoRR abs/2110.03342 (2021)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-06434
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06434
Sergey Nikonorov, Berrak Sisman, Mingyang Zhang, Haizhou Li:
DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding. CoRR abs/2110.06434 (2021)
2020
[j2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/speech/ZhangSZL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/ZhangSZL20
Mingyang Zhang, Berrak Sisman, Li Zhao, Haizhou Li:
DeepConversion: Voice conversion with limited parallel training data. Speech Commun. 122: 31-43 (2020)
[c10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/0020TZ0LLS020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/0020TZ0LLS020
Yi Zhou, Xiaohai Tian, Xuehao Zhou, Mingyang Zhang, Grandee Lee, Riu Liu, Berrak Sisman, Haizhou Li:
NUS-HLT System for Blizzard Challenge 2020. Blizzard Challenge / Voice Conversion Challenge 2020
[c9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/Tian0YZD00ZS0020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/Tian0YZD00ZS0020
Xiaohai Tian, Zhichao Wang, Shan Yang, Xinyong Zhou, Hongqiang Du, Yi Zhou, Mingyang Zhang, Kun Zhou, Berrak Sisman, Lei Xie, Haizhou Li:
The NUS & NWPU system for Voice Conversion Challenge 2020. Blizzard Challenge / Voice Conversion Challenge 2020
[c8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouS0020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouS0020
Kun Zhou, Berrak Sisman, Mingyang Zhang, Haizhou Li:
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion. INTERSPEECH 2020: 3416-3420
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2005-07025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-07025
Kun Zhou, Berrak Sisman, Mingyang Zhang, Haizhou Li:
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion. CoRR abs/2005.07025 (2020)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2009-14399
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-14399
Mingyang Zhang, Yi Zhou, Li Zhao, Haizhou Li:
Transfer Learning from Speech Synthesis to Voice Conversion with Non-Parallel Training Data. CoRR abs/2009.14399 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SismanZL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SismanZL19
Berrak Sisman, Mingyang Zhang, Haizhou Li:
Group Sparse Representation With WaveNet Vocoder Adaptation for Spectrum and Prosody Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 27(6): 1085-1097 (2019)
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SismanZDL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SismanZDL19
Berrak Sisman, Mingyang Zhang, Minghui Dong, Haizhou Li:
On the Study of Generative Adversarial Networks for Cross-Lingual Voice Conversion. ASRU 2019: 144-151
[c6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TjandraS0S0019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TjandraS0S0019
Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019. INTERSPEECH 2019: 1118-1122
[c5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/00030F0Y19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/00030F0Y19
Mingyang Zhang, Xin Wang, Fuming Fang, Haizhou Li, Junichi Yamagishi:
Joint Training Framework for Text-to-Speech and Voice Conversion Using Multi-Source Tacotron and WaveNet. INTERSPEECH 2019: 1298-1302
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1903-12389
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-12389
Mingyang Zhang, Xin Wang, Fuming Fang, Haizhou Li, Junichi Yamagishi:
Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet. CoRR abs/1903.12389 (2019)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1905-11449
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-11449
Andros Tjandra, Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge 2019. CoRR abs/1905.11449 (2019)
2018
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ZhangSR0Z18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ZhangSR0Z18
Mingyang Zhang, Berrak Sisman, Sai Sirisha Rallabandi, Haizhou Li, Li Zhao:
Error Reduction Network for DBLSTM-based Voice Conversion. APSIPA 2018: 823-828
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/XiaoY0SH0D018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/XiaoY0SH0D018
Jinba Xiao, Shan Yang, Mingyang Zhang, Berrak Sisman, Dongyan Huang, Lei Xie, Minghui Dong, Haizhou Li:
The I2R-NWPU-NUS Text-to-Speech System for Blizzard Challenge 2018. Blizzard Challenge 2018
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SismanZL18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SismanZL18
Berrak Sisman, Mingyang Zhang, Haizhou Li:
A Voice Conversion Framework with Tandem Feature Sparse Representation and Speaker-Adapted WaveNet Vocoder. INTERSPEECH 2018: 1978-1982
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SismanZS0018
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SismanZS0018
Berrak Sisman, Mingyang Zhang, Sakriani Sakti, Haizhou Li, Satoshi Nakamura:
Adaptive Wavenet Vocoder for Residual Compensation in GAN-Based Voice Conversion. SLT 2018: 282-289

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.