default search action
Kun Zhou 0003
Person information
- affiliation: Alibaba DAMO Academy, Singapore
- affiliation (PhD 2023): National University of Singapore, Department of Electrical and Computer Engineering, Singapore
Other persons with the same name
- Kun Zhou — disambiguation page
- Kun Zhou 0001 — University of Zhejiang, State Key Laboratory of CAD&CG, Hangzhou, China (and 1 more)
- Kun Zhou 0002 — Renmin University of China, School of Information, Beijing, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c17]Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance. ICASSP 2024: 326-330 - [c16]Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jia Qi Yip, Dianwen Ng, Bin Ma:
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation. ICASSP 2024: 10356-10360 - [c15]Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis. ICASSP 2024: 10601-10605 - [c14]Zongyang Du, Junchen Lu, Kun Zhou, Lakshmish Kaushik, Berrak Sisman:
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with A Conditional Diffusion Model. Odyssey 2024: 172-179 - [c13]Kun Zhou, Berrak Sisman, Carlos Busso, Bin Ma, Haizhou Li:
Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion. Odyssey 2024: 180-186 - [i20]Zongyang Du, Junchen Lu, Kun Zhou, Lakshmish Kaushik, Berrak Sisman:
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model. CoRR abs/2405.01730 (2024) - [i19]Kun Zhou, Shengkui Zhao, Yukun Ma, Chong Zhang, Hao Wang, Dianwen Ng, Chongjia Ni, Trung Hieu Nguyen, Jia Qi Yip, Bin Ma:
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis. CoRR abs/2406.02009 (2024) - [i18]Xin Jing, Kun Zhou, Andreas Triantafyllopoulos, Björn W. Schuller:
Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models. CoRR abs/2409.06451 (2024) - [i17]Kun Zhou, You Zhang, Shengkui Zhao, Hao Wang, Zexu Pan, Dianwen Ng, Chong Zhang, Chongjia Ni, Yukun Ma, Trung Hieu Nguyen, Jia Qi Yip, Bin Ma:
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions. CoRR abs/2409.16681 (2024) - 2023
- [j3]Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Emotion Intensity and its Control for Emotional Voice Conversion. IEEE Trans. Affect. Comput. 14(1): 31-48 (2023) - [j2]Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Speech Synthesis With Mixed Emotions. IEEE Trans. Affect. Comput. 14(4): 3120-3134 (2023) - [i16]Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for enhanced speech separation performance. CoRR abs/2309.12608 (2023) - [i15]Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jia Qi Yip, Dianwen Ng, Bin Ma:
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation. CoRR abs/2312.11825 (2023) - 2022
- [j1]Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Emotional voice conversion: Theory, databases and ESD. Speech Commun. 137: 1-18 (2022) - [c12]Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion. INTERSPEECH 2022: 2603-2607 - [i14]Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Emotion Intensity and its Control for Emotional Voice Conversion. CoRR abs/2201.03967 (2022) - [i13]Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Speech Synthesis with Mixed Emotions. CoRR abs/2208.05890 (2022) - [i12]Kun Zhou, Berrak Sisman, Carlos Busso, Haizhou Li:
Mixed Emotion Modelling for Emotional Voice Conversion. CoRR abs/2210.13756 (2022) - 2021
- [c11]Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer. ASRU 2021: 594-601 - [c10]Mingyang Zhang, Xuehao Zhou, Kun Zhou, Rui Liu, Perry Lam, Berrak Sisman, Haizhou Li:
SUTD-NUS System for Blizzard Challenge 2021. Blizzard Challenge 2021 - [c9]Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset. ICASSP 2021: 920-924 - [c8]Kun Zhou, Berrak Sisman, Haizhou Li:
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-Stage Sequence-to-Sequence Training. Interspeech 2021: 811-815 - [c7]Kun Zhou, Berrak Sisman, Haizhou Li:
Vaw-Gan For Disentanglement And Recomposition Of Emotional Elements In Speech. SLT 2021: 415-422 - [i11]Kun Zhou, Berrak Sisman, Haizhou Li:
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training. CoRR abs/2103.16809 (2021) - [i10]Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Emotional Voice Conversion: Theory, Databases and ESD. CoRR abs/2105.14762 (2021) - [i9]Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer. CoRR abs/2107.03748 (2021) - [i8]Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity. CoRR abs/2110.10326 (2021) - 2020
- [c6]Zongyang Du, Kun Zhou, Berrak Sisman, Haizhou Li:
Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN. APSIPA 2020: 507-513 - [c5]Junchen Lu, Kun Zhou, Berrak Sisman, Haizhou Li:
VAW-GAN for Singing Voice Conversion with Non-parallel Training Data. APSIPA 2020: 514-519 - [c4]Xiaohai Tian, Zhichao Wang, Shan Yang, Xinyong Zhou, Hongqiang Du, Yi Zhou, Mingyang Zhang, Kun Zhou, Berrak Sisman, Lei Xie, Haizhou Li:
The NUS & NWPU system for Voice Conversion Challenge 2020. Blizzard Challenge / Voice Conversion Challenge 2020 - [c3]Kun Zhou, Berrak Sisman, Mingyang Zhang, Haizhou Li:
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion. INTERSPEECH 2020: 3416-3420 - [c2]Kun Zhou, Berrak Sisman, Haizhou Li:
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data. Odyssey 2020: 230-237 - [i7]Kun Zhou, Berrak Sisman, Haizhou Li:
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data. CoRR abs/2002.00198 (2020) - [i6]Kun Zhou, Berrak Sisman, Mingyang Zhang, Haizhou Li:
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion. CoRR abs/2005.07025 (2020) - [i5]Junchen Lu, Kun Zhou, Berrak Sisman, Haizhou Li:
VAW-GAN for Singing Voice Conversion with Non-parallel Training Data. CoRR abs/2008.03992 (2020) - [i4]Zongyang Du, Kun Zhou, Berrak Sisman, Haizhou Li:
Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN. CoRR abs/2008.04562 (2020) - [i3]Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset. CoRR abs/2010.14794 (2020) - [i2]Kun Zhou, Berrak Sisman, Haizhou Li:
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech. CoRR abs/2011.02314 (2020)
2010 – 2019
- 2019
- [c1]Emre Yilmaz, Adem Derinel, Kun Zhou, Henk van den Heuvel, Niko Brummer, Haizhou Li, David A. van Leeuwen:
Large-Scale Speaker Diarization of Radio Broadcast Archives. INTERSPEECH 2019: 411-415 - [i1]Emre Yilmaz, Adem Derinel, Kun Zhou, Henk van den Heuvel, Niko Brummer, Haizhou Li, David A. van Leeuwen:
Large-Scale Speaker Diarization of Radio Broadcast Archives. CoRR abs/1906.07955 (2019)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-19 20:49 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint