Kun Zhou 0003

Person information

affiliation: Alibaba DAMO Academy, Singapore
affiliation (PhD 2023): National University of Singapore, Department of Electrical and Computer Engineering, Singapore

2020 – today

2024
[c17]
Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for Enhanced Speech Separation Performance. ICASSP 2024: 326-330
[c16]
Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jia Qi Yip, Dianwen Ng, Bin Ma:
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation. ICASSP 2024: 10356-10360
[c15]
Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li:
Hierarchical Emotion Prediction and Control in Text-to-Speech Synthesis. ICASSP 2024: 10601-10605
[c14]
Zongyang Du, Junchen Lu, Kun Zhou, Lakshmish Kaushik, Berrak Sisman:
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with A Conditional Diffusion Model. Odyssey 2024: 172-179
[c13]
Kun Zhou, Berrak Sisman, Carlos Busso, Bin Ma, Haizhou Li:
Mixed-EVC: Mixed Emotion Synthesis and Control in Voice Conversion. Odyssey 2024: 180-186
[i20]
Zongyang Du, Junchen Lu, Kun Zhou, Lakshmish Kaushik, Berrak Sisman:
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model. CoRR abs/2405.01730 (2024)
[i19]
Kun Zhou, Shengkui Zhao, Yukun Ma, Chong Zhang, Hao Wang, Dianwen Ng, Chongjia Ni, Trung Hieu Nguyen, Jia Qi Yip, Bin Ma:
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis. CoRR abs/2406.02009 (2024)
[i18]
Xin Jing, Kun Zhou, Andreas Triantafyllopoulos, Björn W. Schuller:
Enhancing Emotional Text-to-Speech Controllability with Natural Language Guidance through Contrastive Learning and Diffusion Models. CoRR abs/2409.06451 (2024)
[i17]
Kun Zhou, You Zhang, Shengkui Zhao, Hao Wang, Zexu Pan, Dianwen Ng, Chong Zhang, Chongjia Ni, Yukun Ma, Trung Hieu Nguyen, Jia Qi Yip, Bin Ma:
Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions. CoRR abs/2409.16681 (2024)
2023
[j3]
Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Emotion Intensity and its Control for Emotional Voice Conversion. IEEE Trans. Affect. Comput. 14(1): 31-48 (2023)
[j2]
Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Speech Synthesis With Mixed Emotions. IEEE Trans. Affect. Comput. 14(4): 3120-3134 (2023)
[i16]
Jia Qi Yip, Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Dianwen Ng, Eng Siong Chng, Bin Ma:
SPGM: Prioritizing Local Features for enhanced speech separation performance. CoRR abs/2309.12608 (2023)
[i15]
Shengkui Zhao, Yukun Ma, Chongjia Ni, Chong Zhang, Hao Wang, Trung Hieu Nguyen, Kun Zhou, Jia Qi Yip, Dianwen Ng, Bin Ma:
MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation. CoRR abs/2312.11825 (2023)
2022
[j1]
Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Emotional voice conversion: Theory, databases and ESD. Speech Commun. 137: 1-18 (2022)
[c12]
Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion. INTERSPEECH 2022: 2603-2607
[i14]
Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Emotion Intensity and its Control for Emotional Voice Conversion. CoRR abs/2201.03967 (2022)
[i13]
Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li:
Speech Synthesis with Mixed Emotions. CoRR abs/2208.05890 (2022)
[i12]
Kun Zhou, Berrak Sisman, Carlos Busso, Haizhou Li:
Mixed Emotion Modelling for Emotional Voice Conversion. CoRR abs/2210.13756 (2022)
2021
[c11]
Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer. ASRU 2021: 594-601
[c10]
Mingyang Zhang, Xuehao Zhou, Kun Zhou, Rui Liu, Perry Lam, Berrak Sisman, Haizhou Li:
SUTD-NUS System for Blizzard Challenge 2021. Blizzard Challenge 2021
[c9]
Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Seen and Unseen Emotional Style Transfer for Voice Conversion with A New Emotional Speech Dataset. ICASSP 2021: 920-924
[c8]
Kun Zhou, Berrak Sisman, Haizhou Li:
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-Stage Sequence-to-Sequence Training. Interspeech 2021: 811-815
[c7]
Kun Zhou, Berrak Sisman, Haizhou Li:
Vaw-Gan For Disentanglement And Recomposition Of Emotional Elements In Speech. SLT 2021: 415-422
[i11]
Kun Zhou, Berrak Sisman, Haizhou Li:
Limited Data Emotional Voice Conversion Leveraging Text-to-Speech: Two-stage Sequence-to-Sequence Training. CoRR abs/2103.16809 (2021)
[i10]
Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Emotional Voice Conversion: Theory, Databases and ESD. CoRR abs/2105.14762 (2021)
[i9]
Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer. CoRR abs/2107.03748 (2021)
[i8]
Zongyang Du, Berrak Sisman, Kun Zhou, Haizhou Li:
Identity Conversion for Emotional Speakers: A Study for Disentanglement of Emotion Style and Speaker Identity. CoRR abs/2110.10326 (2021)
2020
[c6]
Zongyang Du, Kun Zhou, Berrak Sisman, Haizhou Li:
Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN. APSIPA 2020: 507-513
[c5]
Junchen Lu, Kun Zhou, Berrak Sisman, Haizhou Li:
VAW-GAN for Singing Voice Conversion with Non-parallel Training Data. APSIPA 2020: 514-519
[c4]
Xiaohai Tian, Zhichao Wang, Shan Yang, Xinyong Zhou, Hongqiang Du, Yi Zhou, Mingyang Zhang, Kun Zhou, Berrak Sisman, Lei Xie, Haizhou Li:
The NUS & NWPU system for Voice Conversion Challenge 2020. Blizzard Challenge / Voice Conversion Challenge 2020
[c3]
Kun Zhou, Berrak Sisman, Mingyang Zhang, Haizhou Li:
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion. INTERSPEECH 2020: 3416-3420
[c2]
Kun Zhou, Berrak Sisman, Haizhou Li:
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data. Odyssey 2020: 230-237
[i7]
Kun Zhou, Berrak Sisman, Haizhou Li:
Transforming Spectrum and Prosody for Emotional Voice Conversion with Non-Parallel Training Data. CoRR abs/2002.00198 (2020)
[i6]
Kun Zhou, Berrak Sisman, Mingyang Zhang, Haizhou Li:
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion. CoRR abs/2005.07025 (2020)
[i5]
Junchen Lu, Kun Zhou, Berrak Sisman, Haizhou Li:
VAW-GAN for Singing Voice Conversion with Non-parallel Training Data. CoRR abs/2008.03992 (2020)
[i4]
Zongyang Du, Kun Zhou, Berrak Sisman, Haizhou Li:
Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN. CoRR abs/2008.04562 (2020)
[i3]
Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li:
Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset. CoRR abs/2010.14794 (2020)
[i2]
Kun Zhou, Berrak Sisman, Haizhou Li:
VAW-GAN for Disentanglement and Recomposition of Emotional Elements in Speech. CoRR abs/2011.02314 (2020)

2010 – 2019

2019
[c1]
Emre Yilmaz, Adem Derinel, Kun Zhou, Henk van den Heuvel, Niko Brummer, Haizhou Li, David A. van Leeuwen:
Large-Scale Speaker Diarization of Radio Broadcast Archives. INTERSPEECH 2019: 411-415
[i1]
Emre Yilmaz, Adem Derinel, Kun Zhou, Henk van den Heuvel, Niko Brummer, Haizhou Li, David A. van Leeuwen:
Large-Scale Speaker Diarization of Radio Broadcast Archives. CoRR abs/1906.07955 (2019)
