default search action

combined dblp search
author search
venue search
publication search

ask others

Zhizheng Wu 0001

> Home > Persons

Person information

affiliation: Chinese University of Hong Kong, Shenzhen, China
affiliation (former): Meta
affiliation (former): JD.com
affiliation (former): Apple
affiliation (former): University of Edinburgh, UK
affiliation (former): Microsoft Research Asia
affiliation (Ph.D., 2015): Nanyang Technological University, Singapore

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j19]
- view
  authority control:
- export record
  dblp key:
  - journals/cg/XueWWZHW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cg/XueWWZHW24
Liumeng Xue, Chaoren Wang, Mingxuan Wang, Xueyao Zhang, Jun Han, Zhizheng Wu:
SingVisio: Visual analytics of diffusion model for singing voice conversion. Comput. Graph. 124: 104058 (2024)
[j18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhouZZWL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhouZZWL24
Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Accented Text-to-Speech Synthesis With Limited Data. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1699-1711 (2024)
[j17]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/GuZXLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GuZXLW24
Yicheng Gu, Xueyao Zhang, Liumeng Xue, Haizhou Li, Zhizheng Wu:
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoders. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4569-4579 (2024)
[c65]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLLZWLXFS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLLZWLXFS024
Li Wang, Jiaqi Li, Yuhao Luo, Jiahao Zheng, Lei Wang, Hao Li, Ke Xu, Chengfang Fang, Jie Shi, Zhizheng Wu:
ADVSV: An Over-the-Air Adversarial Attack Dataset for Speaker Verification. ICASSP 2024: 4555-4559
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiWXW024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiWXW024
Jiaqi Li, Li Wang, Liumeng Xue, Lei Wang, Zhizheng Wu:
An Initial Investigation of Neural Replay Simulator for Over-The-Air Adversarial Perturbations to Automatic Speaker Verification. ICASSP 2024: 4635-4639
[c63]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GuZX024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GuZX024
Yicheng Gu, Xueyao Zhang, Liumeng Xue, Zhizheng Wu:
Multi-Scale Sub-Band Constant-Q Transform Discriminator for High-Fidelity Vocoder. ICASSP 2024: 10616-10620
[c62]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/JuWS0XYLLST000024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/JuWS0XYLLST000024
Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Eric Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiangyang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao:
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. ICML 2024
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-12264
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-12264
Xianghu Yue, Xiaohai Tian, Malu Zhang, Zhizheng Wu, Haizhou Li:
CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing. CoRR abs/2401.12264 (2024)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-12660
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-12660
Liumeng Xue, Chaoren Wang, Mingxuan Wang, Xueyao Zhang, Jun Han, Zhizheng Wu:
SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion. CoRR abs/2402.12660 (2024)
[i24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-03100
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-03100
Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao:
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. CoRR abs/2403.03100 (2024)
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-17161
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-17161
Yicheng Gu, Xueyao Zhang, Liumeng Xue, Haizhou Li, Zhizheng Wu:
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder. CoRR abs/2404.17161 (2024)
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-13340
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-13340
Junyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu:
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words. CoRR abs/2406.13340 (2024)
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-01494
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-01494
Yiming Zhang, Yicheng Gu, Yanhong Zeng, Zhening Xing, Yuancheng Wang, Zhizheng Wu, Kai Chen:
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. CoRR abs/2407.01494 (2024)
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-02857
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-02857
Zeyu Xie, Xuenan Xu, Zhizheng Wu, Mengyue Wu:
AudioTime: A Temporally-aligned Audio-text Benchmark Dataset. CoRR abs/2407.02857 (2024)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-02869
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-02869
Zeyu Xie, Xuenan Xu, Zhizheng Wu, Mengyue Wu:
PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation. CoRR abs/2407.02869 (2024)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-05361
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-05361
Haorui He, Zengqiang Shang, Chaoren Wang, Xuyuan Li, Yicheng Gu, Hua Hua, Liwei Liu, Chen Yang, Jiaqi Li, Peiyang Shi, Yuancheng Wang, Kai Chen, Pengyuan Zhang, Zhizheng Wu:
Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation. CoRR abs/2407.05361 (2024)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-14340
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-14340
Yinghao Ma, Anders Øland, Anton Ragni, Bleiz Macsen Del Sette, Charalampos Saitis, Chris Donahue, Chenghua Lin, Christos Plachouras, Emmanouil Benetos, Elio Quinton, Elona Shatri, Fabio Morreale, Ge Zhang, György Fazekas, Gus Xia, Huan Zhang, Ilaria Manco, Jiawen Huang, Julien Guinot, Liwei Lin, Luca Marinelli, Max W. Y. Lam, Megha Sharma, Qiuqiang Kong, Roger B. Dannenberg, Ruibin Yuan, Shangda Wu, Shih-Lun Wu, Shuqi Dai, Shun Lei, Shiyin Kang, Simon Dixon, Wenhu Chen, Wenhao Huang, Xingjian Du, Xingwei Qu, Xu Tan, Yizhi Li, Zeyue Tian, Zhiyong Wu, Zhizheng Wu, Ziyang Ma, Ziyu Wang:
Foundation Models for Music: A Survey. CoRR abs/2408.14340 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-00750
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-00750
Yuancheng Wang, Haoyue Zhan, Liwei Liu, Ruihong Zeng, Haotian Guo, Jiachen Zheng, Qiang Zhang, Shunsi Zhang, Zhizheng Wu:
MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer. CoRR abs/2409.00750 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-04016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-04016
Jiaqi Li, Dongmei Wang, Xiaofei Wang, Yao Qian, Long Zhou, Shujie Liu, Midia Yousefi, Canrun Li, Chung-Hsien Tsai, Zhen Xiao, Yanqing Liu, Junkun Chen, Sheng Zhao, Jinyu Li, Zhizheng Wu, Michael Zeng:
Investigating Neural Audio Codecs for Speech Language Model-Based Speech Generation. CoRR abs/2409.04016 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-11308
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-11308
Peizhuo Liu, Li Wang, Renqiang He, Haorui He, Lei Wang, Huadi Zheng, Jie Shi, Tong Xiao, Zhizheng Wu:
SpMis: An Investigation of Synthetic Spoken Misinformation Detection. CoRR abs/2409.11308 (2024)
2023
[j16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/spl/ZhouWZTL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ZhouWZTL23
Yi Zhou, Zhizheng Wu, Mingyang Zhang, Xiaohai Tian, Haizhou Li:
TTS-Guided Training for Accent Conversion Without Parallel Data. IEEE Signal Process. Lett. 30: 533-537 (2023)
[j15]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/ZhangZWL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/ZhangZWL23
Mingyang Zhang, Xuehao Zhou, Zhizheng Wu, Haizhou Li:
Towards Zero-Shot Multi-Speaker Multi-Accent Text-to-Speech Synthesis. IEEE Signal Process. Lett. 30: 947-951 (2023)
[j14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhouWTL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhouWTL23
Yi Zhou, Zhizheng Wu, Xiaohai Tian, Haizhou Li:
Optimization of Cross-Lingual Voice Conversion With Linguistics Losses to Reduce Foreign Accents. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1916-1926 (2023)
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/ZhangZWL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/ZhangZWL23
Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Zero-shot multi-speaker accent TTS with limited accent data. APSIPA ASC 2023: 1931-1936
[c60]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuG0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuG0023
Qinghua Liu, Meng Ge, Zhizheng Wu, Haizhou Li:
PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network. INTERSPEECH 2023: 3719-3723
[c59]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/WangJ0H00Z23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/WangJ0H00Z23
Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao:
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models. NeurIPS 2023
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-00830
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-00830
Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao:
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models. CoRR abs/2304.00830 (2023)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-04816
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-04816
Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Accented Text-to-Speech Synthesis with Limited Data. CoRR abs/2305.04816 (2023)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-06723
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-06723
Qinghua Liu, Meng Ge, Zhizheng Wu, Haizhou Li:
PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network. CoRR abs/2309.06723 (2023)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-05354
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-05354
Jiaqi Li, Li Wang, Liumeng Xue, Lei Wang, Zhizheng Wu:
An Initial Investigation of Neural Replay Simulator for Over-the-Air Adversarial Perturbations to Automatic Speaker Verification. CoRR abs/2310.05354 (2023)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-05369
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-05369
Li Wang, Jiaqi Li, Yuhao Luo, Jiahao Zheng, Lei Wang, Hao Li, Ke Xu, Chengfang Fang, Jie Shi, Zhizheng Wu:
AdvSV: An Over-the-Air Adversarial Attack Dataset for Speaker Verification. CoRR abs/2310.05369 (2023)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-05813
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-05813
Xiangyu Shi, Yuhao Luo, Li Wang, Haorui He, Hao Li, Lei Wang, Zhizheng Wu:
Audio compression-assisted feature extraction for voice replay attack detection. CoRR abs/2310.05813 (2023)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-11160
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-11160
Xueyao Zhang, Yicheng Gu, Haopeng Chen, Zihao Fang, Lexiao Zou, Liumeng Xue, Zhizheng Wu:
Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion. CoRR abs/2310.11160 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-14957
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-14957
Yicheng Gu, Xueyao Zhang, Liumeng Xue, Zhizheng Wu:
Multi-Scale Sub-Band Constant-Q Transform Discriminator for High-Fidelity Vocoder. CoRR abs/2311.14957 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-09911
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09911
Xueyao Zhang, Liumeng Xue, Yuancheng Wang, Yicheng Gu, Xi Chen, Zihao Fang, Haopeng Chen, Lexiao Zou, Chaoren Wang, Jun Han, Kai Chen, Haizhou Li, Zhizheng Wu:
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit. CoRR abs/2312.09911 (2023)
2022
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/ZengW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/ZengW22
Zhiping Zeng, Zhizheng Wu:
Audio Splicing Localization: Can We Accurately Locate the Splicing Tampering? ISCSLP 2022: 120-124
2021
[c57]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouTW021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouTW021
Yi Zhou, Xiaohai Tian, Zhizheng Wu, Haizhou Li:
Cross-Lingual Voice Conversion with a Cycle Consistency Loss on Linguistic Representation. Interspeech 2021: 1374-1378

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c56]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/0001X019
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/0001X019
Zhizheng Wu, Zhihang Xie, Simon King:
The Blizzard Challenge 2019. Blizzard Challenge 2019
[c55]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XueSXXW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XueSXXW19
Liumeng Xue, Wei Song, Guanghui Xu, Lei Xie, Zhizheng Wu:
Building a Mixed-Lingual Neural TTS System with Only Monolingual Data. INTERSPEECH 2019: 2060-2064
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-06063
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-06063
Liumeng Xue, Wei Song, Guanghui Xu, Lei Xie, Zhizheng Wu:
Building a mixed-lingual neural TTS system with only monolingual data. CoRR abs/1904.06063 (2019)
2017
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/WuYKHSSET17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/WuYKHSSET17
Zhizheng Wu, Junichi Yamagishi, Tomi Kinnunen, Cemal Hanilçi, Md. Sahidullah, Aleksandr Sizov, Nicholas W. D. Evans, Massimiliano Todisco:
ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge. IEEE J. Sel. Top. Signal Process. 11(4): 588-604 (2017)
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/TianLWCL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/TianLWCL17
Xiaohai Tian, Siu Wa Lee, Zhizheng Wu, Eng Siong Chng, Haizhou Li:
An Exemplar-Based Approach to Frequency Warping for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1863-1876 (2017)
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/QianCDW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/QianCDW17
Yanmin Qian, Nanxin Chen, Heinrich Dinkel, Zhizheng Wu:
Deep Feature Engineering for Noise Robust Spoofing Detection. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1942-1955 (2017)
[c54]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CapesCCGHHHHLNP17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CapesCCGHHHHLNP17
Tim Capes, Paul Coles, Alistair Conkie, Ladan Golipour, Abie Hadjitarkhani, Qiong Hu, Nancy Huddleston, Melvyn Hunt, Jiangchuan Li, Matthias Neeracher, Kishore Prahallad, Tuomo Raitio, Ramya Rasipuram, Greg Townsend, Becci Williamson, David Winarsky, Zhizheng Wu, Hepeng Zhang:
Siri On-Device Deep Learning-Guided Unit Selection Text-to-Speech System. INTERSPEECH 2017: 4011-4015
2016
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/mta/WuL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/mta/WuL16
Zhizheng Wu, Haizhou Li:
On the study of replay and voice conversion attacks to text-dependent speaker verification. Multim. Tools Appl. 75(9): 5311-5327 (2016)
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/SaratxagaSWHN16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/SaratxagaSWHN16
Ibon Saratxaga, Jon Sánchez, Zhizheng Wu, Inma Hernáez, Eva Navas:
Synthetic speech detection using phase information. Speech Commun. 81: 30-41 (2016)
[j8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/WuLDKKLSSTWY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuLDKKLSSTWY16
Zhizheng Wu, Phillip L. De Leon, Cenk Demiroglu, Ali Khodabakhsh, Simon King, Zhen-Hua Ling, Daisuke Saito, Bryan Stewart, Tomoki Toda, Mirjam Wester, Junichi Yamagishi:
Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance. IEEE ACM Trans. Audio Speech Lang. Process. 24(4): 768-783 (2016)
[j7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/WuK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuK16
Zhizheng Wu, Simon King:
Improving Trajectory Modelling for DNN-Based Speech Synthesis by Using Stacked Bottleneck Features and Minimum Generation Error Training. IEEE ACM Trans. Audio Speech Lang. Process. 24(7): 1255-1265 (2016)
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WeiWX16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WeiWX16
Zhen Wei, Zhizheng Wu, Lei Xie:
Predicting articulatory movement from text using deep architecture with stacked bottleneck features. APSIPA 2016: 1-6
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WuWX16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WuWX16
Jie Wu, Zhizheng Wu, Lei Xie:
On the use of I-vectors and average voice model for voice conversion without parallel data. APSIPA 2016: 1-6
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/YangWX16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/YangWX16
Shan Yang, Zhizheng Wu, Lei Xie:
On the training of DNN-based average voice model for speech synthesis. APSIPA 2016: 1-6
[c50]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/blizzard/MerrittR0W16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/blizzard/MerrittR0W16
Thomas Merritt, Srikanth Ronanki, Zhizheng Wu, Oliver Watts:
The CSTR entry to the Blizzard Challenge 2016. Blizzard Challenge 2016
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TianWXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TianWXCL16
Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Spoofing detection from a feature representation perspective. ICASSP 2016: 2119-2123
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HenterRWWWK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HenterRWWWK16
Gustav Eje Henter, Srikanth Ronanki, Oliver Watts, Mirjam Wester, Zhizheng Wu, Simon King:
Robust TTS duration modelling using DNNS. ICASSP 2016: 5130-5134
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuK16
Zhizheng Wu, Simon King:
Investigating gated recurrent networks for speech synthesis. ICASSP 2016: 5140-5144
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MerrittCWYK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MerrittCWYK16
Thomas Merritt, Robert A. J. Clark, Zhizheng Wu, Junichi Yamagishi, Simon King:
Deep neural network-guided unit selection synthesis. ICASSP 2016: 5145-5149
[c45]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WattsHMWK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WattsHMWK16
Oliver Watts, Gustav Eje Henter, Thomas Merritt, Zhizheng Wu, Simon King:
From HMMS to DNNS: Where do the improvements come from? ICASSP 2016: 5505-5509
[c44]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TodaCSVWWY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TodaCSVWWY16
Tomoki Toda, Ling-Hui Chen, Daisuke Saito, Fernando Villavicencio, Mirjam Wester, Zhizheng Wu, Junichi Yamagishi:
The Voice Conversion Challenge 2016. INTERSPEECH 2016: 1632-1636
[c43]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WesterWY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WesterWY16
Mirjam Wester, Zhizheng Wu, Junichi Yamagishi:
Analysis of the Voice Conversion Challenge 2016 Evaluation Results. INTERSPEECH 2016: 1637-1641
[c42]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TianWXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TianWXCL16
Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
An Investigation of Spoofing Speech Detection Under Additive Noise and Reverberant Conditions. INTERSPEECH 2016: 1715-1719
[c41]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EspicVWK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EspicVWK16
Felipe Espic, Cassia Valentini-Botinhao, Zhizheng Wu, Simon King:
Waveform Generation Based on Signal Reshaping for Statistical Parametric Speech Synthesis. INTERSPEECH 2016: 2263-2267
[c40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RonankiHWK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RonankiHWK16
Srikanth Ronanki, Gustav Eje Henter, Zhizheng Wu, Simon King:
A Template-Based Approach for Speech Synthesis Intonation Generation Using LSTMs. INTERSPEECH 2016: 2463-2467
[c39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AiraksinenBJWKA16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AiraksinenBJWKA16
Manu Airaksinen, Bajibabu Bollepalli, Lauri Juvela, Zhizheng Wu, Simon King, Paavo Alku:
GlottDNN - A Full-Band Glottal Vocoder for Statistical Parametric Speech Synthesis. INTERSPEECH 2016: 2473-2477
[c38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ssw/WesterWY16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/WesterWY16
Mirjam Wester, Zhizheng Wu, Junichi Yamagishi:
Multidimensional scaling of systems in the Voice Conversion Challenge 2016. SSW 2016: 38-43
[c37]
- view
  - electronic edition @ isca-archive.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/ssw/RonankiWWK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/RonankiWWK16
Srikanth Ronanki, Zhizheng Wu, Oliver Watts, Simon King:
A Demonstration of the Merlin Open Source Neural Network Speech Synthesis System. SSW 2016: 124
[c36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ssw/LiWX16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/LiWX16
Mei Li, Zhizheng Wu, Lei Xie:
On the impact of phoneme alignment in DNN-based speech synthesis. SSW 2016: 196-201
[c35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ssw/WuWK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/WuWK16
Zhizheng Wu, Oliver Watts, Simon King:
Merlin: An Open Source Neural Network Speech Synthesis System. SSW 2016: 202-207
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/WuK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/WuK16
Zhizheng Wu, Simon King:
Investigating gated recurrent neural networks for speech synthesis. CoRR abs/1601.02539 (2016)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/TianWXCL16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/TianWXCL16
Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Spoofing detection under noisy conditions: a preliminary investigation and an initial database. CoRR abs/1602.02950 (2016)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/WuK16a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/WuK16a
Zhizheng Wu, Simon King:
Improving Trajectory Modelling for DNN-based Speech Synthesis by using Stacked Bottleneck Features and Minimum Trajectory Error Training. CoRR abs/1602.06727 (2016)
2015
[b1]
- view
  - electronic edition via handle.net
  - no references & citations available
- export record
  dblp key:
  - phd/sg/Wu15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/sg/Wu15
Zhizheng Wu:
Spectral mapping for voice conversion. Nanyang Technological University, Singapore, 2015
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/mta/WuCL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/mta/WuCL15
Zhizheng Wu, Engsiong Chng, Haizhou Li:
Exemplar-based voice conversion using joint nonnegative matrix factorization. Multim. Tools Appl. 74(22): 9943-9958 (2015)
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/WuEKYAL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/WuEKYAL15
Zhizheng Wu, Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, Haizhou Li:
Spoofing and countermeasures for speaker verification: A survey. Speech Commun. 66: 130-153 (2015)
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/tifs/SizovKKWM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tifs/SizovKKWM15
Aleksandr Sizov, Elie Khoury, Tomi Kinnunen, Zhizheng Wu, Sébastien Marcel:
Joint Speaker Verification and Antispoofing in the i-Vector Space. IEEE Trans. Inf. Forensics Secur. 10(4): 821-832 (2015)
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TianWLHCD15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TianWLHCD15
Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Nguyen Quy Hy, Engsiong Chng, Minghui Dong:
Sparse representation for frequency warping based voice conversion. ICASSP 2015: 4235-4239
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuKDYSTK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuKDYSTK15
Zhizheng Wu, Ali Khodabakhsh, Cenk Demiroglu, Junichi Yamagishi, Daisuke Saito, Tomoki Toda, Simon King:
SAS: A speaker verification spoofing database containing diverse attacks. ICASSP 2015: 4440-4444
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuVWK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuVWK15
Zhizheng Wu, Cassia Valentini-Botinhao, Oliver Watts, Simon King:
Deep neural networks employing Multi-Task Learning and stacked bottleneck features for speech synthesis. ICASSP 2015: 4460-4464
[c31]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuK15
Zhizheng Wu, Simon King:
Minimum trajectory error training for deep neural networks, combined with stacked bottleneck features. INTERSPEECH 2015: 309-313
[c30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuWRYSM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuWRYSM15
Qiong Hu, Zhizheng Wu, Korin Richmond, Junichi Yamagishi, Yannis Stylianou, Ranniery Maia:
Fusion of multiple parameterisations for DNN-based sinusoidal speech synthesis with multi-task learning. INTERSPEECH 2015: 854-858
[c29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Valentini-Botinhao15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Valentini-Botinhao15
Cassia Valentini-Botinhao, Zhizheng Wu, Simon King:
Towards minimum perceptual error training for DNN-based speech synthesis. INTERSPEECH 2015: 869-873
[c28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuSVRK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuSVRK15
Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, Simon King:
A study of speaker adaptation for DNN-based speech synthesis. INTERSPEECH 2015: 879-883
[c27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuKEYHSS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuKEYHSS15
Zhizheng Wu, Tomi Kinnunen, Nicholas W. D. Evans, Junichi Yamagishi, Cemal Hanilçi, Md. Sahidullah, Aleksandr Sizov:
ASVspoof 2015: the first automatic speaker verification spoofing and countermeasures challenge. INTERSPEECH 2015: 2037-2041
[c26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WesterWY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WesterWY15
Mirjam Wester, Zhizheng Wu, Junichi Yamagishi:
Human vs machine spoofing detection on wideband and narrowband data. INTERSPEECH 2015: 2047-2051
[c25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MerrittYWWK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MerrittYWWK15
Thomas Merritt, Junichi Yamagishi, Zhizheng Wu, Oliver Watts, Simon King:
Deep neural network context embeddings for model selection in rich-context HMM synthesis. INTERSPEECH 2015: 2207-2211
[c24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WattsWK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WattsWK15
Oliver Watts, Zhizheng Wu, Simon King:
Sentence-level control vectors for deep neural network speech synthesis. INTERSPEECH 2015: 2217-2221
[c23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TianWLHDC15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TianWLHDC15
Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Nguyen Quy Hy, Minghui Dong, Engsiong Chng:
System fusion for high-performance voice conversion. INTERSPEECH 2015: 2759-2763
[c22]
- view
  - electronic edition @ isca-speech.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/interspeech/WuK15a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuK15a
Zhizheng Wu, Tomi Kinnunen:
Automatic speaker verification spoofing and countermeasures (ASVspoof 2015): introductory talk by the organizers. INTERSPEECH 2015
[r2]
- view
  authority control:
- export record
  dblp key:
  - reference/bio/EvansAWK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/reference/bio/EvansAWK15
Nicholas W. D. Evans, Federico Alegre, Zhizheng Wu, Tomi Kinnunen:
Anti-spoofing, Voice Conversion. Encyclopedia of Biometrics 2015: 115-122
[r1]
- view
  authority control:
- export record
  dblp key:
  - reference/bio/EvansAKWY15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/reference/bio/EvansAKWY15
Nicholas W. D. Evans, Federico Alegre, Tomi Kinnunen, Zhizheng Wu, Junichi Yamagishi:
Anti-spoofing, Voice Databases. Encyclopedia of Biometrics 2015: 123-128
2014
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WuVCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WuVCL14
Zhizheng Wu, Tuomas Virtanen, Engsiong Chng, Haizhou Li:
Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 22(10): 1506-1521 (2014)
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WuGCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WuGCL14
Zhizheng Wu, Sheng Gao, Engsiong Chng, Haizhou Li:
A study on replay attack and anti-spoofing for text-dependent speaker verification. APSIPA 2014: 1-5
[c20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KhouryKSWM14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KhouryKSWM14
Elie Khoury, Tomi Kinnunen, Aleksandr Sizov, Zhizheng Wu, Sébastien Marcel:
Introducing i-vectors for joint anti-spoofing and speaker verification. INTERSPEECH 2014: 61-65
[c19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeWDTL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeWDTL14
Siu Wa Lee, Zhizheng Wu, Minghui Dong, Xiaohai Tian, Haizhou Li:
A comparative study of spectral transformation techniques for singing voice synthesis. INTERSPEECH 2014: 2499-2503
[c18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuSL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuSL14
Zhizheng Wu, Chng Eng Siong, Haizhou Li:
Joint nonnegative matrix factorization for exemplar-based voice conversion. INTERSPEECH 2014: 2509-2513
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/TianWLC14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/TianWLC14
Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Engsiong Chng:
Correlation-based frequency warping for voice conversion. ISCSLP 2014: 211-215
[p1]
- view
  authority control:
- export record
  dblp key:
  - series/acvpr/EvansKYWAL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/series/acvpr/EvansKYWAL14
Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi, Zhizheng Wu, Federico Alegre, Phillip L. De Leon:
Speaker Recognition Anti-spoofing. Handbook of Biometric Anti-Spoofing 2014: 125-146
2013
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/TianWC13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/TianWC13
Xiaohai Tian, Zhizheng Wu, Engsiong Chng:
Local partial least square regression for spectral mapping in voice conversion. APSIPA 2013: 1-6
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/apsipa/WuL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WuL13
Zhizheng Wu, Haizhou Li:
Voice conversion and spoofing attack on speaker verification systems. APSIPA 2013: 1-9
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/chinasip/WuCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/chinasip/WuCL13
Zhizheng Wu, Engsiong Chng, Haizhou Li:
Conditional restricted Boltzmann machine for voice conversion. ChinaSIP 2013: 104-108
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuXCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuXCL13
Zhizheng Wu, Xiong Xiao, Engsiong Chng, Haizhou Li:
Synthetic speech detection using temporal modulation feature. ICASSP 2013: 7234-7238
[c12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuLLCKL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuLLCKL13
Zhizheng Wu, Anthony Larcher, Kong-Aik Lee, Engsiong Chng, Tomi Kinnunen, Haizhou Li:
Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints. INTERSPEECH 2013: 950-954
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuVKCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuVKCL13
Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Exemplar-based unit selection for voice conversion utilizing temporal information. INTERSPEECH 2013: 3057-3061
[c10]
- view
  - electronic edition @ isca-archive.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/ssw/WuVKCL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssw/WuVKCL13
Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, Haizhou Li:
Exemplar-based voice conversion using non-negative spectrogram deconvolution. SSW 2013: 201-206
2012
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/WuKCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/WuKCL12
Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Mixture of Factor Analyzers Using Priors From Non-Parallel Speech for Voice Conversion. IEEE Signal Process. Lett. 19(12): 914-917 (2012)
[c9]
- view
  - electronic edition @ ieee.org
  - no references & citations available
- export record
  dblp key:
  - conf/apsipa/WuKCLA12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/apsipa/WuKCLA12
Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li, Eliathamby Ambikairajah:
A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case. APSIPA 2012: 1-5
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KinnunenWLSCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KinnunenWLSCL12
Tomi Kinnunen, Zhizheng Wu, Kong-Aik Lee, Filip Sedlak, Engsiong Chng, Haizhou Li:
Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech. ICASSP 2012: 4401-4404
[c7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuSL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuSL12
Zhizheng Wu, Chng Eng Siong, Haizhou Li:
Detecting Converted Speech and Natural Speech for anti-Spoofing Attack in Speaker Recognition. INTERSPEECH 2012: 1700-1703
2011
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/QianWGS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/QianWGS11
Yao Qian, Zhizheng Wu, Boyang Gao, Frank K. Soong:
Improved Prosody Generation by Maximizing Joint Probability of State and Longer Units. IEEE Trans. Speech Audio Process. 19(6): 1702-1710 (2011)
2010
[c6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuKCL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuKCL10
Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Text-independent F0 transformation with non-parallel data for voice conversion. INTERSPEECH 2010: 1732-1735
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/QianWMS10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/QianWMS10
Yao Qian, Zhizheng Wu, Xuezhe Ma, Frank K. Soong:
Automatic prosody prediction and detection with Conditional Random Field (CRF) models. ISCSLP 2010: 135-138

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/QianWS09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/QianWS09
Yao Qian, Zhizheng Wu, Frank K. Soong:
Improved prosody generation by maximizing joint likelihood of state and longer units. ICASSP 2009: 3781-3784
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/QianSWW09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QianSWW09
Yao Qian, Frank K. Soong, Miaomiao Wang, Zhizheng Wu:
A minimum v/u error approach to F0 generation in HMM-based TTS. INTERSPEECH 2009: 408-411
2008
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoQWS08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoQWS08
Boyang Gao, Yao Qian, Zhizheng Wu, Frank K. Soong:
Duration refinement by jointly optimizing state and longer unit likelihood. INTERSPEECH 2008: 2266-2269
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/WuQSZ08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/WuQSZ08
Zhizheng Wu, Yao Qian, Frank K. Soong, Bo Zhang:
Modeling and Generating Tone Contour with Phrase Intonation for Mandarin Chinese Speech. ISCSLP 2008: 121-124

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.