Zhizheng Wu 0001
Person information
- affiliation: Chinese University of Hong Kong-Shenzhen (CUHK-SZ), Shenzhen, China
- affiliation (former): Meta
- affiliation (former): JD.com
- affiliation (former): Apple
- affiliation (former): University of Edinburgh, UK
- affiliation (former): Microsoft Research Asia
- affiliation (Ph.D., 2015): Nanyang Technological University, Singapore
Other persons with the same name
- Zhizheng Wu 0002 — Shanghai University, School of Mechatronic Engineering and Automation, China
- Zhizheng Wu 0003 — Southeast University, Nanjing, China
2020 – today
- 2024
- [j19]Liumeng Xue, Chaoren Wang, Mingxuan Wang, Xueyao Zhang, Jun Han, Zhizheng Wu:
SingVisio: Visual analytics of diffusion model for singing voice conversion. Comput. Graph. 124: 104058 (2024)
- [j18]Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Accented Text-to-Speech Synthesis With Limited Data. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1699-1711 (2024)
- [j17]Yicheng Gu, Xueyao Zhang, Liumeng Xue, Haizhou Li, Zhizheng Wu:
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoders. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4569-4579 (2024)
- [c65]Li Wang, Jiaqi Li, Yuhao Luo, Jiahao Zheng, Lei Wang, Hao Li, Ke Xu, Chengfang Fang, Jie Shi, Zhizheng Wu:
ADVSV: An Over-the-Air Adversarial Attack Dataset for Speaker Verification. ICASSP 2024: 4555-4559
- [c64]Jiaqi Li, Li Wang, Liumeng Xue, Lei Wang, Zhizheng Wu:
An Initial Investigation of Neural Replay Simulator for Over-The-Air Adversarial Perturbations to Automatic Speaker Verification. ICASSP 2024: 4635-4639
- [c63]Yicheng Gu, Xueyao Zhang, Liumeng Xue, Zhizheng Wu:
Multi-Scale Sub-Band Constant-Q Transform Discriminator for High-Fidelity Vocoder. ICASSP 2024: 10616-10620
- [c62]Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Eric Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiangyang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao:
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. ICML 2024
- [i27]Xianghu Yue, Xiaohai Tian, Malu Zhang, Zhizheng Wu, Haizhou Li:
CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing. CoRR abs/2401.12264 (2024)
- [i26]Liumeng Xue, Chaoren Wang, Mingxuan Wang, Xueyao Zhang, Jun Han, Zhizheng Wu:
SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion. CoRR abs/2402.12660 (2024)
- [i25]Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao:
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. CoRR abs/2403.03100 (2024)
- [i24]Yicheng Gu, Xueyao Zhang, Liumeng Xue, Haizhou Li, Zhizheng Wu:
An Investigation of Time-Frequency Representation Discriminators for High-Fidelity Vocoder. CoRR abs/2404.17161 (2024)
- [i23]Junyi Ao, Yuancheng Wang, Xiaohai Tian, Dekun Chen, Jun Zhang, Lu Lu, Yuxuan Wang, Haizhou Li, Zhizheng Wu:
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words. CoRR abs/2406.13340 (2024)
- [i22]Yiming Zhang, Yicheng Gu, Yanhong Zeng, Zhening Xing, Yuancheng Wang, Zhizheng Wu, Kai Chen:
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. CoRR abs/2407.01494 (2024)
- [i21]Zeyu Xie, Xuenan Xu, Zhizheng Wu, Mengyue Wu:
AudioTime: A Temporally-aligned Audio-text Benchmark Dataset. CoRR abs/2407.02857 (2024)
- [i20]Zeyu Xie, Xuenan Xu, Zhizheng Wu, Mengyue Wu:
PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation. CoRR abs/2407.02869 (2024)
- [i19]Haorui He, Zengqiang Shang, Chaoren Wang, Xuyuan Li, Yicheng Gu, Hua Hua, Liwei Liu, Chen Yang, Jiaqi Li, Peiyang Shi, Yuancheng Wang, Kai Chen, Pengyuan Zhang, Zhizheng Wu:
Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation. CoRR abs/2407.05361 (2024)
- [i18]Yinghao Ma, Anders Øland, Anton Ragni, Bleiz Macsen Del Sette, Charalampos Saitis, Chris Donahue, Chenghua Lin, Christos Plachouras, Emmanouil Benetos, Elio Quinton, Elona Shatri, Fabio Morreale, Ge Zhang, György Fazekas, Gus Xia, Huan Zhang, Ilaria Manco, Jiawen Huang, Julien Guinot, Liwei Lin, Luca Marinelli, Max W. Y. Lam, Megha Sharma, Qiuqiang Kong, Roger B. Dannenberg, Ruibin Yuan, Shangda Wu, Shih-Lun Wu, Shuqi Dai, Shun Lei, Shiyin Kang, Simon Dixon, Wenhu Chen, Wenhao Huang, Xingjian Du, Xingwei Qu, Xu Tan, Yizhi Li, Zeyue Tian, Zhiyong Wu, Zhizheng Wu, Ziyang Ma, Ziyu Wang:
Foundation Models for Music: A Survey. CoRR abs/2408.14340 (2024)
- [i17]Yuancheng Wang, Haoyue Zhan, Liwei Liu, Ruihong Zeng, Haotian Guo, Jiachen Zheng, Qiang Zhang, Shunsi Zhang, Zhizheng Wu:
MaskGCT: Zero-Shot Text-to-Speech with Masked Generative Codec Transformer. CoRR abs/2409.00750 (2024)
- [i16]Jiaqi Li, Dongmei Wang, Xiaofei Wang, Yao Qian, Long Zhou, Shujie Liu, Midia Yousefi, Canrun Li, Chung-Hsien Tsai, Zhen Xiao, Yanqing Liu, Junkun Chen, Sheng Zhao, Jinyu Li, Zhizheng Wu, Michael Zeng:
Investigating Neural Audio Codecs for Speech Language Model-Based Speech Generation. CoRR abs/2409.04016 (2024)
- [i15]Peizhuo Liu, Li Wang, Renqiang He, Haorui He, Lei Wang, Huadi Zheng, Jie Shi, Tong Xiao, Zhizheng Wu:
SpMis: An Investigation of Synthetic Spoken Misinformation Detection. CoRR abs/2409.11308 (2024)
- [i14]Junyan Ye, Baichuan Zhou, Zilong Huang, Junan Zhang, Tianyi Bai, Hengrui Kang, Jun He, Honglin Lin, Zihao Wang, Tong Wu, Zhizheng Wu, Yiping Chen, Dahua Lin, Conghui He, Weijia Li:
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models. CoRR abs/2410.09732 (2024)
- 2023
- [j16]Yi Zhou, Zhizheng Wu, Mingyang Zhang, Xiaohai Tian, Haizhou Li:
TTS-Guided Training for Accent Conversion Without Parallel Data. IEEE Signal Process. Lett. 30: 533-537 (2023)
- [j15]Mingyang Zhang, Xuehao Zhou, Zhizheng Wu, Haizhou Li:
Towards Zero-Shot Multi-Speaker Multi-Accent Text-to-Speech Synthesis. IEEE Signal Process. Lett. 30: 947-951 (2023)
- [j14]Yi Zhou, Zhizheng Wu, Xiaohai Tian, Haizhou Li:
Optimization of Cross-Lingual Voice Conversion With Linguistics Losses to Reduce Foreign Accents. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1916-1926 (2023)
- [c61]Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Zero-shot multi-speaker accent TTS with limited accent data. APSIPA ASC 2023: 1931-1936
- [c60]Qinghua Liu, Meng Ge, Zhizheng Wu, Haizhou Li:
PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network. INTERSPEECH 2023: 3719-3723
- [c59]Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao:
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models. NeurIPS 2023
- [i13]Yuancheng Wang, Zeqian Ju, Xu Tan, Lei He, Zhizheng Wu, Jiang Bian, Sheng Zhao:
AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models. CoRR abs/2304.00830 (2023)
- [i12]Xuehao Zhou, Mingyang Zhang, Yi Zhou, Zhizheng Wu, Haizhou Li:
Accented Text-to-Speech Synthesis with Limited Data. CoRR abs/2305.04816 (2023)
- [i11]Qinghua Liu, Meng Ge, Zhizheng Wu, Haizhou Li:
PIAVE: A Pose-Invariant Audio-Visual Speaker Extraction Network. CoRR abs/2309.06723 (2023)
- [i10]Jiaqi Li, Li Wang, Liumeng Xue, Lei Wang, Zhizheng Wu:
An Initial Investigation of Neural Replay Simulator for Over-the-Air Adversarial Perturbations to Automatic Speaker Verification. CoRR abs/2310.05354 (2023)
- [i9]Li Wang, Jiaqi Li, Yuhao Luo, Jiahao Zheng, Lei Wang, Hao Li, Ke Xu, Chengfang Fang, Jie Shi, Zhizheng Wu:
AdvSV: An Over-the-Air Adversarial Attack Dataset for Speaker Verification. CoRR abs/2310.05369 (2023)
- [i8]Xiangyu Shi, Yuhao Luo, Li Wang, Haorui He, Hao Li, Lei Wang, Zhizheng Wu:
Audio compression-assisted feature extraction for voice replay attack detection. CoRR abs/2310.05813 (2023)
- [i7]Xueyao Zhang, Yicheng Gu, Haopeng Chen, Zihao Fang, Lexiao Zou, Liumeng Xue, Zhizheng Wu:
Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion. CoRR abs/2310.11160 (2023)
- [i6]Yicheng Gu, Xueyao Zhang, Liumeng Xue, Zhizheng Wu:
Multi-Scale Sub-Band Constant-Q Transform Discriminator for High-Fidelity Vocoder. CoRR abs/2311.14957 (2023)
- [i5]Xueyao Zhang, Liumeng Xue, Yuancheng Wang, Yicheng Gu, Xi Chen, Zihao Fang, Haopeng Chen, Lexiao Zou, Chaoren Wang, Jun Han, Kai Chen, Haizhou Li, Zhizheng Wu:
Amphion: An Open-Source Audio, Music and Speech Generation Toolkit. CoRR abs/2312.09911 (2023)
- 2022
- [c58]Zhiping Zeng, Zhizheng Wu:
Audio Splicing Localization: Can We Accurately Locate the Splicing Tampering? ISCSLP 2022: 120-124
- 2021
- [c57]Yi Zhou, Xiaohai Tian, Zhizheng Wu, Haizhou Li:
Cross-Lingual Voice Conversion with a Cycle Consistency Loss on Linguistic Representation. Interspeech 2021: 1374-1378
2010 – 2019
- 2019
- [c56]Zhizheng Wu, Zhihang Xie, Simon King:
The Blizzard Challenge 2019. Blizzard Challenge 2019
- [c55]Liumeng Xue, Wei Song, Guanghui Xu, Lei Xie, Zhizheng Wu:
Building a Mixed-Lingual Neural TTS System with Only Monolingual Data. INTERSPEECH 2019: 2060-2064
- [i4]Liumeng Xue, Wei Song, Guanghui Xu, Lei Xie, Zhizheng Wu:
Building a mixed-lingual neural TTS system with only monolingual data. CoRR abs/1904.06063 (2019)
- 2017
- [j13]Zhizheng Wu, Junichi Yamagishi, Tomi Kinnunen, Cemal Hanilçi, Md. Sahidullah, Aleksandr Sizov, Nicholas W. D. Evans, Massimiliano Todisco:
ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge. IEEE J. Sel. Top. Signal Process. 11(4): 588-604 (2017)
- [j12]Xiaohai Tian, Siu Wa Lee, Zhizheng Wu, Eng Siong Chng, Haizhou Li:
An Exemplar-Based Approach to Frequency Warping for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1863-1876 (2017)
- [j11]Yanmin Qian, Nanxin Chen, Heinrich Dinkel, Zhizheng Wu:
Deep Feature Engineering for Noise Robust Spoofing Detection. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1942-1955 (2017)
- [c54]Tim Capes, Paul Coles, Alistair Conkie, Ladan Golipour, Abie Hadjitarkhani, Qiong Hu, Nancy Huddleston, Melvyn Hunt, Jiangchuan Li, Matthias Neeracher, Kishore Prahallad, Tuomo Raitio, Ramya Rasipuram, Greg Townsend, Becci Williamson, David Winarsky, Zhizheng Wu, Hepeng Zhang:
Siri On-Device Deep Learning-Guided Unit Selection Text-to-Speech System. INTERSPEECH 2017: 4011-4015
- 2016
- [j10]Zhizheng Wu, Haizhou Li:
On the study of replay and voice conversion attacks to text-dependent speaker verification. Multim. Tools Appl. 75(9): 5311-5327 (2016)
- [j9]Ibon Saratxaga, Jon Sánchez, Zhizheng Wu, Inma Hernáez, Eva Navas:
Synthetic speech detection using phase information. Speech Commun. 81: 30-41 (2016)
- [j8]Zhizheng Wu, Phillip L. De Leon, Cenk Demiroglu, Ali Khodabakhsh, Simon King, Zhen-Hua Ling, Daisuke Saito, Bryan Stewart, Tomoki Toda, Mirjam Wester, Junichi Yamagishi:
Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance. IEEE ACM Trans. Audio Speech Lang. Process. 24(4): 768-783 (2016)
- [j7]Zhizheng Wu, Simon King:
Improving Trajectory Modelling for DNN-Based Speech Synthesis by Using Stacked Bottleneck Features and Minimum Generation Error Training. IEEE ACM Trans. Audio Speech Lang. Process. 24(7): 1255-1265 (2016)
- [c53]Zhen Wei, Zhizheng Wu, Lei Xie:
Predicting articulatory movement from text using deep architecture with stacked bottleneck features. APSIPA 2016: 1-6
- [c52]Jie Wu, Zhizheng Wu, Lei Xie:
On the use of I-vectors and average voice model for voice conversion without parallel data. APSIPA 2016: 1-6
- [c51]Shan Yang, Zhizheng Wu, Lei Xie:
On the training of DNN-based average voice model for speech synthesis. APSIPA 2016: 1-6
- [c50]Thomas Merritt, Srikanth Ronanki, Zhizheng Wu, Oliver Watts:
The CSTR entry to the Blizzard Challenge 2016. Blizzard Challenge 2016
- [c49]Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Spoofing detection from a feature representation perspective. ICASSP 2016: 2119-2123
- [c48]Gustav Eje Henter, Srikanth Ronanki, Oliver Watts, Mirjam Wester, Zhizheng Wu, Simon King:
Robust TTS duration modelling using DNNS. ICASSP 2016: 5130-5134
- [c47]Zhizheng Wu, Simon King:
Investigating gated recurrent networks for speech synthesis. ICASSP 2016: 5140-5144
- [c46]Thomas Merritt, Robert A. J. Clark, Zhizheng Wu, Junichi Yamagishi, Simon King:
Deep neural network-guided unit selection synthesis. ICASSP 2016: 5145-5149
- [c45]Oliver Watts, Gustav Eje Henter, Thomas Merritt, Zhizheng Wu, Simon King:
From HMMS to DNNS: Where do the improvements come from? ICASSP 2016: 5505-5509
- [c44]Tomoki Toda, Ling-Hui Chen, Daisuke Saito, Fernando Villavicencio, Mirjam Wester, Zhizheng Wu, Junichi Yamagishi:
The Voice Conversion Challenge 2016. INTERSPEECH 2016: 1632-1636
- [c43]Mirjam Wester, Zhizheng Wu, Junichi Yamagishi:
Analysis of the Voice Conversion Challenge 2016 Evaluation Results. INTERSPEECH 2016: 1637-1641
- [c42]Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
An Investigation of Spoofing Speech Detection Under Additive Noise and Reverberant Conditions. INTERSPEECH 2016: 1715-1719
- [c41]Felipe Espic, Cassia Valentini-Botinhao, Zhizheng Wu, Simon King:
Waveform Generation Based on Signal Reshaping for Statistical Parametric Speech Synthesis. INTERSPEECH 2016: 2263-2267
- [c40]Srikanth Ronanki, Gustav Eje Henter, Zhizheng Wu, Simon King:
A Template-Based Approach for Speech Synthesis Intonation Generation Using LSTMs. INTERSPEECH 2016: 2463-2467
- [c39]Manu Airaksinen, Bajibabu Bollepalli, Lauri Juvela, Zhizheng Wu, Simon King, Paavo Alku:
GlottDNN - A Full-Band Glottal Vocoder for Statistical Parametric Speech Synthesis. INTERSPEECH 2016: 2473-2477
- [c38]Mirjam Wester, Zhizheng Wu, Junichi Yamagishi:
Multidimensional scaling of systems in the Voice Conversion Challenge 2016. SSW 2016: 38-43
- [c37]Srikanth Ronanki, Zhizheng Wu, Oliver Watts, Simon King:
A Demonstration of the Merlin Open Source Neural Network Speech Synthesis System. SSW 2016: 124
- [c36]Mei Li, Zhizheng Wu, Lei Xie:
On the impact of phoneme alignment in DNN-based speech synthesis. SSW 2016: 196-201
- [c35]Zhizheng Wu, Oliver Watts, Simon King:
Merlin: An Open Source Neural Network Speech Synthesis System. SSW 2016: 202-207
- [i3]Zhizheng Wu, Simon King:
Investigating gated recurrent neural networks for speech synthesis. CoRR abs/1601.02539 (2016)
- [i2]Xiaohai Tian, Zhizheng Wu, Xiong Xiao, Eng Siong Chng, Haizhou Li:
Spoofing detection under noisy conditions: a preliminary investigation and an initial database. CoRR abs/1602.02950 (2016)
- [i1]Zhizheng Wu, Simon King:
Improving Trajectory Modelling for DNN-based Speech Synthesis by using Stacked Bottleneck Features and Minimum Trajectory Error Training. CoRR abs/1602.06727 (2016)
- 2015
- [b1]Zhizheng Wu:
Spectral mapping for voice conversion. Nanyang Technological University, Singapore, 2015
- [j6]Zhizheng Wu, Engsiong Chng, Haizhou Li:
Exemplar-based voice conversion using joint nonnegative matrix factorization. Multim. Tools Appl. 74(22): 9943-9958 (2015)
- [j5]Zhizheng Wu, Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi, Federico Alegre, Haizhou Li:
Spoofing and countermeasures for speaker verification: A survey. Speech Commun. 66: 130-153 (2015)
- [j4]Aleksandr Sizov, Elie Khoury, Tomi Kinnunen, Zhizheng Wu, Sébastien Marcel:
Joint Speaker Verification and Antispoofing in the i-Vector Space. IEEE Trans. Inf. Forensics Secur. 10(4): 821-832 (2015)
- [c34]Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Nguyen Quy Hy, Engsiong Chng, Minghui Dong:
Sparse representation for frequency warping based voice conversion. ICASSP 2015: 4235-4239
- [c33]Zhizheng Wu, Ali Khodabakhsh, Cenk Demiroglu, Junichi Yamagishi, Daisuke Saito, Tomoki Toda, Simon King:
SAS: A speaker verification spoofing database containing diverse attacks. ICASSP 2015: 4440-4444
- [c32]Zhizheng Wu, Cassia Valentini-Botinhao, Oliver Watts, Simon King:
Deep neural networks employing Multi-Task Learning and stacked bottleneck features for speech synthesis. ICASSP 2015: 4460-4464
- [c31]Zhizheng Wu, Simon King:
Minimum trajectory error training for deep neural networks, combined with stacked bottleneck features. INTERSPEECH 2015: 309-313
- [c30]Qiong Hu, Zhizheng Wu, Korin Richmond, Junichi Yamagishi, Yannis Stylianou, Ranniery Maia:
Fusion of multiple parameterisations for DNN-based sinusoidal speech synthesis with multi-task learning. INTERSPEECH 2015: 854-858
- [c29]Cassia Valentini-Botinhao, Zhizheng Wu, Simon King:
Towards minimum perceptual error training for DNN-based speech synthesis. INTERSPEECH 2015: 869-873
- [c28]Zhizheng Wu, Pawel Swietojanski, Christophe Veaux, Steve Renals, Simon King:
A study of speaker adaptation for DNN-based speech synthesis. INTERSPEECH 2015: 879-883
- [c27]Zhizheng Wu, Tomi Kinnunen, Nicholas W. D. Evans, Junichi Yamagishi, Cemal Hanilçi, Md. Sahidullah, Aleksandr Sizov:
ASVspoof 2015: the first automatic speaker verification spoofing and countermeasures challenge. INTERSPEECH 2015: 2037-2041
- [c26]Mirjam Wester, Zhizheng Wu, Junichi Yamagishi:
Human vs machine spoofing detection on wideband and narrowband data. INTERSPEECH 2015: 2047-2051
- [c25]Thomas Merritt, Junichi Yamagishi, Zhizheng Wu, Oliver Watts, Simon King:
Deep neural network context embeddings for model selection in rich-context HMM synthesis. INTERSPEECH 2015: 2207-2211
- [c24]Oliver Watts, Zhizheng Wu, Simon King:
Sentence-level control vectors for deep neural network speech synthesis. INTERSPEECH 2015: 2217-2221
- [c23]Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Nguyen Quy Hy, Minghui Dong, Engsiong Chng:
System fusion for high-performance voice conversion. INTERSPEECH 2015: 2759-2763
- [c22]Zhizheng Wu, Tomi Kinnunen:
Automatic speaker verification spoofing and countermeasures (ASVspoof 2015): introductory talk by the organizers. INTERSPEECH 2015
- [r2]Nicholas W. D. Evans, Federico Alegre, Zhizheng Wu, Tomi Kinnunen:
Anti-spoofing, Voice Conversion. Encyclopedia of Biometrics 2015: 115-122
- [r1]Nicholas W. D. Evans, Federico Alegre, Tomi Kinnunen, Zhizheng Wu, Junichi Yamagishi:
Anti-spoofing, Voice Databases. Encyclopedia of Biometrics 2015: 123-128
- 2014
- [j3]Zhizheng Wu, Tuomas Virtanen, Engsiong Chng, Haizhou Li:
Exemplar-Based Sparse Representation With Residual Compensation for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 22(10): 1506-1521 (2014)
- [c21]Zhizheng Wu, Sheng Gao, Engsiong Chng, Haizhou Li:
A study on replay attack and anti-spoofing for text-dependent speaker verification. APSIPA 2014: 1-5
- [c20]Elie Khoury, Tomi Kinnunen, Aleksandr Sizov, Zhizheng Wu, Sébastien Marcel:
Introducing i-vectors for joint anti-spoofing and speaker verification. INTERSPEECH 2014: 61-65
- [c19]Siu Wa Lee, Zhizheng Wu, Minghui Dong, Xiaohai Tian, Haizhou Li:
A comparative study of spectral transformation techniques for singing voice synthesis. INTERSPEECH 2014: 2499-2503
- [c18]Zhizheng Wu, Chng Eng Siong, Haizhou Li:
Joint nonnegative matrix factorization for exemplar-based voice conversion. INTERSPEECH 2014: 2509-2513
- [c17]Xiaohai Tian, Zhizheng Wu, Siu Wa Lee, Engsiong Chng:
Correlation-based frequency warping for voice conversion. ISCSLP 2014: 211-215
- [p1]Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi, Zhizheng Wu, Federico Alegre, Phillip L. De Leon:
Speaker Recognition Anti-spoofing. Handbook of Biometric Anti-Spoofing 2014: 125-146
- 2013
- [c16]Xiaohai Tian, Zhizheng Wu, Engsiong Chng:
Local partial least square regression for spectral mapping in voice conversion. APSIPA 2013: 1-6
- [c15]Zhizheng Wu, Haizhou Li:
Voice conversion and spoofing attack on speaker verification systems. APSIPA 2013: 1-9
- [c14]Zhizheng Wu, Engsiong Chng, Haizhou Li:
Conditional restricted Boltzmann machine for voice conversion. ChinaSIP 2013: 104-108
- [c13]Zhizheng Wu, Xiong Xiao, Engsiong Chng, Haizhou Li:
Synthetic speech detection using temporal modulation feature. ICASSP 2013: 7234-7238
- [c12]Zhizheng Wu, Anthony Larcher, Kong-Aik Lee, Engsiong Chng, Tomi Kinnunen, Haizhou Li:
Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints. INTERSPEECH 2013: 950-954
- [c11]Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Exemplar-based unit selection for voice conversion utilizing temporal information. INTERSPEECH 2013: 3057-3061
- [c10]Zhizheng Wu, Tuomas Virtanen, Tomi Kinnunen, Eng Siong Chng, Haizhou Li:
Exemplar-based voice conversion using non-negative spectrogram deconvolution. SSW 2013: 201-206
- 2012
- [j2]Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Mixture of Factor Analyzers Using Priors From Non-Parallel Speech for Voice Conversion. IEEE Signal Process. Lett. 19(12): 914-917 (2012)
- [c9]Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li, Eliathamby Ambikairajah:
A study on spoofing attack in state-of-the-art speaker verification: the telephone speech case. APSIPA 2012: 1-5
- [c8]Tomi Kinnunen, Zhizheng Wu, Kong-Aik Lee, Filip Sedlak, Engsiong Chng, Haizhou Li:
Vulnerability of speaker verification systems against voice conversion spoofing attacks: The case of telephone speech. ICASSP 2012: 4401-4404
- [c7]Zhizheng Wu, Chng Eng Siong, Haizhou Li:
Detecting Converted Speech and Natural Speech for anti-Spoofing Attack in Speaker Recognition. INTERSPEECH 2012: 1700-1703
- 2011
- [j1]Yao Qian, Zhizheng Wu, Boyang Gao, Frank K. Soong:
Improved Prosody Generation by Maximizing Joint Probability of State and Longer Units. IEEE Trans. Speech Audio Process. 19(6): 1702-1710 (2011)
- 2010
- [c6]Zhizheng Wu, Tomi Kinnunen, Engsiong Chng, Haizhou Li:
Text-independent F0 transformation with non-parallel data for voice conversion. INTERSPEECH 2010: 1732-1735
- [c5]Yao Qian, Zhizheng Wu, Xuezhe Ma, Frank K. Soong:
Automatic prosody prediction and detection with Conditional Random Field (CRF) models. ISCSLP 2010: 135-138
2000 – 2009
- 2009
- [c4]Yao Qian, Zhizheng Wu, Frank K. Soong:
Improved prosody generation by maximizing joint likelihood of state and longer units. ICASSP 2009: 3781-3784
- [c3]Yao Qian, Frank K. Soong, Miaomiao Wang, Zhizheng Wu:
A minimum v/u error approach to F0 generation in HMM-based TTS. INTERSPEECH 2009: 408-411
- 2008
- [c2]Boyang Gao, Yao Qian, Zhizheng Wu, Frank K. Soong:
Duration refinement by jointly optimizing state and longer unit likelihood. INTERSPEECH 2008: 2266-2269
- [c1]Zhizheng Wu, Yao Qian, Frank K. Soong, Bo Zhang:
Modeling and Generating Tone Contour with Phrase Intonation for Mandarin Chinese Speech. ISCSLP 2008: 121-124
last updated on 2024-12-11 20:44 CET by the dblp team
all metadata released as open data under CC0 1.0 license