Yong Xu 0004

Person information

affiliation: Tencent America LLC, Seattle, USA
affiliation (former): University of Surrey, Centre for Vision, Speech and Signal Processing, Guildford, UK
affiliation (PhD 2015): University of Science and Technology of China, Hefei, China

2020 – today

2024
[j14]
  - https://dblp.org/rec/journals/tmm/LiuXWW24
Yang Liu, Yong Xu, Peipei Wu, Wenwu Wang:
Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking. IEEE Trans. Multim. 26: 2544-2559 (2024)
[c59]
  - https://dblp.org/rec/conf/eusipco/ZhaoQ0LCB024
Jinzheng Zhao, Xinyuan Qian, Yong Xu, Haohe Liu, Yin Cao, Davide Berghi, Wenwu Wang:
Text-Queried Target Sound Event Localization. EUSIPCO 2024: 261-265
[c58]
  - https://dblp.org/rec/conf/icassp/XuXKWYY24
Zhongweiyang Xu, Yong Xu, Vinay Kothapally, Heming Wang, Muqiao Yang, Dong Yu:
SPATIALCODEC: Neural Spatial Speech Coding. ICASSP 2024: 1131-1135
[c57]
  - https://dblp.org/rec/conf/icassp/YangZXXWR024
Muqiao Yang, Chunlei Zhang, Yong Xu, Zhongweiyang Xu, Heming Wang, Bhiksha Raj, Dong Yu:
uSee: Unified Speech Enhancement And Editing with Conditional Diffusion Models. ICASSP 2024: 7125-7129
[i49]
  - https://dblp.org/rec/journals/corr/abs-2408-17431
Mohan Shi, Zengrui Jin, Yaoxun Xu, Yong Xu, Shi-Xiong Zhang, Kun Wei, Yiwen Shao, Chunlei Zhang, Dong Yu:
Advancing Multi-talker ASR Performance with Large Language Models. CoRR abs/2408.17431 (2024)
[i48]
  - https://dblp.org/rec/journals/corr/abs-2409-00819
Zengrui Jin, Yifan Yang, Mohan Shi, Wei Kang, Xiaoyu Yang, Zengwei Yao, Fangjun Kuang, Liyong Guo, Lingwei Meng, Long Lin, Yong Xu, Shi-Xiong Zhang, Daniel Povey:
LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization. CoRR abs/2409.00819 (2024)
[i47]
  - https://dblp.org/rec/journals/corr/abs-2409-10819
Jiarui Hai, Yong Xu, Hao Zhang, Chenxing Li, Helin Wang, Mounya Elhilali, Dong Yu:
EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer. CoRR abs/2409.10819 (2024)
2023
[c56]
  - https://dblp.org/rec/conf/asru/YuXZZY23
Meng Yu, Yong Xu, Chunlei Zhang, Shi-Xiong Zhang, Dong Yu:
Neuralecho: Hybrid of Full-Band and Sub-Band Recurrent Neural Network For Acoustic Echo Cancellation and Speech Enhancement. ASRU 2023: 1-8
[c55]
  - https://dblp.org/rec/conf/icassp/KothapallyXYZY23
Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu:
Deep Neural Mel-Subband Beamformer for in-Car Speech Separation. ICASSP 2023: 1-5
[c54]
  - https://dblp.org/rec/conf/interspeech/0004K0Z023
Yong Xu, Vinay Kothapally, Meng Yu, Shixiong Zhang, Dong Yu:
Zoneformer: On-device Neural Beamformer For In-car Multi-zone Speech Separation, Enhancement and Echo Cancellation. INTERSPEECH 2023: 5117-5121
[i46]
  - https://dblp.org/rec/journals/corr/abs-2309-07432
Zhongweiyang Xu, Yong Xu, Vinay Kothapally, Heming Wang, Muqiao Yang, Dong Yu:
SpatialCodec: Neural Spatial Speech Coding. CoRR abs/2309.07432 (2023)
[i45]
  - https://dblp.org/rec/journals/corr/abs-2309-16308
Jinzheng Zhao, Yong Xu, Xinyuan Qian, Wenwu Wang:
Audio Visual Speaker Localization from EgoCentric Views. CoRR abs/2309.16308 (2023)
[i44]
  - https://dblp.org/rec/journals/corr/abs-2310-00900
Muqiao Yang, Chunlei Zhang, Yong Xu, Zhongweiyang Xu, Heming Wang, Bhiksha Raj, Dong Yu:
uSee: Unified Speech Enhancement and Editing with Conditional Diffusion Models. CoRR abs/2310.00900 (2023)
[i43]
  - https://dblp.org/rec/journals/corr/abs-2310-14778
Jinzheng Zhao, Yong Xu, Xinyuan Qian, Davide Berghi, Peipei Wu, Meng Cui, Jianyuan Sun, Philip J. B. Jackson, Wenwu Wang:
Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions. CoRR abs/2310.14778 (2023)
2022
[c53]
  - https://dblp.org/rec/conf/eusipco/ZhaoWGLSXW22
Jinzheng Zhao, Peipei Wu, Shidrokh Goudarzi, Xubo Liu, Jianyuan Sun, Yong Xu, Wenwu Wang:
Visually Assisted Self-supervised Audio Speaker Localization and Tracking. EUSIPCO 2022: 787-791
[c52]
  - https://dblp.org/rec/conf/icassp/ZhaoWLXMGW22
Jinzheng Zhao, Peipei Wu, Xubo Liu, Yong Xu, Lyudmila Mihaylova, Simon J. Godsill, Wenwu Wang:
Audio-Visual Tracking of Multiple Speakers Via a PMBM Filter. ICASSP 2022: 5068-5072
[c51]
  - https://dblp.org/rec/conf/interspeech/Kothapally00Z022
Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu:
Joint Neural AEC and Beamforming with Double-Talk Detection. INTERSPEECH 2022: 2528-2532
[c50]
  - https://dblp.org/rec/conf/interspeech/ZhaoWLGLXW22
Jinzheng Zhao, Peipei Wu, Xubo Liu, Shidrokh Goudarzi, Haohe Liu, Yong Xu, Wenwu Wang:
Audio Visual Multi-Speaker Tracking with Improved GCF and PMBM Filter. INTERSPEECH 2022: 3704-3708
[c49]
  - https://dblp.org/rec/conf/slt/MaitiUWZYZX22
Soumi Maiti, Yushi Ueda, Shinji Watanabe, Chunlei Zhang, Meng Yu, Shi-Xiong Zhang, Yong Xu:
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers. SLT 2022: 480-487
[i42]
  - https://dblp.org/rec/journals/corr/abs-2203-17068
Yushi Ueda, Soumi Maiti, Shinji Watanabe, Chunlei Zhang, Meng Yu, Shi-Xiong Zhang, Yong Xu:
EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers. CoRR abs/2203.17068 (2022)
[i41]
  - https://dblp.org/rec/journals/corr/abs-2205-10401
Meng Yu, Yong Xu, Chunlei Zhang, Shi-Xiong Zhang, Dong Yu:
NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement. CoRR abs/2205.10401 (2022)
[i40]
  - https://dblp.org/rec/journals/corr/abs-2210-12345
Qiuqiang Kong, Shilei Liu, Junjie Shi, Xuzhou Ye, Yin Cao, Qiaoxi Zhu, Yong Xu, Yuxuan Wang:
Neural Sound Field Decomposition with Super-resolution of Sound Direction. CoRR abs/2210.12345 (2022)
[i39]
  - https://dblp.org/rec/journals/corr/abs-2211-12590
Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu:
Deep Neural Mel-Subband Beamformer for In-car Speech Separation. CoRR abs/2211.12590 (2022)
2021
[j13]
  - https://dblp.org/rec/journals/taslp/MichelsantiTZXY21
Daniel Michelsanti, Zheng-Hua Tan, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu, Jesper Jensen:
An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1368-1396 (2021)
[j12]
  - https://dblp.org/rec/journals/taslp/ZhangXYZCWY21
Zhuohuang Zhang, Yong Xu, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Donald S. Williamson, Dong Yu:
Multi-Channel Multi-Frame ADL-MVDR for Target Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3526-3540 (2021)
[c48]
  - https://dblp.org/rec/conf/icassp/Zhang00ZC021
Zhuohuang Zhang, Yong Xu, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Dong Yu:
ADL-MVDR: All Deep Learning MVDR Beamformer for Target Speech Separation. ICASSP 2021: 6089-6093
[c47]
  - https://dblp.org/rec/conf/icassp/SubramanianW00021
Aswin Shanmugam Subramanian, Chao Weng, Shinji Watanabe, Meng Yu, Yong Xu, Shi-Xiong Zhang, Dong Yu:
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization. ICASSP 2021: 8433-8437
[c46]
  - https://dblp.org/rec/conf/interspeech/WangWC0YXZWSY21
Helin Wang, Bo Wu, Lianwu Chen, Meng Yu, Jianwei Yu, Yong Xu, Shi-Xiong Zhang, Chao Weng, Dan Su, Dong Yu:
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation. Interspeech 2021: 1109-1113
[c45]
  - https://dblp.org/rec/conf/interspeech/LiX0Z00021
Xiyun Li, Yong Xu, Meng Yu, Shi-Xiong Zhang, Jiaming Xu, Bo Xu, Dong Yu:
MIMO Self-Attentive RNN Beamformer for Multi-Speaker Speech Separation. Interspeech 2021: 1119-1123
[c44]
  - https://dblp.org/rec/conf/interspeech/YuZXZ021
Meng Yu, Chunlei Zhang, Yong Xu, Shi-Xiong Zhang, Dong Yu:
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment. Interspeech 2021: 2142-2146
[c43]
  - https://dblp.org/rec/conf/interspeech/XuZ0Z021
Yong Xu, Zhuohuang Zhang, Meng Yu, Shi-Xiong Zhang, Dong Yu:
Generalized Spatio-Temporal RNN Beamformer for Target Speech Separation. Interspeech 2021: 3076-3080
[c42]
  - https://dblp.org/rec/conf/slt/LiuYXWZC021
Jianming Liu, Meng Yu, Yong Xu, Chao Weng, Shi-Xiong Zhang, Lianwu Chen, Dong Yu:
Neural Mask based Multi-channel Convolutional Beamforming for Joint Dereverberation, Echo Cancellation and Denoising. SLT 2021: 766-770
[c41]
  - https://dblp.org/rec/conf/slt/NiXYWZYM21
Zhaoheng Ni, Yong Xu, Meng Yu, Bo Wu, Shi-Xiong Zhang, Dong Yu, Michael I. Mandel:
WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and Dereverberation. SLT 2021: 817-824
[i38]
  - https://dblp.org/rec/journals/corr/abs-2101-01280
Yong Xu, Zhuohuang Zhang, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Dong Yu:
Generalized RNN beamformer for target speech separation. CoRR abs/2101.01280 (2021)
[i37]
  - https://dblp.org/rec/journals/corr/abs-2103-16849
Helin Wang, Bo Wu, Lianwu Chen, Meng Yu, Jianwei Yu, Yong Xu, Shi-Xiong Zhang, Chao Weng, Dan Su, Dong Yu:
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation. CoRR abs/2103.16849 (2021)
[i36]
  - https://dblp.org/rec/journals/corr/abs-2104-01227
Meng Yu, Chunlei Zhang, Yong Xu, Shi-Xiong Zhang, Dong Yu:
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment. CoRR abs/2104.01227 (2021)
[i35]
  - https://dblp.org/rec/journals/corr/abs-2104-08450
Xiyun Li, Yong Xu, Meng Yu, Shi-Xiong Zhang, Jiaming Xu, Bo Xu, Dong Yu:
MIMO Self-attentive RNN Beamformer for Multi-speaker Speech Separation. CoRR abs/2104.08450 (2021)
[i34]
  - https://dblp.org/rec/journals/corr/abs-2111-04904
Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu:
Joint AEC AND Beamforming with Double-Talk Detection using RNN-Transformer. CoRR abs/2111.04904 (2021)
2020
[j11]
  - https://dblp.org/rec/journals/jstsp/GuZXCZY20
Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Lianwu Chen, Yuexian Zou, Dong Yu:
Multi-Modal Multi-Channel Target Speech Separation. IEEE J. Sel. Top. Signal Process. 14(3): 530-541 (2020)
[j10]
  - https://dblp.org/rec/journals/jstsp/TanXZYY20
Ke Tan, Yong Xu, Shi-Xiong Zhang, Meng Yu, Dong Yu:
Audio-Visual Speech Separation and Dereverberation With a Two-Stage Multimodal Network. IEEE J. Sel. Top. Signal Process. 14(3): 542-553 (2020)
[j9]
  - https://dblp.org/rec/journals/taslp/KongXWP20
Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2450-2460 (2020)
[c40]
  - https://dblp.org/rec/conf/icassp/DingXZCW20
Yifan Ding, Yong Xu, Shi-Xiong Zhang, Yahuan Cong, Liqiang Wang:
Self-Supervised Learning for Audio-Visual Speaker Diarization. ICASSP 2020: 4367-4371
[c39]
  - https://dblp.org/rec/conf/icassp/SubramanianWYZX20
Aswin Shanmugam Subramanian, Chao Weng, Meng Yu, Shi-Xiong Zhang, Yong Xu, Shinji Watanabe, Dong Yu:
Far-Field Location Guided Target Speech Extraction Using End-to-End Speech Recognition Objectives. ICASSP 2020: 7299-7303
[c38]
  - https://dblp.org/rec/conf/icassp/GuZCXYSZY20
Rongzhi Gu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning. ICASSP 2020: 7319-7323
[c37]
  - https://dblp.org/rec/conf/interspeech/XuYZCWL020
Yong Xu, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Chao Weng, Jianming Liu, Dong Yu:
Neural Spatio-Temporal Beamformer for Target Speech Separation. INTERSPEECH 2020: 56-60
[c36]
  - https://dblp.org/rec/conf/interspeech/YuWGZCX00YLM20
Jianwei Yu, Bo Wu, Rongzhi Gu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu, Xunying Liu, Helen Meng:
Audio-Visual Multi-Channel Recognition of Overlapped Speech. INTERSPEECH 2020: 3496-3500
[i33]
  - https://dblp.org/rec/journals/corr/abs-2002-05314
Yifan Ding, Yong Xu, Shi-Xiong Zhang, Yahuan Cong, Liqiang Wang:
Self-supervised learning for audio-visual speaker diarization. CoRR abs/2002.05314 (2020)
[i32]
  - https://dblp.org/rec/journals/corr/abs-2003-03927
Rongzhi Gu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
Enhancing End-to-End Multi-channel Speech Separation via Spatial Feature Learning. CoRR abs/2003.03927 (2020)
[i31]
  - https://dblp.org/rec/journals/corr/abs-2003-07032
Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Lianwu Chen, Yuexian Zou, Dong Yu:
Multi-modal Multi-channel Target Speech Separation. CoRR abs/2003.07032 (2020)
[i30]
  - https://dblp.org/rec/journals/corr/abs-2005-03889
Yong Xu, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Chao Weng, Jianming Liu, Dong Yu:
Neural Spatio-Temporal Beamformer for Target Speech Separation. CoRR abs/2005.03889 (2020)
[i29]
  - https://dblp.org/rec/journals/corr/abs-2005-08571
Jianwei Yu, Bo Wu, Rongzhi Gu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Dong Yu, Xunying Liu, Helen Meng:
Audio-visual Multi-channel Recognition of Overlapped Speech. CoRR abs/2005.08571 (2020)
[i28]
  - https://dblp.org/rec/journals/corr/abs-2008-09586
Daniel Michelsanti, Zheng-Hua Tan, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu, Jesper Jensen:
An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation. CoRR abs/2008.09586 (2020)
[i27]
  - https://dblp.org/rec/journals/corr/abs-2011-00091
Aswin Shanmugam Subramanian, Chao Weng, Shinji Watanabe, Meng Yu, Yong Xu, Shi-Xiong Zhang, Dong Yu:
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization. CoRR abs/2011.00091 (2020)
[i26]
  - https://dblp.org/rec/journals/corr/abs-2011-09162
Zhaoheng Ni, Yong Xu, Meng Yu, Bo Wu, Shi-Xiong Zhang, Dong Yu, Michael I. Mandel:
WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and Dereverberation. CoRR abs/2011.09162 (2020)
[i25]
  - https://dblp.org/rec/journals/corr/abs-2012-13442
Zhuohuang Zhang, Yong Xu, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Donald S. Williamson, Dong Yu:
Multi-channel Multi-frame ADL-MVDR for Target Speech Separation. CoRR abs/2012.13442 (2020)

2010 – 2019

2019
[j8]
  - https://dblp.org/rec/journals/taslp/KongXSWP19
Qiuqiang Kong, Yong Xu, Iwona Sobieraj, Wenwu Wang, Mark D. Plumbley:
Sound Event Detection and Time-Frequency Segmentation from Weakly Labelled Data. IEEE ACM Trans. Audio Speech Lang. Process. 27(4): 777-787 (2019)
[j7]
  - https://dblp.org/rec/journals/taslp/KongYXIWP19
Qiuqiang Kong, Changsong Yu, Yong Xu, Turab Iqbal, Wenwu Wang, Mark D. Plumbley:
Weakly Labelled AudioSet Tagging With Attention Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1791-1802 (2019)
[c35]
  - https://dblp.org/rec/conf/asru/WuXZCYXY19
Jian Wu, Yong Xu, Shi-Xiong Zhang, Lianwu Chen, Meng Yu, Lei Xie, Dong Yu:
Time Domain Audio Visual Speech Separation. ASRU 2019: 667-673
[c34]
  - https://dblp.org/rec/conf/icassp/KongXICWP19
Qiuqiang Kong, Yong Xu, Turab Iqbal, Yin Cao, Wenwu Wang, Mark D. Plumbley:
Acoustic Scene Generation with Conditional SampleRNN. ICASSP 2019: 925-929
[c33]
  - https://dblp.org/rec/conf/icassp/XuWHLYSY19
Yong Xu, Chao Weng, Like Hui, Jianming Liu, Meng Yu, Dan Su, Dong Yu:
Joint Training of Complex Ratio Mask Based Beamformer and Acoustic Model for Noise Robust ASR. ICASSP 2019: 6745-6749
[c32]
  - https://dblp.org/rec/conf/icassp/HaoSXSX19
Xiang Hao, Changhao Shan, Yong Xu, Sining Sun, Lei Xie:
An Attention-based Neural Network Approach for Single Channel Speech Enhancement. ICASSP 2019: 6895-6899
[c31]
  - https://dblp.org/rec/conf/ijcai/KongXJWP19
Qiuqiang Kong, Yong Xu, Philip J. B. Jackson, Wenwu Wang, Mark D. Plumbley:
Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks. IJCAI 2019: 2747-2753
[c30]
  - https://dblp.org/rec/conf/interspeech/WuXZCYX019
Jian Wu, Yong Xu, Shi-Xiong Zhang, Lianwu Chen, Meng Yu, Lei Xie, Dong Yu:
Improved Speaker-Dependent Separation for CHiME-5 Challenge. INTERSPEECH 2019: 466-470
[c29]
  - https://dblp.org/rec/conf/interspeech/GuCZZXYSZ019
Rongzhi Gu, Lianwu Chen, Shi-Xiong Zhang, Jimeng Zheng, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information. INTERSPEECH 2019: 4290-4294
[c28]
  - https://dblp.org/rec/conf/interspeech/BahmaninezhadWG19
Fahimeh Bahmaninezhad, Jian Wu, Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu:
A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation. INTERSPEECH 2019: 4574-4578
[i24]
  - https://dblp.org/rec/journals/corr/abs-1903-00765
Qiuqiang Kong, Changsong Yu, Turab Iqbal, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Weakly labelled AudioSet Classification with Attention Neural Networks. CoRR abs/1903.00765 (2019)
[i23]
  - https://dblp.org/rec/journals/corr/abs-1904-03476
Qiuqiang Kong, Yin Cao, Turab Iqbal, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Cross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems. CoRR abs/1904.03476 (2019)
[i22]
  - https://dblp.org/rec/journals/corr/abs-1904-03760
Jian Wu, Yong Xu, Shi-Xiong Zhang, Lianwu Chen, Meng Yu, Lei Xie, Dong Yu:
Time Domain Audio Visual Speech Separation. CoRR abs/1904.03760 (2019)
[i21]
  - https://dblp.org/rec/journals/corr/abs-1904-03792
Jian Wu, Yong Xu, Shi-Xiong Zhang, Lianwu Chen, Meng Yu, Lei Xie, Dong Yu:
Improved Speaker-Dependent Separation for CHiME-5 Challenge. CoRR abs/1904.03792 (2019)
[i20]
  - https://dblp.org/rec/journals/corr/abs-1905-06286
Rongzhi Gu, Jian Wu, Shi-Xiong Zhang, Lianwu Chen, Yong Xu, Meng Yu, Dan Su, Yuexian Zou, Dong Yu:
End-to-End Multi-Channel Speech Separation. CoRR abs/1905.06286 (2019)
[i19]
  - https://dblp.org/rec/journals/corr/abs-1905-07497
Fahimeh Bahmaninezhad, Jian Wu, Rongzhi Gu, Shi-Xiong Zhang, Yong Xu, Meng Yu, Dong Yu:
A comprehensive study of speech separation: spectrogram vs waveform separation. CoRR abs/1905.07497 (2019)
[i18]
  - https://dblp.org/rec/journals/corr/abs-1906-07552
Qiuqiang Kong, Yong Xu, Wenwu Wang, Philip J. B. Jackson, Mark D. Plumbley:
Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks. CoRR abs/1906.07552 (2019)
[i17]
  - https://dblp.org/rec/journals/corr/abs-1909-07352
Ke Tan, Yong Xu, Shi-Xiong Zhang, Meng Yu, Dong Yu:
Audio-Visual Speech Separation and Dereverberation with a Two-Stage Multimodal Network. CoRR abs/1909.07352 (2019)
[i16]
  - https://dblp.org/rec/journals/corr/abs-1912-04761
Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Sound Event Detection of Weakly Labelled Data with CNN-Transformer and Automatic Threshold Optimization. CoRR abs/1912.04761 (2019)
[i15]
  - https://dblp.org/rec/journals/corr/abs-1912-07814
Fahimeh Bahmaninezhad, Shi-Xiong Zhang, Yong Xu, Meng Yu, John H. L. Hansen, Dong Yu:
A Unified Framework for Speech Separation. CoRR abs/1912.07814 (2019)
2018
[j6]
- dblp key: journals/vlsisp/SunDXX18
- persistent URL: https://dblp.org/rec/journals/vlsisp/SunDXX18
Lei Sun, Jun Du, Zhipeng Xie, Yong Xu:
Auxiliary Features from Laser-Doppler Vibrometer Sensor for Deep Neural Network Based Robust Speech Recognition. J. Signal Process. Syst. 90(7): 975-983 (2018)
[c27]
- electronic edition @ dcase.community (open access)
- dblp key: conf/dcase/KongIXWP18
- persistent URL: https://dblp.org/rec/conf/dcase/KongIXWP18
Qiuqiang Kong, Turab Iqbal, Yong Xu, Wenwu Wang, Mark D. Plumbley:
DCASE 2018 Challenge Surrey cross-task convolutional neural network baseline. DCASE 2018: 217-221
[c26]
- dblp key: conf/eusipco/Iqbal0KW18
- persistent URL: https://dblp.org/rec/conf/eusipco/Iqbal0KW18
Turab Iqbal, Yong Xu, Qiuqiang Kong, Wenwu Wang:
Capsule Routing for Sound Event Detection. EUSIPCO 2018: 2255-2259
[c25]
- dblp key: conf/hci/DuelFK0JP18
- persistent URL: https://dblp.org/rec/conf/hci/DuelFK0JP18
Tijs Duel, David M. Frohlich, Christian Kroos, Yong Xu, Philip J. B. Jackson, Mark D. Plumbley:
Supporting Audiography: Design of a System for Sentimental Sound Recording, Classification and Playback. HCI (28) 2018: 24-31
[c24]
- dblp key: conf/ica/ZerminiKXPW18
- persistent URL: https://dblp.org/rec/conf/ica/ZerminiKXPW18
Alfredo Zermini, Qiuqiang Kong, Yong Xu, Mark D. Plumbley, Wenwu Wang:
Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks. LVA/ICA 2018: 361-371
[c23]
- dblp key: conf/icassp/0004KWP18
- persistent URL: https://dblp.org/rec/conf/icassp/0004KWP18
Yong Xu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network. ICASSP 2018: 121-125
[c22]
- dblp key: conf/icassp/Kong0WP18
- persistent URL: https://dblp.org/rec/conf/icassp/Kong0WP18
Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Audio Set Classification with Attention Model: A Probabilistic Perspective. ICASSP 2018: 316-320
[c21]
- dblp key: conf/icassp/KongXWP18
- persistent URL: https://dblp.org/rec/conf/icassp/KongXWP18
Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
A Joint Separation-Classification Model for Sound Event Detection of Weakly Labelled Data. ICASSP 2018: 321-325
[c20]
- dblp key: conf/icassp/Liu0JWC18
- persistent URL: https://dblp.org/rec/conf/icassp/Liu0JWC18
Qingju Liu, Yong Xu, Philip J. B. Jackson, Wenwu Wang, Philip Coleman:
Iterative Deep Neural Networks for Speaker-Independent Binaural Blind Speech Separation. ICASSP 2018: 541-545
[i14]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/abs-1804-04715
- persistent URL: https://dblp.org/rec/journals/corr/abs-1804-04715
Qiuqiang Kong, Yong Xu, Iwona Sobieraj, Wenwu Wang, Mark D. Plumbley:
Sound Event Detection and Time-Frequency Segmentation from Weakly Labelled Data. CoRR abs/1804.04715 (2018)
[i13]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/abs-1806-04699
- persistent URL: https://dblp.org/rec/journals/corr/abs-1806-04699
Turab Iqbal, Yong Xu, Qiuqiang Kong, Wenwu Wang:
Capsule Routing for Sound Event Detection. CoRR abs/1806.04699 (2018)
[i12]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/abs-1808-00773
- persistent URL: https://dblp.org/rec/journals/corr/abs-1808-00773
Qiuqiang Kong, Turab Iqbal, Yong Xu, Wenwu Wang, Mark D. Plumbley:
DCASE 2018 Challenge baseline with convolutional neural networks. CoRR abs/1808.00773 (2018)
2017
[j5]
- dblp key: journals/pr/DuX17
- persistent URL: https://dblp.org/rec/journals/pr/DuX17
Jun Du, Yong Xu:
Hierarchical deep neural network for multivariate regression. Pattern Recognit. 63: 149-157 (2017)
[j4]
- electronic edition via DOI (open access)
- dblp key: journals/taslp/XuHWFSJP17
- persistent URL: https://dblp.org/rec/journals/taslp/XuHWFSJP17
Yong Xu, Qiang Huang, Wenwu Wang, Peter Foster, Siddharth Sigtia, Philip J. B. Jackson, Mark D. Plumbley:
Unsupervised Feature Learning Based on Deep Models for Environmental Audio Tagging. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1230-1241 (2017)
[c19]
- dblp key: conf/eusipco/KongXP17
- persistent URL: https://dblp.org/rec/conf/eusipco/KongXP17
Qiuqiang Kong, Yong Xu, Mark D. Plumbley:
Joint detection and classification convolutional neural network on weakly labelled bird audio detection. EUSIPCO 2017: 1749-1753
[c18]
- dblp key: conf/icassp/KongXWP17
- persistent URL: https://dblp.org/rec/conf/icassp/KongXWP17
Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
A joint detection-classification model for audio tagging of weakly labelled data. ICASSP 2017: 641-645
[c17]
- dblp key: conf/icassp/HuangXJWP17
- persistent URL: https://dblp.org/rec/conf/icassp/HuangXJWP17
Qiang Huang, Yong Xu, Philip J. B. Jackson, Wenwu Wang, Mark D. Plumbley:
Fast tagging of natural sounds using marginal co-regularization. ICASSP 2017: 2991-2995
[c16]
- dblp key: conf/ijcnn/XuKHWP17
- persistent URL: https://dblp.org/rec/conf/ijcnn/XuKHWP17
Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Convolutional gated recurrent neural network incorporating spatial features for audio tagging. IJCNN 2017: 3461-3466
[c15]
- electronic edition via DOI (open access)
- dblp key: conf/interspeech/XuKHWP17
- persistent URL: https://dblp.org/rec/conf/interspeech/XuKHWP17
Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging. INTERSPEECH 2017: 3083-3087
[c14]
- dblp key: conf/mmsp/ZerminiLXPBW17
- persistent URL: https://dblp.org/rec/conf/mmsp/ZerminiLXPBW17
Alfredo Zermini, Qingju Liu, Yong Xu, Mark D. Plumbley, Dave Betts, Wenwu Wang:
Binaural and log-power spectra features with deep neural networks for speech-noise separation. MMSP 2017: 1-6
[i11]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/XuKHWP17
- persistent URL: https://dblp.org/rec/journals/corr/XuKHWP17
Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Convolutional Gated Recurrent Neural Network Incorporating Spatial Features for Audio Tagging. CoRR abs/1702.07787 (2017)
[i10]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/XuKHWP17a
- persistent URL: https://dblp.org/rec/journals/corr/XuKHWP17a
Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Attention and Localization based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging. CoRR abs/1703.06052 (2017)
[i9]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/XuDHDL17
- persistent URL: https://dblp.org/rec/journals/corr/XuDHDL17
Yong Xu, Jun Du, Zhen Huang, Li-Rong Dai, Chin-Hui Lee:
Multi-Objective Learning and Mask-Based Post-Processing for Deep Neural Network Based Speech Enhancement. CoRR abs/1703.07172 (2017)
[i8]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/abs-1709-00551
- persistent URL: https://dblp.org/rec/journals/corr/abs-1709-00551
Yong Xu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Surrey-cvssp system for DCASE2017 challenge task4. CoRR abs/1709.00551 (2017)
[i7]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/abs-1710-00343
- persistent URL: https://dblp.org/rec/journals/corr/abs-1710-00343
Yong Xu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Large-scale weakly supervised audio classification using gated convolutional neural network. CoRR abs/1710.00343 (2017)
[i6]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/abs-1711-00927
- persistent URL: https://dblp.org/rec/journals/corr/abs-1711-00927
Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Audio Set classification with attention model: A probabilistic perspective. CoRR abs/1711.00927 (2017)
[i5]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/abs-1711-03037
- persistent URL: https://dblp.org/rec/journals/corr/abs-1711-03037
Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
A joint separation-classification model for sound event detection of weakly labelled data. CoRR abs/1711.03037 (2017)
2016
[j3]
- electronic edition via DOI (open access)
- dblp key: journals/ejasp/GaoDXLDL16
- persistent URL: https://dblp.org/rec/journals/ejasp/GaoDXLDL16
Tian Gao, Jun Du, Yong Xu, Cong Liu, Li-Rong Dai, Chin-Hui Lee:
Joint training of DNNs by incorporating an explicit dereverberation structure for distant speech recognition. EURASIP J. Adv. Signal Process. 2016: 86 (2016)
[c13]
- electronic edition @ dcase.community (open access)
- dblp key: conf/dcase/XuHWJP16
- persistent URL: https://dblp.org/rec/conf/dcase/XuHWJP16
Yong Xu, Qiang Huang, Wenwu Wang, Philip J. B. Jackson, Mark D. Plumbley:
Fully DNN-Based Multi-Label Regression for Audio Tagging. DCASE 2016: 105-109
[c12]
- electronic edition @ dcase.community (open access)
- dblp key: conf/dcase/XuHWP16
- persistent URL: https://dblp.org/rec/conf/dcase/XuHWP16
Yong Xu, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Hierarchical Learning for DNN-Based Acoustic Scene Classification. DCASE 2016: 110-114
[c11]
- dblp key: conf/iscslp/XieDMXMW16
- persistent URL: https://dblp.org/rec/conf/iscslp/XieDMXMW16
Zhipeng Xie, Jun Du, Ian McLoughlin, Yong Xu, Feng Ma, Haikun Wang:
Deep neural network for robust speech recognition with auxiliary features from laser-Doppler vibrometer sensor. ISCSLP 2016: 1-5
[i4]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/XuHWJP16
- persistent URL: https://dblp.org/rec/journals/corr/XuHWJP16
Yong Xu, Qiang Huang, Wenwu Wang, Philip J. B. Jackson, Mark D. Plumbley:
Fully DNN-based Multi-label regression for audio tagging. CoRR abs/1606.07695 (2016)
[i3]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/XuHWFSJP16
- persistent URL: https://dblp.org/rec/journals/corr/XuHWFSJP16
Yong Xu, Qiang Huang, Wenwu Wang, Peter Foster, Siddharth Sigtia, Philip J. B. Jackson, Mark D. Plumbley:
Fully Deep Neural Networks Incorporating Unsupervised Feature Learning for Audio Tagging. CoRR abs/1607.03681 (2016)
[i2]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/XuHWP16
- persistent URL: https://dblp.org/rec/journals/corr/XuHWP16
Yong Xu, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Hierarchical learning for DNN-based acoustic scene classification. CoRR abs/1607.03682 (2016)
[i1]
- electronic edition @ arxiv.org (open access)
- dblp key: journals/corr/KongXWP16
- persistent URL: https://dblp.org/rec/journals/corr/KongXWP16
Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
A Joint Detection-Classification Model for Audio Tagging of Weakly Labelled Data. CoRR abs/1610.01797 (2016)
2015
[j2]
- dblp key: journals/taslp/XuDDL15
- persistent URL: https://dblp.org/rec/journals/taslp/XuDDL15
Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
A Regression Approach to Speech Enhancement Based on Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 23(1): 7-19 (2015)
[c10]
- dblp key: conf/ica/GaoDXLDL15
- persistent URL: https://dblp.org/rec/conf/ica/GaoDXLDL15
Tian Gao, Jun Du, Yong Xu, Cong Liu, Li-Rong Dai, Chin-Hui Lee:
Improving Deep Neural Network Based Speech Enhancement in Low SNR Environments. LVA/ICA 2015: 75-82
[c9]
- electronic edition via DOI (open access)
- dblp key: conf/interspeech/XuDHDL15
- persistent URL: https://dblp.org/rec/conf/interspeech/XuDHDL15
Yong Xu, Jun Du, Zhen Huang, Li-Rong Dai, Chin-Hui Lee:
Multi-objective learning and mask-based post-processing for deep neural network based speech enhancement. INTERSPEECH 2015: 1508-1512
[c8]
- electronic edition via DOI (open access)
- dblp key: conf/interspeech/LiHXL15
- persistent URL: https://dblp.org/rec/conf/interspeech/LiHXL15
Kehuang Li, Zhen Huang, Yong Xu, Chin-Hui Lee:
DNN-based speech bandwidth expansion and its application to adding high-frequency missing features for automatic speech recognition of narrowband speech. INTERSPEECH 2015: 2578-2582
2014
[j1]
- dblp key: journals/spl/XuDDL14
- persistent URL: https://dblp.org/rec/journals/spl/XuDDL14
Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
An Experimental Study on Speech Enhancement Based on Deep Neural Networks. IEEE Signal Process. Lett. 21(1): 65-68 (2014)
[c7]
- dblp key: conf/chinasip/XuDDL14
- persistent URL: https://dblp.org/rec/conf/chinasip/XuDDL14
Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Global variance equalization for improving deep neural network based speech enhancement. ChinaSIP 2014: 71-75
[c6]
- electronic edition via DOI (open access)
- dblp key: conf/interspeech/DuWGXDL14
- persistent URL: https://dblp.org/rec/conf/interspeech/DuWGXDL14
Jun Du, Qing Wang, Tian Gao, Yong Xu, Li-Rong Dai, Chin-Hui Lee:
Robust speech recognition with speech enhanced deep neural networks. INTERSPEECH 2014: 616-620
[c5]
- electronic edition via DOI (open access)
- dblp key: conf/interspeech/XuDDL14
- persistent URL: https://dblp.org/rec/conf/interspeech/XuDDL14
Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Dynamic noise aware training for speech enhancement based on deep neural networks. INTERSPEECH 2014: 2670-2674
[c4]
- dblp key: conf/iscslp/TuDXDL14
- persistent URL: https://dblp.org/rec/conf/iscslp/TuDXDL14
Yanhui Tu, Jun Du, Yong Xu, Li-Rong Dai, Chin-Hui Lee:
Speech separation based on improved deep neural networks with dual outputs of speech features for both target and interfering speakers. ISCSLP 2014: 250-254
[c3]
- dblp key: conf/iscslp/XuDDL14
- persistent URL: https://dblp.org/rec/conf/iscslp/XuDDL14
Yong Xu, Jun Du, Li-Rong Dai, Chin-Hui Lee:
Cross-language transfer learning for deep neural network based speech enhancement. ISCSLP 2014: 336-340
2012
[c2]
- dblp key: conf/iscslp/XuGSD12
- persistent URL: https://dblp.org/rec/conf/iscslp/XuGSD12
Yong Xu, Wu Guo, Shan Su, Li-Rong Dai:
Spoken term detection for OOV terms based on triphone confusion matrix. ISCSLP 2012: 98-102
[c1]
- dblp key: conf/iscslp/XuGD12
- persistent URL: https://dblp.org/rec/conf/iscslp/XuGD12
Yong Xu, Wu Guo, Li-Rong Dai:
A hybrid fragment / syllable-based system for improved OOV term detection. ISCSLP 2012: 378-382
