default search action
Qiuqiang Kong
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j17]Yin Zhu, Qiuqiang Kong, Junjie Shi, Shilei Liu, Xuzhou Ye, Ju-Chiang Wang, Hongming Shan, Junping Zhang:
End-to-End Paired Ambisonic-Binaural Audio Rendering. IEEE CAA J. Autom. Sinica 11(2): 502-513 (2024) - [j16]Haohe Liu, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Qiao Tian, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley:
AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2871-2883 (2024) - [j15]Zhichao Wang, Liumeng Xue, Qiuqiang Kong, Lei Xie, Yuanzhe Chen, Qiao Tian, Yuping Wang:
Multi-Level Temporal-Channel Speaker Retrieval for Zero-Shot Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2926-2937 (2024) - [j14]Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang:
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3339-3354 (2024) - [j13]Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang:
Selective-Memory Meta-Learning With Environment Representations for Sound Event Localization and Detection. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4313-4327 (2024) - [c53]Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Learning Temporal Resolution in Spectrogram for Audio Classification. AAAI 2024: 13873-13881 - [c52]Junqi Zhao, Xubo Liu, Jinzheng Zhao, Yi Yuan, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Universal Sound Separation with Self-Supervised Audio Masked Autoencoder. EUSIPCO 2024: 1-5 - [c51]Wei Tsung Lu, Ju-Chiang Wang, Qiuqiang Kong, Yun-Ning Hung:
Music Source Separation With Band-Split Rope Transformer. ICASSP 2024: 481-485 - [c50]Dichucheng Li, Yinghao Ma, Weixing Wei, Qiuqiang Kong, Yulun Wu, Mingjin Che, Fan Xia, Emmanouil Benetos, Wei Li:
Mertech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model with Multi-Task Finetuning. ICASSP 2024: 521-525 - [c49]Xingjian Du, Zhesong Yu, Jiaju Lin, Bilei Zhu, Qiuqiang Kong:
Joint Music and Language Attention Models for Zero-Shot Music Tagging. ICASSP 2024: 1126-1130 - [c48]Zelin Ying, Chen Li, Yu Dong, Qiuqiang Kong, Qiao Tian, Yuanyuan Huo, Yuxuan Wang:
A Unified Front-End Framework for English Text-to-Speech Synthesis. ICASSP 2024: 10181-10185 - [i77]Yuheng Lin, Zheqi Dai, Qiuqiang Kong:
MusicScore: A Dataset for Music Score Modeling and Generation. CoRR abs/2406.11462 (2024) - [i76]Junqi Zhao, Xubo Liu, Jinzheng Zhao, Yi Yuan, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Universal Sound Separation with Self-Supervised Audio Masked Autoencoder. CoRR abs/2407.11745 (2024) - [i75]Yinghao Ma, Anders Øland, Anton Ragni, Bleiz Macsen Del Sette, Charalampos Saitis, Chris Donahue, Chenghua Lin, Christos Plachouras, Emmanouil Benetos, Elio Quinton, Elona Shatri, Fabio Morreale, Ge Zhang, György Fazekas, Gus Xia, Huan Zhang, Ilaria Manco, Jiawen Huang, Julien Guinot, Liwei Lin, Luca Marinelli, Max W. Y. Lam, Megha Sharma, Qiuqiang Kong, Roger B. Dannenberg, Ruibin Yuan, Shangda Wu, Shih-Lun Wu, Shuqi Dai, Shun Lei, Shiyin Kang, Simon Dixon, Wenhu Chen, Wenhao Huang, Xingjian Du, Xingwei Qu, Xu Tan, Yizhi Li, Zeyue Tian, Zhiyong Wu, Zhizheng Wu, Ziyang Ma, Ziyu Wang:
Foundation Models for Music: A Survey. CoRR abs/2408.14340 (2024) - [i74]Zhen Ye, Peiwen Sun, Jiahe Lei, Hongzhan Lin, Xu Tan, Zheqi Dai, Qiuqiang Kong, Jianyi Chen, Jiahao Pan, Qifeng Liu, Yike Guo, Wei Xue:
Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model. CoRR abs/2408.17175 (2024) - [i73]Haonan Chen, Jordan B. L. Smith, Janne Spijkervet, Ju-Chiang Wang, Pei Zou, Bochen Li, Qiuqiang Kong, Xingjian Du:
SymPAC: Scalable Symbolic Music Generation With Prompts And Constraints. CoRR abs/2409.03055 (2024) - [i72]Hao Ma, Zhiyuan Peng, Xu Li, Yukai Li, Mingjie Shao, Qiuqiang Kong, Ju Liu:
Language-Queried Target Sound Extraction Without Parallel Training Data. CoRR abs/2409.09398 (2024) - [i71]Yudong Yang, Zhan Liu, Wenyi Yu, Guangzhi Sun, Qiuqiang Kong, Chao Zhang:
Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement. CoRR abs/2409.09642 (2024) - 2023
- [j12]Jian Guan, Youde Liu, Qiuqiang Kong, Feiyang Xiao, Qiaoxi Zhu, Jiantong Tian, Wenwu Wang:
Transformer-based autoencoder with ID constraint for unsupervised anomalous sound detection. EURASIP J. Audio Speech Music. Process. 2023(1): 42 (2023) - [c47]Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang:
Streaming Voice Conversion via Intermediate Bottleneck Features and Non-Streaming Teacher Guidance. ICASSP 2023: 1-5 - [c46]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang:
Simple Pooling Front-Ends for Efficient Audio Classification. ICASSP 2023: 1-5 - [c45]Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, H. Lilian Tang, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. INTERSPEECH 2023: 2838-2842 - [c44]Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley:
Ontology-aware Learning and Evaluation for Audio Tagging. INTERSPEECH 2023: 3799-3803 - [i70]Kin Wai Cheuk, Keunwoo Choi, Qiuqiang Kong, Bochen Li, Minz Won, Ju-Chiang Wang, Yun-Ning Hung, Dorien Herremans:
Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training. CoRR abs/2302.00286 (2023) - [i69]Xinhao Mei, Chutong Meng, Haohe Liu, Qiuqiang Kong, Tom Ko, Chengqi Zhao, Mark D. Plumbley, Yuexian Zou, Wenwu Wang:
WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research. CoRR abs/2303.17395 (2023) - [i68]Zhichao Wang, Liumeng Xue, Qiuqiang Kong, Lei Xie, Yuanzhe Chen, Qiao Tian, Yuping Wang:
Multi-level Temporal-channel Speaker Retrieval for Robust Zero-shot Voice Conversion. CoRR abs/2305.07204 (2023) - [i67]Qiuqiang Kong, Ke Chen, Haohe Liu, Xingjian Du, Taylor Berg-Kirkpatrick, Shlomo Dubnov, Mark D. Plumbley:
Universal Source Separation with Weakly Labelled Data. CoRR abs/2305.07447 (2023) - [i66]Zelin Ying, Chen Li, Yu Dong, Qiuqiang Kong, Yuanyuan Huo, Yuping Wang, Yuxuan Wang:
a unified front-end framework for english text-to-speech synthesis. CoRR abs/2305.10666 (2023) - [i65]Xubo Liu, Zhongkai Zhu, Haohe Liu, Yi Yuan, Meng Cui, Qiushi Huang, Jinhua Liang, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
WavJourney: Compositional Audio Creation with Large Language Models. CoRR abs/2307.14335 (2023) - [i64]Xubo Liu, Qiuqiang Kong, Yan Zhao, Haohe Liu, Yi Yuan, Yuzhuo Liu, Rui Xia, Yuxuan Wang, Mark D. Plumbley, Wenwu Wang:
Separate Anything You Describe. CoRR abs/2308.05037 (2023) - [i63]Haohe Liu, Qiao Tian, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley:
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining. CoRR abs/2308.05734 (2023) - [i62]Wei Tsung Lu, Ju-Chiang Wang, Qiuqiang Kong, Yun-Ning Hung:
Music Source Separation with Band-Split RoPE Transformer. CoRR abs/2309.02612 (2023) - [i61]Jian Guan, Youde Liu, Qiuqiang Kong, Feiyang Xiao, Qiaoxi Zhu, Jiantong Tian, Wenwu Wang:
Transformer-based Autoencoder with ID Constraint for Unsupervised Anomalous Sound Detection. CoRR abs/2310.08950 (2023) - [i60]Dichucheng Li, Yinghao Ma, Weixing Wei, Qiuqiang Kong, Yulun Wu, Mingjin Che, Fan Xia, Emmanouil Benetos, Wei Li:
MERTech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model With Multi-Task Finetuning. CoRR abs/2310.09853 (2023) - [i59]Xingjian Du, Zhesong Yu, Jiaju Lin, Bilei Zhu, Qiuqiang Kong:
Joint Music and Language Attention Models for Zero-shot Music Tagging. CoRR abs/2310.10159 (2023) - [i58]Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang:
Selective-Memory Meta-Learning with Environment Representations for Sound Event Localization and Detection. CoRR abs/2312.16422 (2023) - 2022
- [j11]Jingqiao Zhao, Qiuqiang Kong, Xiaoning Song, Zhen-Hua Feng, Xiaojun Wu:
Feature Alignment for Robust Acoustic Scene Classification Across Devices. IEEE Signal Process. Lett. 29: 578-582 (2022) - [j10]Qiuqiang Kong, Bochen Li, Jitong Chen, Yuxuan Wang:
GiantMIDI-Piano: A Large-Scale MIDI Dataset for Classical Piano Music. Trans. Int. Soc. Music. Inf. Retr. 5(1): 87-98 (2022) - [c43]Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang:
Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation Chains. DCASE 2022 - [c42]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Segment-Level Metric Learning for Few-Shot Bioacoustic Event Detection. DCASE 2022 - [c41]Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang:
A Track-Wise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection. ICASSP 2022: 9196-9200 - [c40]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Separate What You Describe: Language-Queried Audio Source Separation. INTERSPEECH 2022: 1801-1805 - [c39]Haohe Liu, Woosung Choi, Xubo Liu, Qiuqiang Kong, Qiao Tian, DeLiang Wang:
Neural Vocoder is All You Need for Speech Super-resolution. INTERSPEECH 2022: 4227-4231 - [c38]Haohe Liu, Xubo Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao, DeLiang Wang, Chuanzeng Huang, Yuxuan Wang:
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration. INTERSPEECH 2022: 4232-4236 - [c37]Lele Liu, Qiuqiang Kong, Veronica Morfi, Emmanouil Benetos:
Performance MIDI-to-score conversion by neural beat tracking. ISMIR 2022: 395-402 - [i57]Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang:
A Track-Wise Ensemble Event Independent Network for Polyphonic Sound Event Localization and Detection. CoRR abs/2203.10228 (2022) - [i56]Haohe Liu, Woosung Choi, Xubo Liu, Qiuqiang Kong, Qiao Tian, DeLiang Wang:
Neural Vocoder is All You Need for Speech Super-resolution. CoRR abs/2203.14941 (2022) - [i55]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Separate What You Describe: Language-Queried Audio Source Separation. CoRR abs/2203.15147 (2022) - [i54]Haohe Liu, Xubo Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao, DeLiang Wang, Chuanzeng Huang, Yuxuan Wang:
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration. CoRR abs/2204.05841 (2022) - [i53]Kin Wai Cheuk, Keunwoo Choi, Qiuqiang Kong, Bochen Li, Minz Won, Amy Hung, Ju-Chiang Wang, Dorien Herremans:
Jointist: Joint Learning for Multi-instrument Transcription and Its Applications. CoRR abs/2206.10805 (2022) - [i52]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Segment-level Metric Learning for Few-shot Bioacoustic Event Detection. CoRR abs/2207.07773 (2022) - [i51]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Surrey System for DCASE 2022 Task 5: Few-shot Bioacoustic Event Detection with Segment-level Metric Learning. CoRR abs/2207.10547 (2022) - [i50]Jinbo Hu, Yin Cao, Ming Wu, Qiuqiang Kong, Feiran Yang, Mark D. Plumbley, Jun Yang:
Sound Event Localization and Detection for Real Spatial Sound Scenes: Event-Independent Network and Data Augmentation Chains. CoRR abs/2209.01802 (2022) - [i49]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang:
Simple Pooling Front-ends For Efficient Audio Classification. CoRR abs/2210.00943 (2022) - [i48]Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Learning the Spectrogram Temporal Resolution for Audio Classification. CoRR abs/2210.01719 (2022) - [i47]Qiuqiang Kong, Shilei Liu, Junjie Shi, Xuzhou Ye, Yin Cao, Qiaoxi Zhu, Yong Xu, Yuxuan Wang:
Neural Sound Field Decomposition with Super-resolution of Sound Direction. CoRR abs/2210.12345 (2022) - [i46]Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang:
Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance. CoRR abs/2210.15158 (2022) - [i45]Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, H. Lilian Tang, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. CoRR abs/2210.16428 (2022) - [i44]Yin Zhu, Qiuqiang Kong, Junjie Shi, Shilei Liu, Xuzhou Ye, Ju-Chiang Wang, Junping Zhang:
Binaural Rendering of Ambisonic Signals by Neural Networks. CoRR abs/2211.02301 (2022) - [i43]Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley:
Ontology-aware Learning and Evaluation for Audio Tagging. CoRR abs/2211.12195 (2022) - 2021
- [j9]Qiuqiang Kong, Bochen Li, Xuchen Song, Yuan Wan, Yuxuan Wang:
High-Resolution Piano Transcription With Pedals by Regressing Onset and Offset Times. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3707-3717 (2021) - [j8]Jie Jiang, Qiuqiang Kong, Mark D. Plumbley, Nigel Gilbert, Mark Hoogendoorn, Diederik M. Roijers:
Deep Learning-Based Energy Disaggregation and On/Off Detection of Household Appliances. ACM Trans. Knowl. Discov. Data 15(3): 50:1-50:21 (2021) - [j7]Zhao Ren, Qiuqiang Kong, Jing Han, Mark D. Plumbley, Björn W. Schuller:
CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification. IEEE Trans. Multim. 23: 4131-4142 (2021) - [c36]Lam Pham, Chris Baume, Qiuqiang Kong, Tassadaq Hussain, Wenwu Wang, Mark D. Plumbley:
An Audio-Based Deep Learning Framework For BBC Television Programme Classification. EUSIPCO 2021: 56-60 - [c35]Xingjian Du, Bilei Zhu, Qiuqiang Kong, Zejun Ma:
Singing Melody Extraction from Polyphonic Music based on Spectral Correlation Modeling. ICASSP 2021: 241-245 - [c34]Yin Cao, Turab Iqbal, Qiuqiang Kong, Fengyan An, Wenwu Wang, Mark D. Plumbley:
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection. ICASSP 2021: 885-889 - [c33]Qiuqiang Kong, Haohe Liu, Xingjian Du, Li Chen, Rui Xia, Yuxuan Wang:
Speech Enhancement with Weakly Labelled Data from AudioSet. Interspeech 2021: 191-195 - [c32]Qiuqiang Kong, Yin Cao, Haohe Liu, Keunwoo Choi, Yuxuan Wang:
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation. ISMIR 2021: 342-349 - [c31]Liwei Lin, Gus Xia, Qiuqiang Kong, Junyan Jiang:
A unified model for zero-shot music source separation, transcription and synthesis. ISMIR 2021: 381-388 - [i42]Xuchen Song, Qiuqiang Kong, Xingjian Du, Yuxuan Wang:
CatNet: music source separation system with mix-audio augmentation. CoRR abs/2102.09966 (2021) - [i41]Qiuqiang Kong, Haohe Liu, Xingjian Du, Li Chen, Rui Xia, Yuxuan Wang:
Speech enhancement with weakly labelled data from AudioSet. CoRR abs/2102.09971 (2021) - [i40]Feiyang Xiao, Jian Guan, Qiuqiang Kong, Wenwu Wang:
Time-domain Speech Enhancement with Generative Adversarial Learning. CoRR abs/2103.16149 (2021) - [i39]Lam Pham, Chris Baume, Qiuqiang Kong, Tassadaq Hussain, Wenwu Wang, Mark D. Plumbley:
An Audio-Based Deep Learning Framework ForBBC Television Programme Classification. CoRR abs/2104.01161 (2021) - [i38]Liwei Lin, Qiuqiang Kong, Junyan Jiang, Gus Xia:
A Unified Model for Zero-shot Music Source Separation, Transcription and Synthesis. CoRR abs/2108.03456 (2021) - [i37]Qiuqiang Kong, Yin Cao, Haohe Liu, Keunwoo Choi, Yuxuan Wang:
Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation. CoRR abs/2109.05418 (2021) - [i36]Haohe Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao, DeLiang Wang, Chuanzeng Huang, Yuxuan Wang:
VoiceFixer: Toward General Speech Restoration With Neural Vocoder. CoRR abs/2109.13731 (2021) - [i35]Haohe Liu, Qiuqiang Kong, Jiafeng Liu:
CWS-PResUNet: Music Source Separation with Channel-wise Subband Phase-aware ResUNet. CoRR abs/2112.04685 (2021) - 2020
- [j6]Boqing Zhu, Kele Xu, Qiuqiang Kong, Huaimin Wang, Yuxing Peng:
Audio Tagging by Cross Filtering Noisy Labels. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2073-2083 (2020) - [j5]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Sound Event Detection of Weakly Labelled Data With CNN-Transformer and Automatic Threshold Optimization. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2450-2460 (2020) - [j4]Qiuqiang Kong, Yin Cao, Turab Iqbal, Yuxuan Wang, Wenwu Wang, Mark D. Plumbley:
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2880-2894 (2020) - [c30]Yin Cao, Turab Iqbal, Qiuqiang Kong, Yue Zhong, Wenwu Wang, Mark D. Plumbley:
Event-Independent Network for Polyphonic Sound Event Localization and Detection. DCASE 2020: 11-15 - [c29]Tomoya Koike, Kun Qian, Qiuqiang Kong, Mark D. Plumbley, Björn W. Schuller, Yoshiharu Yamamoto:
Audio for Audio is Better? An Investigation on Transfer Learning Models for Heart Sound Classification. EMBC 2020: 74-77 - [c28]Qiuqiang Kong, Yuxuan Wang, Xuchen Song, Yin Cao, Wenwu Wang, Mark D. Plumbley:
Source Separation with Weakly Labelled Data: an Approach to Computational Auditory Scene Analysis. ICASSP 2020: 101-105 - [c27]Turab Iqbal, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Learning With Out-of-Distribution Data for Audio Classification. ICASSP 2020: 636-640 - [i34]Qiuqiang Kong, Yuxuan Wang, Xuchen Song, Yin Cao, Wenwu Wang, Mark D. Plumbley:
Source separation with weakly labelled data: An approach to computational auditory scene analysis. CoRR abs/2002.02065 (2020) - [i33]Turab Iqbal, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Learning with Out-of-Distribution Data for Audio Classification. CoRR abs/2002.04683 (2020) - [i32]Boqing Zhu, Kele Xu, Qiuqiang Kong, Huaimin Wang, Yuxing Peng:
Audio Tagging by Cross Filtering Noisy Labels. CoRR abs/2007.08165 (2020) - [i31]Jingqiao Zhao, Zhen-Hua Feng, Qiuqiang Kong, Xiaoning Song, Xiao-Jun Wu:
DD-CNN: Depthwise Disout Convolutional Neural Network for Low-complexity Acoustic Scene Classification. CoRR abs/2007.12864 (2020) - [i30]Yin Cao, Turab Iqbal, Qiuqiang Kong, Yue Zhong, Wenwu Wang, Mark D. Plumbley:
Event-Independent Network for Polyphonic Sound Event Localization and Detection. CoRR abs/2010.00140 (2020) - [i29]Qiuqiang Kong, Bochen Li, Xuchen Song, Yuan Wan, Yuxuan Wang:
High-resolution Piano Transcription with Pedals by Regressing Onsets and Offsets Times. CoRR abs/2010.01815 (2020) - [i28]Qiuqiang Kong, Bochen Li, Jitong Chen, Yuxuan Wang:
GiantMIDI-Piano: A large-scale MIDI dataset for classical piano music. CoRR abs/2010.07061 (2020) - [i27]Yin Cao, Turab Iqbal, Qiuqiang Kong, Fengyan An, Wenwu Wang, Mark D. Plumbley:
An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection. CoRR abs/2010.13092 (2020) - [i26]Qiuqiang Kong, Keunwoo Choi, Yuxuan Wang:
Large-Scale MIDI-based Composer Classification. CoRR abs/2010.14805 (2020) - [i25]Zhao Ren, Qiuqiang Kong, Jing Han, Mark D. Plumbley, Björn W. Schuller:
CAA-Net: Conditional Atrous CNNs with Attention for Explainable Device-robust Acoustic Scene Classification. CoRR abs/2011.09299 (2020)
2010 – 2019
- 2019
- [j3]Yingwei Fu, Kele Xu, Haibo Mi, Qiuqiang Kong, Dezhi Wang, Huaimin Wang, Tie Hong:
Multi Model-Based Distillation for Sound Event Detection. IEICE Trans. Inf. Syst. 102-D(10): 2055-2058 (2019) - [j2]Qiuqiang Kong, Yong Xu, Iwona Sobieraj, Wenwu Wang, Mark D. Plumbley:
Sound Event Detection and Time-Frequency Segmentation from Weakly Labelled Data. IEEE ACM Trans. Audio Speech Lang. Process. 27(4): 777-787 (2019) - [j1]Qiuqiang Kong, Changsong Yu, Yong Xu, Turab Iqbal, Wenwu Wang, Mark D. Plumbley:
Weakly Labelled AudioSet Tagging With Attention Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1791-1802 (2019) - [c26]Yin Cao, Qiuqiang Kong, Turab Iqbal, Fengyan An, Wenwu Wang, Mark D. Plumbley:
Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy. DCASE 2019: 30-34 - [c25]Zhao Ren, Jing Han, Nicholas Cummins, Qiuqiang Kong, Mark D. Plumbley, Björn W. Schuller:
Multi-instance Learning for Bipolar Disorder Diagnosis using Weakly Labelled Speech Data. PDH 2019: 79-83 - [c24]Yuanbo Hou, Qiuqiang Kong, Shengchen Li, Mark D. Plumbley:
Sound Event Detection with Sequentially Labelled Data Based on Connectionist Temporal Classification and Unsupervised Clustering. ICASSP 2019: 46-50 - [c23]Zhao Ren, Qiuqiang Kong, Jing Han, Mark D. Plumbley, Björn W. Schuller:
Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes. ICASSP 2019: 56-60 - [c22]Qiuqiang Kong, Yong Xu, Turab Iqbal, Yin Cao, Wenwu Wang, Mark D. Plumbley:
Acoustic Scene Generation with Conditional Samplernn. ICASSP 2019: 925-929 - [c21]Cemre Zor, Muhammad Awais, Josef Kittler, Miroslaw Bober, Sameed Husain, Qiuqiang Kong, Christian Kroos:
Divergence Based Weighting for Information Channels in Deep Convolutional Neural Networks for Bird Audio Detection. ICASSP 2019: 3052-3056 - [c20]Qiuqiang Kong, Yong Xu, Philip J. B. Jackson, Wenwu Wang, Mark D. Plumbley:
Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks. IJCAI 2019: 2747-2753 - [i24]Qiuqiang Kong, Changsong Yu, Turab Iqbal, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Weakly labelled AudioSet Classification with Attention Neural Networks. CoRR abs/1903.00765 (2019) - [i23]Qiuqiang Kong, Yin Cao, Turab Iqbal, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Cross-task learning for audio tagging, sound event detection and spatial localization: DCASE 2019 baseline systems. CoRR abs/1904.03476 (2019) - [i22]Yuanbo Hou, Qiuqiang Kong, Shengchen Li, Mark D. Plumbley:
Sound Event Detection with Sequentially Labelled Data Based on Connectionist Temporal Classification and Unsupervised Clustering. CoRR abs/1904.12102 (2019) - [i21]Yin Cao, Qiuqiang Kong, Turab Iqbal, Fengyan An, Wenwu Wang, Mark D. Plumbley:
Polyphonic Sound Event Detection and Localization using a Two-Stage Strategy. CoRR abs/1905.00268 (2019) - [i20]Qiuqiang Kong, Yong Xu, Wenwu Wang, Philip J. B. Jackson, Mark D. Plumbley:
Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks. CoRR abs/1906.07552 (2019) - [i19]Jie Jiang, Qiuqiang Kong, Mark D. Plumbley, Nigel Gilbert:
Deep Learning Based Energy Disaggregation and On/Off Detection of Household Appliances. CoRR abs/1908.00941 (2019) - [i18]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Sound Event Detection of Weakly Labelled Data with CNN-Transformer and Automatic Threshold Optimization. CoRR abs/1912.04761 (2019) - [i17]Qiuqiang Kong, Yin Cao, Turab Iqbal, Yuxuan Wang, Wenwu Wang, Mark D. Plumbley:
PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition. CoRR abs/1912.10211 (2019) - 2018
- [c19]Yuanbo Hou, Qiuqiang Kong, Shengchen Li:
Audio Tagging With Connectionist Temporal Classification Model Using Sequentially Labelled Data. CSPS (2) 2018: 955-964 - [c18]Zhao Ren, Qiuqiang Kong, Kun Qian, Mark D. Plumbley, Björn W. Schuller:
Attention-based convolutional neural networks for acoustic scene classification. DCASE 2018: 39-43 - [c17]Yuanbo Hou, Qiuqiang Kong, Jun Wang, Shengchen Li:
Polyphonic audio tagging with sequentially labelled data using CRNN with learnable gated linear units. DCASE 2018: 78-82 - [c16]Shengyun Wei, Kele Xu, Dezhi Wang, Feifan Liao, Huaimin Wang, Qiuqiang Kong:
Sample mixed-based data augmentation for domestic audio tagging. DCASE 2018: 93-97 - [c15]Changsong Yu, Karim Said Barsim, Qiuqiang Kong, Bin Yang:
Multi-level attention model for weakly supervised audio classification. DCASE 2018: 188-192 - [c14]Turab Iqbal, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
General-purpose audio tagging from noisy labels using convolutional neural networks. DCASE 2018: 212-216 - [c13]Qiuqiang Kong, Turab Iqbal, Yong Xu, Wenwu Wang, Mark D. Plumbley:
DCASE 2018 Challenge Surrey cross-task convolutional neural network baseline. DCASE 2018: 217-221 - [c12]Turab Iqbal, Yong Xu, Qiuqiang Kong, Wenwu Wang:
Capsule Routing for Sound Event Detection. EUSIPCO 2018: 2255-2259 - [c11]Alfredo Zermini, Qiuqiang Kong, Yong Xu, Mark D. Plumbley, Wenwu Wang:
Improving Reverberant Speech Separation with Binaural Cues Using Temporal Context and Convolutional Neural Networks. LVA/ICA 2018: 361-371 - [c10]Yong Xu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Large-Scale Weakly Supervised Audio Classification Using Gated Convolutional Neural Network. ICASSP 2018: 121-125 - [c9]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Audio Set Classification with Attention Model: A Probabilistic Perspective. ICASSP 2018: 316-320 - [c8]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
A Joint Separation-Classification Model for Sound Event Detection of Weakly Labelled Data. ICASSP 2018: 321-325 - [c7]Jie Jiang, Mark Hoogendoorn, Qiuqiang Kong, Diederik M. Roijers, Nigel Gilbert:
Predicting Appliance Usage Status In Home Like Environments. DSP 2018: 1-5 - [i16]Changsong Yu, Karim Said Barsim, Qiuqiang Kong, Bin Yang:
Multi-level Attention Model for Weakly Supervised Audio Classification. CoRR abs/1803.02353 (2018) - [i15]Qiuqiang Kong, Yong Xu, Iwona Sobieraj, Wenwu Wang, Mark D. Plumbley:
Sound Event Detection and Time-Frequency Segmentation from Weakly Labelled Data. CoRR abs/1804.04715 (2018) - [i14]Turab Iqbal, Yong Xu, Qiuqiang Kong, Wenwu Wang:
Capsule Routing for Sound Event Detection. CoRR abs/1806.04699 (2018) - [i13]Qiuqiang Kong, Turab Iqbal, Yong Xu, Wenwu Wang, Mark D. Plumbley:
DCASE 2018 Challenge baseline with convolutional neural networks. CoRR abs/1808.00773 (2018) - [i12]Yuanbo Hou, Qiuqiang Kong, Shengchen Li:
Audio Tagging With Connectionist Temporal Classification Model Using Sequential Labelled Data. CoRR abs/1808.01935 (2018) - [i11]Shengyun Wei, Kele Xu, Dezhi Wang, Feifan Liao, Huaimin Wang, Qiuqiang Kong:
Sample Mixed-Based Data Augmentation for Domestic Audio Tagging. CoRR abs/1808.03883 (2018) - [i10]Kele Xu, Boqing Zhu, Qiuqiang Kong, Haibo Mi, Bo Ding, Dezhi Wang, Huaimin Wang:
General audio tagging with ensembling convolutional neural network and statistical features. CoRR abs/1810.12832 (2018) - [i9]Dezhi Wang, Lilun Zhang, Changchun Bao, Kele Xu, Boqing Zhu, Qiuqiang Kong:
Weakly supervised CRNN system for sound event detection with large-scale unlabeled in-domain data. CoRR abs/1811.00301 (2018) - [i8]Yuanbo Hou, Qiuqiang Kong, Jun Wang, Shengchen Li:
Polyphonic audio tagging with sequentially labelled data using CRNN with learnable gated linear units. CoRR abs/1811.07072 (2018) - 2017
- [c6]Qiuqiang Kong, Yong Xu, Mark D. Plumbley:
Joint detection and classification convolutional neural network on weakly labelled bird audio detection. EUSIPCO 2017: 1749-1753 - [c5]Iwona Sobieraj, Qiuqiang Kong, Mark D. Plumbley:
Masked non-negative matrix factorization for eire detection using weakly labeled data. EUSIPCO 2017: 1769-1773 - [c4]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
A joint detection-classification model for audio tagging of weakly labelled data. ICASSP 2017: 641-645 - [c3]Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Convolutional gated recurrent neural network incorporating spatial features for audio tagging. IJCNN 2017: 3461-3466 - [c2]Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging. INTERSPEECH 2017: 3083-3087 - [i7]Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Convolutional Gated Recurrent Neural Network Incorporating Spatial Features for Audio Tagging. CoRR abs/1702.07787 (2017) - [i6]Yong Xu, Qiuqiang Kong, Qiang Huang, Wenwu Wang, Mark D. Plumbley:
Attention and Localization based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging. CoRR abs/1703.06052 (2017) - [i5]Yong Xu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Surrey-cvssp system for DCASE2017 challenge task4. CoRR abs/1709.00551 (2017) - [i4]Yong Xu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Large-scale weakly supervised audio classification using gated convolutional neural network. CoRR abs/1710.00343 (2017) - [i3]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
Audio Set classification with attention model: A probabilistic perspective. CoRR abs/1711.00927 (2017) - [i2]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
A joint separation-classification model for sound event detection of weakly labelled data. CoRR abs/1711.03037 (2017) - 2016
- [c1]Qiuqiang Kong, Iwona Sobieraj, Wenwu Wang, Mark D. Plumbley:
Deep Neural Network Baseline for DCASE Challenge 2016. DCASE 2016: 50-54 - [i1]Qiuqiang Kong, Yong Xu, Wenwu Wang, Mark D. Plumbley:
A Joint Detection-Classification Model for Audio Tagging of Weakly Labelled Data. CoRR abs/1610.01797 (2016)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-14 21:01 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint