default search action
Yanhua Long
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j19]Shilin Wang, Haixin Guan, Shuang Wei, Yanhua Long:
Improving low-complexity and real-time DeepFilterNet2 for personalized speech enhancement. Int. J. Speech Technol. 27(2): 299-306 (2024) - [c32]Li Li, Yijie Li, Dongxing Xu, Haoran Wei, Yanhua Long:
Accent-Specific Vector Quantization for Joint Unsupervised and Supervised Training in Accent Robust Speech Recognition. ICASSP 2024: 10201-10205 - [c31]Renchang Dong, Yijie Li, Dongxing Xu, Yanhua Long:
Cross-Modal Parallel Training for Improving end-to-end Accented Speech Recognition. ICASSP 2024: 10396-10400 - [c30]Yu Zheng, Yajun Zhang, Chuanying Niu, Yibin Zhan, Yanhua Long, Dongxing Xu:
Score Calibration Based on Consistency Measure Factor for Speaker Verification. ICASSP 2024: 12371-12375 - [i13]Qingyu Liu, Longfei Song, Dongxing Xu, Yanhua Long:
ICSD: An Open-source Dataset for Infant Cry and Snoring Detection. CoRR abs/2408.10561 (2024) - 2023
- [j18]Jiangyu Han, Yanhua Long:
Heterogeneous separation consistency training for adaptation of unsupervised speech separation. EURASIP J. Audio Speech Music. Process. 2023(1): 6 (2023) - [j17]Yibo Duan, Yanhua Long, Yijie Li:
CI-Mix: cut instance mix for robust speaker verification. Int. J. Speech Technol. 26(4): 851-857 (2023) - [j16]Li Li, Yanhua Long, Dongxing Xu, Yijie Li:
Boosting Character-based Mandarin ASR via Chinese Pinyin Representation. Int. J. Speech Technol. 26(4): 895-902 (2023) - [j15]Yifan Zhou, Yanhua Long, Haoran Wei:
Acoustic-Sensing-Based Attribute-Driven Imbalanced Compensation for Anomalous Sound Detection without Machine Identity. Sensors 23(21): 8984 (2023) - [j14]Yibo Duan, Yanhua Long, Jiaen Liang:
Dual-model self-regularization and fusion for domain adaptation of robust speaker verification. Speech Commun. 155: 103001 (2023) - [c29]Xiaoxiao Wu, Dongxing Xu, Haoran Wei, Yanhua Long:
FEW-Shot Continual Learning with Weight Alignment and Positive Enhancement for Bioacoustic Event Detection. ICASSP 2023: 1-5 - [c28]Li Li, Dongxing Xu, Haoran Wei, Yanhua Long:
Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system. INTERSPEECH 2023: 2263-2267 - [c27]Jing Li, Yanhua Long, Yijie Li, Dongxing Xu:
Advanced RawNet2 with Attention-based Channel Masking for Synthetic Speech Detection. INTERSPEECH 2023: 2788-2792 - [c26]Xuefei Wang, Yanhua Long, Yijie Li, Haoran Wei:
Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition. INTERSPEECH 2023: 2923-2927 - [i12]Xuefei Wang, Yanhua Long, Yijie Li, Haoran Wei:
Multi-pass Training and Cross-information Fusion for Low-resource End-to-end Accented Speech Recognition. CoRR abs/2306.11309 (2023) - [i11]Yu Zheng, Yajun Zhang, Chuanying Niu, Yibin Zhan, Yanhua Long, Dongxing Xu:
UNISOUND System for VoxCeleb Speaker Recognition Challenge 2023. CoRR abs/2308.12526 (2023) - [i10]Yifan Zhou, Dongxing Xu, Haoran Wei, Yanhua Long:
Autoencoder with Group-based Decoder and Multi-task Optimization for Anomalous Sound Detection. CoRR abs/2311.08829 (2023) - 2022
- [j13]Yunhao Liang, Yanhua Long, Yijie Li, Jiaen Liang, Yuping Wang:
Joint framework with deep feature distillation and adaptive focal loss for weakly supervised audio tagging and acoustic event detection. Digit. Signal Process. 123: 103446 (2022) - [j12]Tiantian Tang, Yanhua Long, Yijie Li, Jiaen Liang:
Acoustic domain mismatch compensation in bird audio detection. Int. J. Speech Technol. 25(1): 251-260 (2022) - [j11]Jiangyu Han, Yan Shi, Yanhua Long, Jiaen Liang:
Exploring single channel speech separation for short-time text-dependent speaker verification. Int. J. Speech Technol. 25(1): 261-268 (2022) - [j10]Xuefei Wang, Yanhua Long, Dongxing Xu:
Universal and accent-discriminative encoders for conformer-based accent-invariant speech recognition. Int. J. Speech Technol. 25(4): 987-995 (2022) - [j9]Linqiang Wei, Yanhua Long, Haoran Wei, Yijie Li:
New Acoustic Features for Synthetic and Replay Spoofing Attack Detection. Symmetry 14(2): 274 (2022) - [c25]Jiangyu Han, Yanhua Long, Lukás Burget, Jan Cernocký:
DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation and Extraction. ICASSP 2022: 7292-7296 - [c24]Xiaofeng Ge, Jiangyu Han, Yanhua Long, Haixin Guan:
PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement. INTERSPEECH 2022: 916-920 - [c23]Yunhao Liang, Yanhua Long, Yijie Li, Jiaen Liang:
Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection. INTERSPEECH 2022: 1496-1500 - [i9]Yunhao Liang, Yanhua Long, Yijie Li, Jiaen Liang:
Selective Pseudo-labeling and Class-wise Discriminative Fusion for Sound Event Detection. CoRR abs/2203.02191 (2022) - [i8]Xiaofeng Ge, Jiangyu Han, Yanhua Long, Haixin Guan:
PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement. CoRR abs/2203.02263 (2022) - [i7]Jiangyu Han, Yanhua Long:
Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation. CoRR abs/2204.11032 (2022) - [i6]Li Li, Dongxing Xu, Haoran Wei, Yanhua Long:
Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system. CoRR abs/2211.01571 (2022) - 2021
- [j8]Yanhua Long, Shuang Wei, Jie Lian, Yijie Li:
Pronunciation augmentation for Mandarin-English code-switching speech recognition. EURASIP J. Audio Speech Music. Process. 2021(1): 34 (2021) - [c22]Tiantian Tang, Xinyuan Zhou, Yanhua Long, Yijie Li, Jiaen Liang:
CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier. APSIPA ASC 2021: 939-944 - [c21]Jiangyu Han, Wei Rao, Yanhua Long, Jiaen Liang:
Attention-Based Scaling Adaptation for Target Speech Extraction. ASRU 2021: 658-662 - [c20]Jiangyu Han, Xinyuan Zhou, Yanhua Long, Yijie Li:
Multi-Channel Target Speech Extraction with Channel Decorrelation and Target Speaker Adaptation. ICASSP 2021: 6094-6098 - [c19]Jiangyu Han, Wei Rao, Yannan Wang, Yanhua Long:
Improving Channel Decorrelation for Multi-Channel Target Speech Extraction. Interspeech 2021: 1847-1851 - [i5]Yunhao Liang, Yanhua Long, Yijie Li, Jiaen Liang:
Joint Weakly Supervised AT and AED Using Deep Feature Distillation and Adaptive Focal Loss. CoRR abs/2103.12388 (2021) - [i4]Tiantian Tang, Xinyuan Zhou, Yanhua Long, Yijie Li, Jiaen Liang:
CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier. CoRR abs/2103.14297 (2021) - 2020
- [j7]Renke He, Yanhua Long, Yijie Li, Jiaen Liang:
Mask-based blind source separation and MVDR beamforming in ASR. Int. J. Speech Technol. 23(1): 133-140 (2020) - [c18]Laipeng He, Qiang Shi, Lang Wu, Jianqing Sun, Renke He, Yanhua Long, Jiaen Liang:
The SHNU System for Blizzard Challenge 2020. Blizzard Challenge / Voice Conversion Challenge 2020 - [c17]Xinyuan Zhou, Emre Yilmaz, Yanhua Long, Yijie Li, Haizhou Li:
Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition. INTERSPEECH 2020: 1042-1046 - [c16]Xinyuan Zhou, Grandee Lee, Emre Yilmaz, Yanhua Long, Jiaen Liang, Haizhou Li:
Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-Based LVCSR. INTERSPEECH 2020: 5016-5020 - [i3]Xinyuan Zhou, Grandee Lee, Emre Yilmaz, Yanhua Long, Jiaen Liang, Haizhou Li:
Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-based LVCSR. CoRR abs/2006.10407 (2020) - [i2]Xinyuan Zhou, Emre Yilmaz, Yanhua Long, Yijie Li, Haizhou Li:
Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition. CoRR abs/2006.10414 (2020) - [i1]Jiangyu Han, Yanhua Long, Jiaen Liang:
Attention-based scaling adaptation for target speech extraction. CoRR abs/2010.10923 (2020)
2010 – 2019
- 2019
- [j6]Yanhua Long, Yijie Li, Shuang Wei, Qiaozheng Zhang, Chunxia Yang:
Large-Scale Semi-Supervised Training in Deep Learning Acoustic Model for ASR. IEEE Access 7: 133615-133627 (2019) - [c15]Zhimin Feng, Qiqi Tong, Yanhua Long, Shuang Wei, Chunxia Yang, Qiaozheng Zhang:
SHNU Anti-spoofing Systems for ASVspoof 2019 Challenge. APSIPA 2019: 548-552 - 2018
- [j5]Yanhua Long, Yijie Li, Bo Zhang:
Offline to online speaker adaptation for real-time deep neural network based LVCSR systems. Multim. Tools Appl. 77(21): 28101-28119 (2018) - [c14]Yanhua Long, Hong Ye, Yijie Li, Jiaen Liang:
Active Learning for LF-MMI Trained Neural Networks in ASR. INTERSPEECH 2018: 2898-2902 - [c13]Yiyan Wang, Yanhua Long:
Keyword Spotting Based On CTC and RNN For Mandarin Chinese Speech. ISCSLP 2018: 374-378 - 2017
- [j4]Yanhua Long, Hong Ye, Jifeng Ni:
Domain compensation based on phonetically discriminative features for speaker verification. Comput. Speech Lang. 41: 161-179 (2017) - [j3]Yanhua Long, Yijie Li, Hone Ye, Hongwei Mao:
Domain adaptation of lattice-free MMI based TDNN models for speech recognition. Int. J. Speech Technol. 20(1): 171-178 (2017) - [j2]Yan Zhang, Yanhua Long, Xiangrong Shen, Haoran Wei, Min Yang, Hong Ye, Hongwei Mao:
Articulatory movement features for short-duration text-dependent speaker verification. Int. J. Speech Technol. 20(4): 753-759 (2017) - 2016
- [j1]Haoran Wei, Yanhua Long, Hongwei Mao:
Improvements on self-adaptive voice activity detector for telephone data. Int. J. Speech Technol. 19(3): 623-630 (2016) - 2013
- [c12]Pierre Lanchantin, Peter Bell, Mark J. F. Gales, Thomas Hain, Xunying Liu, Yanhua Long, Jennifer Quinnell, Steve Renals, Oscar Saz, Matthew Stephen Seigel, Pawel Swietojanski, Philip C. Woodland:
Automatic Transcription of Multi-genre Media Archives. SLAM@INTERSPEECH 2013: 26-31 - [c11]Yanhua Long, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Matthew Stephen Seigel, Philip C. Woodland:
Improving lightly supervised training for broadcast transcription. INTERSPEECH 2013: 2187-2191 - 2012
- [c10]Peter Bell, Mark J. F. Gales, Pierre Lanchantin, Xunying Liu, Yanhua Long, Steve Renals, Pawel Swietojanski, Philip C. Woodland:
Transcription of multi-genre media archives using out-of-domain data. SLT 2012: 324-329 - 2011
- [c9]Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong Dai, Wu Guo:
Speaker characterization using spectral subband energy ratio based on Harmonic plus Noise Model. ICASSP 2011: 4520-4523 - [c8]Yanhua Long, Zhi-Jie Yan, Frank K. Soong, Li-Rong Dai, Wu Guo:
Improvements in Speaker Characterization Using Spectral Subband Energy Based on Harmonic plus Noise Model. INTERSPEECH 2011: 373-376 - 2010
- [c7]Wu Guo, Zhao Zhang, Yanhua Long, Li-Rong Dai:
N-gram nearest neighbor algorithm for voice password system. ICASSP 2010: 4438-4441 - [c6]Yanhua Long, Li-Rong Dai, Bin Ma, Wu Guo:
Effects of the phonological relevance in speaker verification. INTERSPEECH 2010: 2130-2133 - [c5]Ying Xu, Yan Song, Yanhua Long, Hai-Bing Zhong, Li-Rong Dai:
The description of iFlyTek Speech Lab system for NIST2009 Language Recognition Evaluation. ISCSLP 2010: 157-161 - [c4]Yanhua Long, Li-Rong Dai, Eryu Wang, Bin Ma, Wu Guo:
Non-negative matrix factorization based discriminative features for speaker verification. ISCSLP 2010: 291-295
2000 – 2009
- 2009
- [c3]Wu Guo, Yanhua Long, Yijie Li, Lei Pan, Eryu Wang, Li-Rong Dai:
iFLY system for the NIST 2008 speaker recognition evaluation. ICASSP 2009: 4209-4212 - [c2]Yanhua Long, Bin Ma, Haizhou Li, Wu Guo, Chng Eng Siong, Li-Rong Dai:
Exploiting prosodic information for Speaker Recognition. ICASSP 2009: 4225-4228 - 2008
- [c1]Yanhua Long, Wu Guo, Li-Rong Dai:
Interfusing the Confused Region Score of Speaker Verification Systems. ISCSLP 2008: 314-317
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:23 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint