default search action
Linhao Dong
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c30]Zhiyun Fan, Linhao Dong, Jun Zhang, Lu Lu, Zejun Ma:
SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR. ICASSP 2024: 9986-9990 - [i15]Zhiyun Fan, Linhao Dong, Jun Zhang, Lu Lu, Zejun Ma:
SA-SOT: Speaker-Aware Serialized Output Training for Multi-Talker ASR. CoRR abs/2403.02010 (2024) - [i14]Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li, Xiaoyang Li, Zeyang Li, Zehua Lin, Rui Liu, Shouda Liu, Lu Lu, Yizhou Lu, Jingting Ma, Shengtao Ma, Yulin Pei, Chen Shen, Tian Tan, Xiaogang Tian, Ming Tu, Bo Wang, Hao Wang, Yuping Wang, Yuxuan Wang, Hanzhang Xia, Rui Xia, Shuangyi Xie, Hongmin Xu, Meng Yang, Bihong Zhang, Jun Zhang, Wanyi Zhang, Yang Zhang, Yawei Zhang, Yijie Zheng, Ming Zou:
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition. CoRR abs/2407.04675 (2024) - [i13]Minglun Han, Ye Bai, Chen Shen, Youjia Huang, Mingkun Huang, Zehua Lin, Linhao Dong, Lu Lu, Yuxuan Wang:
NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training. CoRR abs/2409.08680 (2024) - 2023
- [c29]Linhao Dong, Zhecheng An, Peihao Wu, Jun Zhang, Lu Lu, Zejun Ma:
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training. ACL (Findings) 2023: 8894-8907 - [c28]Zhiyun Fan, Linhao Dong, Chen Shen, Zhenlin Liang, Jun Zhang, Lu Lu, Zejun Ma:
Language-specific Boundary Learning for Improving Mandarin-English Code-switching Speech Recognition. INTERSPEECH 2023: 3322-3326 - [i12]Linhao Dong, Zhecheng An, Peihao Wu, Jun Zhang, Lu Lu, Zejun Ma:
CIF-PT: Bridging Speech and Text Representations for Spoken Language Understanding via Continuous Integrate-and-Fire Pre-Training. CoRR abs/2305.17499 (2023) - [i11]Zhiyun Fan, Linhao Dong, Chen Shen, Zhenlin Liang, Jun Zhang, Lu Lu, Zejun Ma:
Language-specific Acoustic Boundary Learning for Mandarin-English Code-switching Speech Recognition. CoRR abs/2306.05279 (2023) - 2022
- [j6]Zhiyun Fan, Linhao Dong, Meng Cai, Zejun Ma, Bo Xu:
Sequence-Level Speaker Change Detection With Difference-Based Continuous Integrate-and-Fire. IEEE Signal Process. Lett. 29: 1551-1554 (2022) - [c27]Linhao Dong, Yanbing Bai, Qingsong Xu, Erick Mas:
Optimizing the Post-disaster Resource Allocation with Q-Learning: Demonstration of 2021 China Flood. DEXA (2) 2022: 256-262 - [c26]Minglun Han, Linhao Dong, Zhenlin Liang, Meng Cai, Shiyu Zhou, Zejun Ma, Bo Xu:
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection. ICASSP 2022: 8532-8536 - [c25]Zhiyun Fan, Zhenlin Liang, Linhao Dong, Yi Liu, Shiyu Zhou, Meng Cai, Jun Zhang, Zejun Ma, Bo Xu:
Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire. INTERSPEECH 2022: 3749-3753 - [i10]Minglun Han, Linhao Dong, Zhenlin Liang, Meng Cai, Shiyu Zhou, Zejun Ma, Bo Xu:
Improving End-to-End Contextual Speech Recognition with Fine-grained Contextual Knowledge Selection. CoRR abs/2201.12806 (2022) - [i9]Zhiyun Fan, Linhao Dong, Meng Cai, Zejun Ma, Bo Xu:
Sequence-level Speaker Change Detection with Difference-based Continuous Integrate-and-fire. CoRR abs/2206.13110 (2022) - [i8]Zhiyun Fan, Zhenlin Liang, Linhao Dong, Yi Liu, Shiyu Zhou, Meng Cai, Jun Zhang, Zejun Ma, Bo Xu:
Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire. CoRR abs/2211.09381 (2022) - 2021
- [c24]Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu:
Cif-Based Collaborative Decoding for End-to-End Contextual Speech Recognition. ICASSP 2021: 6528-6532 - 2020
- [c23]Linhao Dong, Bo Xu:
CIF: Continuous Integrate-And-Fire for End-To-End Speech Recognition. ICASSP 2020: 6079-6083 - [i7]Linhao Dong, Cheng Yi, Jianzong Wang, Shiyu Zhou, Shuang Xu, Xueli Jia, Bo Xu:
A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition. CoRR abs/2005.10113 (2020) - [i6]Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu:
cif-based collaborative decoding for end-to-end contextual speech recognition. CoRR abs/2012.09466 (2020)
2010 – 2019
- 2019
- [c22]Linhao Dong, Feng Wang, Bo Xu:
Self-attention Aligner: A Latency-control End-to-end Model for ASR Using Self-attention Network and Chunk-hopping. ICASSP 2019: 5656-5660 - [c21]Yuxiang Zou, Linhao Dong, Bo Xu:
Boosting Character-Based Chinese Speech Synthesis via Multi-Task Learning and Dictionary Tutoring. INTERSPEECH 2019: 2055-2059 - [i5]Linhao Dong, Feng Wang, Bo Xu:
Self-Attention Aligner: A Latency-Control End-to-End Model for ASR Using Self-Attention Network and Chunk-Hopping. CoRR abs/1902.06450 (2019) - [i4]Linhao Dong, Bo Xu:
CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition. CoRR abs/1905.11235 (2019) - 2018
- [c20]Linhao Dong, Shuang Xu, Bo Xu:
Speech-Transformer: A No-Recurrence Sequence-to-Sequence Model for Speech Recognition. ICASSP 2018: 5884-5888 - [c19]Shiyu Zhou, Linhao Dong, Shuang Xu, Bo Xu:
A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese. ICONIP (5) 2018: 210-220 - [c18]Yuanyuan Zhao, Linhao Dong, Shuang Xu, Bo Xu:
Syllable-Based Acoustic Modeling with CTC for Multi-Scenarios Mandarin speech recognition. IJCNN 2018: 1-8 - [c17]Shiyu Zhou, Linhao Dong, Shuang Xu, Bo Xu:
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese. INTERSPEECH 2018: 791-795 - [c16]Linhao Dong, Shiyu Zhou, Wei Chen, Bo Xu:
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin. INTERSPEECH 2018: 816-820 - [i3]Shiyu Zhou, Linhao Dong, Shuang Xu, Bo Xu:
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese. CoRR abs/1804.10752 (2018) - [i2]Shiyu Zhou, Linhao Dong, Shuang Xu, Bo Xu:
A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese. CoRR abs/1805.06239 (2018) - [i1]Linhao Dong, Shiyu Zhou, Wei Chen, Bo Xu:
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin. CoRR abs/1806.06342 (2018) - 2017
- [j5]Baihong Lin, Xiaoming Tao, Mai Xu, Linhao Dong, Jianhua Lu:
Bayesian Hyperspectral and Multispectral Image Fusions via Double Matrix Factorization. IEEE Trans. Geosci. Remote. Sens. 55(10): 5666-5678 (2017) - [c15]Linhao Dong, Dusit Niyato, Dong In Kim, Dinh Thai Hoang:
A Joint Scheduling and Content Caching Scheme for Energy Harvesting Access Points with Multicast. GLOBECOM 2017: 1-6 - [c14]Zhongxiang Wei, Sumei Sun, Xu Zhu, Yi Huang, Linhao Dong, Dong In Kim:
Wireless Information and Power Transfer: Spectral Efficiency Optimization for Asymmetric Full-Duplex Relay Systems. VTC Spring 2017: 1-5 - [c13]Zizhuo Zhang, Shaoyang Li, Xiaoming Tao, Linhao Dong, Jianhua Lu:
Online Bayesian Learning for Remote-Sensing Imagery Compression. VTC Spring 2017: 1-5 - 2016
- [j4]Xiaoming Tao, Linhao Dong, Yang Li, Jianhua Lu:
The THU multi-view face database for videoconferences and baseline evaluations. Neurocomputing 207: 48-59 (2016) - [j3]Yipeng Sun, Xiaoming Tao, Yang Li, Linhao Dong, Jianhua Lu:
HEMS: Hierarchical Exemplar-Based Matching-Synthesis for Object-Aware Image Reconstruction. IEEE Trans. Multim. 18(2): 171-181 (2016) - [c12]Jichuan Lu, Xiaoming Tao, Linhao Dong, Ning Ge:
Chunk-wise face model based gaze correction in conversational videos with single camera. CITS 2016: 1-5 - [c11]Baihong Lin, Xiaoming Tao, Shaoyang Li, Linhao Dong, Jianhua Lu:
Variational Bayesian image fusion based on combined sparse representations. ICASSP 2016: 1432-1436 - [c10]Baihong Lin, Xiaoming Tao, Linhao Dong, Jianhua Lu:
Variational EM approach for high resolution hyper-spectral imaging based on probabilistic matrix factorization. ICIP 2016: 1774-1778 - [c9]Nanyang Ye, Xiaoming Tao, Linhao Dong, Ning Ge:
Mouse calibration aided real-time gaze estimation based on boost Gaussian Bayesian learning. ICIP 2016: 2797-2801 - 2015
- [j2]Zhongxiang Wei, Xu Zhu, Sumei Sun, Yi Huang, Linhao Dong, Yufei Jiang:
Full-Duplex Versus Half-Duplex Amplify-and-Forward Relaying: Which is More Energy Efficient in 60-GHz Dual-Hop Indoor Wireless Systems? IEEE J. Sel. Areas Commun. 33(12): 2936-2947 (2015) - [j1]Xiaoming Tao, Linhao Dong, Yang Li, Jizhe Zhou, Ning Ge, Jianhua Lu:
Real-time personalized content catering via viewer sentiment feedback: a QoE perspective. IEEE Netw. 29(6): 14-19 (2015) - [c8]Shaoyang Li, Xiaoming Tao, Linhao Dong, Jianhua Lu:
A nonparametric Bayesian approach to joint multiple dictionary learning with separate image sources. GlobalSIP 2015: 1155-1159 - [c7]Yingjie Zhang, Wei Feng, Linhao Dong, Ning Ge:
Pilot sequence design for multi-cell distributed MIMO systems with large-scale CSI. ICC 2015: 1739-1744 - [c6]Linhao Dong, Xiaoming Tao, Yang Li, Jichuan Lu, Zizhuo Zhang, Jingwen Cheng, Jianhua Lu:
The THU multi-view face database for videoconferences. ICIP 2015: 417-421 - [c5]Nanyang Ye, Linhao Dong, Xiaoming Tao, Ning Ge:
Efficient Multi-Cell Clustering for Coordinated Multi-Point Transmission with Blossom Tree Algorithm. VTC Fall 2015: 1-4 - 2014
- [c4]Xiaoming Tao, Linhao Dong, Shaoyang Li, Yang Li, Ning Ge, Jianhua Lu:
A cognitive perspective for information processing in bandwidth-limited wireless communications. ICCI*CC 2014: 289-295 - 2013
- [c3]Linhao Dong, Xu Zhu, Yi Huang:
Optimal asymmetric resource allocation for dual-hop multi-relay LTE-Advanced systems in the downlink. ICC 2013: 4625-4629 - [c2]Linhao Dong, Sumei Sun, Xu Zhu, Yeow-Khiang Chia:
Power efficient 60 GHz wireless communication networks with relays. PIMRC 2013: 2808-2812 - 2011
- [c1]Linhao Dong, Xu Zhu, Yi Huang:
Optimal Asymmetric Resource Allocation for Multi-Relay Based LTE-Advanced Systems. GLOBECOM 2011: 1-5
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-15 20:41 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint