default search action
Shansong Liu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c23]Shansong Liu, Atin Sakkeer Hussain, Chenshuo Sun, Ying Shan:
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning. ICASSP 2024: 286-290 - [c22]Tianjun Mao, Shansong Liu, Yunxuan Zhang, Dian Li, Ying Shan:
Unified Pretraining Target Based Video-Music Retrieval with Music Rhythm and Video Optical Flow Information. ICASSP 2024: 7890-7894 - [c21]Shansong Liu, Xu Li, Dian Li, Ying Shan:
Humtrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond. ICASSP 2024: 7915-7919 - 2023
- [c20]Zhihan Yang, Shansong Liu, Xu Li, Haozhe Wu, Zhiyong Wu, Ying Shan, Jia Jia:
Prosody Modeling with 3D Visual Information for Expressive Video Dubbing. INTERSPEECH 2023: 4863-4867 - [i14]Shansong Liu, Atin Sakkeer Hussain, Chenshuo Sun, Ying Shan:
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning. CoRR abs/2308.11276 (2023) - [i13]Tianjun Mao, Shansong Liu, Yunxuan Zhang, Dian Li, Ying Shan:
Unified Pretraining Target Based Video-music Retrieval With Music Rhythm And Video Optical Flow Information. CoRR abs/2309.09421 (2023) - [i12]Shansong Liu, Xu Li, Dian Li, Ying Shan:
HumTrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond. CoRR abs/2309.09623 (2023) - [i11]Atin Sakkeer Hussain, Shansong Liu, Chenshuo Sun, Ying Shan:
M2UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models. CoRR abs/2311.11255 (2023) - 2022
- [j6]Shoukang Hu, Xurong Xie, Mingyu Cui, Jiajun Deng, Shansong Liu, Jianwei Yu, Mengzhe Geng, Xunying Liu, Helen Meng:
Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1093-1107 (2022) - [c19]Shujie Hu, Shansong Liu, Xurong Xie, Mengzhe Geng, Tianzi Wang, Shoukang Hu, Mingyu Cui, Xunying Liu, Helen Meng:
Exploiting Cross Domain Acoustic-to-Articulatory Inverted Features for Disordered Speech Recognition. ICASSP 2022: 6747-6751 - [c18]Xu Li, Shansong Liu, Ying Shan:
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion. INTERSPEECH 2022: 4307-4311 - [i10]Shoukang Hu, Xurong Xie, Mingyu Cui, Jiajun Deng, Shansong Liu, Jianwei Yu, Mengzhe Geng, Xunying Liu, Helen Meng:
Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks. CoRR abs/2201.03943 (2022) - [i9]Mengzhe Geng, Shansong Liu, Jianwei Yu, Xurong Xie, Shoukang Hu, Zi Ye, Zengrui Jin, Xunying Liu, Helen Meng:
Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition. CoRR abs/2201.05554 (2022) - [i8]Mengzhe Geng, Xurong Xie, Shansong Liu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Meng:
Investigation of Data Augmentation Techniques for Disordered Speech Recognition. CoRR abs/2201.05562 (2022) - [i7]Shansong Liu, Mengzhe Geng, Shoukang Hu, Xurong Xie, Mingyu Cui, Jianwei Yu, Xunying Liu, Helen Meng:
Recent Progress in the CUHK Dysarthric Speech Recognition System. CoRR abs/2201.05845 (2022) - [i6]Shujie Hu, Shansong Liu, Xurong Xie, Mengzhe Geng, Tianzi Wang, Shoukang Hu, Mingyu Cui, Xunying Liu, Helen Meng:
Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition. CoRR abs/2203.10274 (2022) - [i5]Xu Li, Shansong Liu, Ying Shan:
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion. CoRR abs/2206.13762 (2022) - 2021
- [j5]Shoukang Hu, Xurong Xie, Shansong Liu, Jianwei Yu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng:
Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1514-1529 (2021) - [j4]Jianwei Yu, Shi-Xiong Zhang, Bo Wu, Shansong Liu, Shoukang Hu, Mengzhe Geng, Xunying Liu, Helen Meng, Dong Yu:
Audio-Visual Multi-Channel Integration and Recognition of Overlapped Speech. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2067-2082 (2021) - [j3]Shansong Liu, Mengzhe Geng, Shoukang Hu, Xurong Xie, Mingyu Cui, Jianwei Yu, Xunying Liu, Helen Meng:
Recent Progress in the CUHK Dysarthric Speech Recognition System. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2267-2281 (2021) - [c17]Zi Ye, Shoukang Hu, Jinchao Li, Xurong Xie, Mengzhe Geng, Jianwei Yu, Junhao Xu, Boyang Xue, Shansong Liu, Xunying Liu, Helen Meng:
Development of the Cuhk Elderly Speech Recognition System for Neurocognitive Disorder Detection Using the Dementiabank Corpus. ICASSP 2021: 6433-6437 - [c16]Shoukang Hu, Xurong Xie, Shansong Liu, Mingyu Cui, Mengzhe Geng, Xunying Liu, Helen Meng:
Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks. ICASSP 2021: 6758-6762 - [c15]Boyang Xue, Jianwei Yu, Junhao Xu, Shansong Liu, Shoukang Hu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng:
Bayesian Transformer Language Models for Speech Recognition. ICASSP 2021: 7378-7382 - [c14]Mengzhe Geng, Shansong Liu, Jianwei Yu, Xurong Xie, Shoukang Hu, Zi Ye, Zengrui Jin, Xunying Liu, Helen Meng:
Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition. Interspeech 2021: 4793-4797 - [c13]Zengrui Jin, Mengzhe Geng, Xurong Xie, Jianwei Yu, Shansong Liu, Xunying Liu, Helen Meng:
Adversarial Data Augmentation for Disordered Speech Recognition. Interspeech 2021: 4803-4807 - [c12]Jiajun Deng, Fabian Ritter Gutierrez, Shoukang Hu, Mengzhe Geng, Xurong Xie, Zi Ye, Shansong Liu, Jianwei Yu, Xunying Liu, Helen Meng:
Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition. Interspeech 2021: 4818-4822 - [i4]Boyang Xue, Jianwei Yu, Junhao Xu, Shansong Liu, Shoukang Hu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng:
Bayesian Transformer Language Models for Speech Recognition. CoRR abs/2102.04754 (2021) - [i3]Zengrui Jin, Mengzhe Geng, Xurong Xie, Jianwei Yu, Shansong Liu, Xunying Liu, Helen Meng:
Adversarial Data Augmentation for Disordered Speech Recognition. CoRR abs/2108.00899 (2021) - 2020
- [c11]Jianwei Yu, Shi-Xiong Zhang, Jian Wu, Shahram Ghorbani, Bo Wu, Shiyin Kang, Shansong Liu, Xunying Liu, Helen Meng, Dong Yu:
Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset. ICASSP 2020: 6984-6988 - [c10]Mengzhe Geng, Xurong Xie, Shansong Liu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Meng:
Investigation of Data Augmentation Techniques for Disordered Speech Recognition. INTERSPEECH 2020: 696-700 - [c9]Shansong Liu, Xurong Xie, Jianwei Yu, Shoukang Hu, Mengzhe Geng, Rongfeng Su, Shi-Xiong Zhang, Xunying Liu, Helen Meng:
Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition. INTERSPEECH 2020: 711-715 - [i2]Jianwei Yu, Shi-Xiong Zhang, Jian Wu, Shahram Ghorbani, Bo Wu, Shiyin Kang, Shansong Liu, Xunying Liu, Helen Meng, Dong Yu:
Audio-visual Recognition of Overlapped speech for the LRS2 dataset. CoRR abs/2001.01656 (2020) - [i1]Shoukang Hu, Xurong Xie, Shansong Liu, Mengzhe Geng, Xunying Liu, Helen Meng:
Neural Architecture Search for Speech Recognition. CoRR abs/2007.08818 (2020)
2010 – 2019
- 2019
- [c8]Shoukang Hu, Max W. Y. Lam, Xurong Xie, Shansong Liu, Jianwei Yu, Xixin Wu, Xunying Liu, Helen Meng:
Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition. ICASSP 2019: 6555-6559 - [c7]Shoukang Hu, Xurong Xie, Shansong Liu, Max W. Y. Lam, Jianwei Yu, Xixin Wu, Xunying Liu, Helen Meng:
LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition. INTERSPEECH 2019: 2793-2797 - [c6]Shoukang Hu, Shansong Liu, Heng Fai Chang, Mengzhe Geng, Jiani Chen, Lau Wing Chung, To Ka Hei, Jianwei Yu, Ka Ho Wong, Xunying Liu, Helen Meng:
The CUHK Dysarthric Speech Recognition Systems for English and Cantonese. INTERSPEECH 2019: 3669-3670 - [c5]Shansong Liu, Shoukang Hu, Yi Wang, Jianwei Yu, Rongfeng Su, Xunying Liu, Helen Meng:
Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition. INTERSPEECH 2019: 4120-4124 - [c4]Shansong Liu, Shoukang Hu, Xunying Liu, Helen Meng:
On the Use of Pitch Features for Disordered Speech Recognition. INTERSPEECH 2019: 4130-4134 - 2018
- [j2]Shansong Liu, Kui Hua, Sijie Chen, Xuegong Zhang:
Comprehensive simulation of metagenomic sequencing data with non-uniform sampling distribution. Quant. Biol. 6(2): 175-185 (2018) - [c3]Xunying Liu, Shansong Liu, Jinze Sha, Jianwei Yu, Zhiyuan Xu, Xie Chen, Helen Meng:
Limited-Memory BFGS Optimization of Recurrent Neural Network Language Models for Speech Recognition. ICASSP 2018: 6114-6118 - [c2]Max W. Y. Lam, Shoukang Hu, Xurong Xie, Shansong Liu, Jianwei Yu, Rongfeng Su, Xunying Liu, Helen Meng:
Gaussian Process Neural Networks for Speech Recognition. INTERSPEECH 2018: 1778-1782 - [c1]Jianwei Yu, Xurong Xie, Shansong Liu, Shoukang Hu, Max W. Y. Lam, Xixin Wu, Ka Ho Wong, Xunying Liu, Helen Meng:
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus. INTERSPEECH 2018: 2938-2942 - 2017
- [j1]Xuegong Zhang, Shansong Liu, Hongfei Cui, Ting Chen:
Reading the Underlying Information From Massive Metagenomic Sequencing Data. Proc. IEEE 105(3): 459-473 (2017)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-21 20:29 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint