Shansong Liu


2020 – today

2024
[c23]
persistent URL: https://dblp.org/rec/conf/icassp/LiuHSS24
Shansong Liu, Atin Sakkeer Hussain, Chenshuo Sun, Ying Shan:
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning. ICASSP 2024: 286-290
[c22]
persistent URL: https://dblp.org/rec/conf/icassp/MaoLZLS24
Tianjun Mao, Shansong Liu, Yunxuan Zhang, Dian Li, Ying Shan:
Unified Pretraining Target Based Video-Music Retrieval with Music Rhythm and Video Optical Flow Information. ICASSP 2024: 7890-7894
[c21]
persistent URL: https://dblp.org/rec/conf/icassp/LiuLLS24
Shansong Liu, Xu Li, Dian Li, Ying Shan:
HumTrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond. ICASSP 2024: 7915-7919
2023
[c20]
persistent URL: https://dblp.org/rec/conf/interspeech/YangLLW0S023
Zhihan Yang, Shansong Liu, Xu Li, Haozhe Wu, Zhiyong Wu, Ying Shan, Jia Jia:
Prosody Modeling with 3D Visual Information for Expressive Video Dubbing. INTERSPEECH 2023: 4863-4867
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2308-11276
Shansong Liu, Atin Sakkeer Hussain, Chenshuo Sun, Ying Shan:
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning. CoRR abs/2308.11276 (2023)
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-09421
Tianjun Mao, Shansong Liu, Yunxuan Zhang, Dian Li, Ying Shan:
Unified Pretraining Target Based Video-music Retrieval With Music Rhythm And Video Optical Flow Information. CoRR abs/2309.09421 (2023)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-09623
Shansong Liu, Xu Li, Dian Li, Ying Shan:
HumTrans: A Novel Open-Source Dataset for Humming Melody Transcription and Beyond. CoRR abs/2309.09623 (2023)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2311-11255
Atin Sakkeer Hussain, Shansong Liu, Chenshuo Sun, Ying Shan:
M²UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models. CoRR abs/2311.11255 (2023)
2022
[j6]
persistent URL: https://dblp.org/rec/journals/taslp/HuXCDLYGLM22
Shoukang Hu, Xurong Xie, Mingyu Cui, Jiajun Deng, Shansong Liu, Jianwei Yu, Mengzhe Geng, Xunying Liu, Helen Meng:
Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1093-1107 (2022)
[c19]
persistent URL: https://dblp.org/rec/conf/icassp/HuLXGWHCLM22
Shujie Hu, Shansong Liu, Xurong Xie, Mengzhe Geng, Tianzi Wang, Shoukang Hu, Mingyu Cui, Xunying Liu, Helen Meng:
Exploiting Cross Domain Acoustic-to-Articulatory Inverted Features for Disordered Speech Recognition. ICASSP 2022: 6747-6751
[c18]
persistent URL: https://dblp.org/rec/conf/interspeech/LiLS22
Xu Li, Shansong Liu, Ying Shan:
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion. INTERSPEECH 2022: 4307-4311
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2201-03943
Shoukang Hu, Xurong Xie, Mingyu Cui, Jiajun Deng, Shansong Liu, Jianwei Yu, Mengzhe Geng, Xunying Liu, Helen Meng:
Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks. CoRR abs/2201.03943 (2022)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2201-05554
Mengzhe Geng, Shansong Liu, Jianwei Yu, Xurong Xie, Shoukang Hu, Zi Ye, Zengrui Jin, Xunying Liu, Helen Meng:
Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition. CoRR abs/2201.05554 (2022)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2201-05562
Mengzhe Geng, Xurong Xie, Shansong Liu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Meng:
Investigation of Data Augmentation Techniques for Disordered Speech Recognition. CoRR abs/2201.05562 (2022)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2201-05845
Shansong Liu, Mengzhe Geng, Shoukang Hu, Xurong Xie, Mingyu Cui, Jianwei Yu, Xunying Liu, Helen Meng:
Recent Progress in the CUHK Dysarthric Speech Recognition System. CoRR abs/2201.05845 (2022)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-10274
Shujie Hu, Shansong Liu, Xurong Xie, Mengzhe Geng, Tianzi Wang, Shoukang Hu, Mingyu Cui, Xunying Liu, Helen Meng:
Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition. CoRR abs/2203.10274 (2022)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2206-13762
Xu Li, Shansong Liu, Ying Shan:
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion. CoRR abs/2206.13762 (2022)
2021
[j5]
persistent URL: https://dblp.org/rec/journals/taslp/HuXLYYGLM21
Shoukang Hu, Xurong Xie, Shansong Liu, Jianwei Yu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng:
Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1514-1529 (2021)
[j4]
persistent URL: https://dblp.org/rec/journals/taslp/YuZWLHGLMY21
Jianwei Yu, Shi-Xiong Zhang, Bo Wu, Shansong Liu, Shoukang Hu, Mengzhe Geng, Xunying Liu, Helen Meng, Dong Yu:
Audio-Visual Multi-Channel Integration and Recognition of Overlapped Speech. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2067-2082 (2021)
[j3]
persistent URL: https://dblp.org/rec/journals/taslp/LiuGHXCYLM21
Shansong Liu, Mengzhe Geng, Shoukang Hu, Xurong Xie, Mingyu Cui, Jianwei Yu, Xunying Liu, Helen Meng:
Recent Progress in the CUHK Dysarthric Speech Recognition System. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2267-2281 (2021)
[c17]
persistent URL: https://dblp.org/rec/conf/icassp/YeHLXGYXXLLM21
Zi Ye, Shoukang Hu, Jinchao Li, Xurong Xie, Mengzhe Geng, Jianwei Yu, Junhao Xu, Boyang Xue, Shansong Liu, Xunying Liu, Helen Meng:
Development of the CUHK Elderly Speech Recognition System for Neurocognitive Disorder Detection Using the DementiaBank Corpus. ICASSP 2021: 6433-6437
[c16]
persistent URL: https://dblp.org/rec/conf/icassp/HuXLCGLM21
Shoukang Hu, Xurong Xie, Shansong Liu, Mingyu Cui, Mengzhe Geng, Xunying Liu, Helen Meng:
Neural Architecture Search for LF-MMI Trained Time Delay Neural Networks. ICASSP 2021: 6758-6762
[c15]
persistent URL: https://dblp.org/rec/conf/icassp/XueYXLHYGLM21
Boyang Xue, Jianwei Yu, Junhao Xu, Shansong Liu, Shoukang Hu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng:
Bayesian Transformer Language Models for Speech Recognition. ICASSP 2021: 7378-7382
[c14]
persistent URL: https://dblp.org/rec/conf/interspeech/GengLYXHYJLM21
Mengzhe Geng, Shansong Liu, Jianwei Yu, Xurong Xie, Shoukang Hu, Zi Ye, Zengrui Jin, Xunying Liu, Helen Meng:
Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition. Interspeech 2021: 4793-4797
[c13]
persistent URL: https://dblp.org/rec/conf/interspeech/JinGXYLLM21
Zengrui Jin, Mengzhe Geng, Xurong Xie, Jianwei Yu, Shansong Liu, Xunying Liu, Helen Meng:
Adversarial Data Augmentation for Disordered Speech Recognition. Interspeech 2021: 4803-4807
[c12]
persistent URL: https://dblp.org/rec/conf/interspeech/DengGHGXYLYLM21
Jiajun Deng, Fabian Ritter Gutierrez, Shoukang Hu, Mengzhe Geng, Xurong Xie, Zi Ye, Shansong Liu, Jianwei Yu, Xunying Liu, Helen Meng:
Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition. Interspeech 2021: 4818-4822
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2102-04754
Boyang Xue, Jianwei Yu, Junhao Xu, Shansong Liu, Shoukang Hu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng:
Bayesian Transformer Language Models for Speech Recognition. CoRR abs/2102.04754 (2021)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2108-00899
Zengrui Jin, Mengzhe Geng, Xurong Xie, Jianwei Yu, Shansong Liu, Xunying Liu, Helen Meng:
Adversarial Data Augmentation for Disordered Speech Recognition. CoRR abs/2108.00899 (2021)
2020
[c11]
persistent URL: https://dblp.org/rec/conf/icassp/YuZWGWKLLMY20
Jianwei Yu, Shi-Xiong Zhang, Jian Wu, Shahram Ghorbani, Bo Wu, Shiyin Kang, Shansong Liu, Xunying Liu, Helen Meng, Dong Yu:
Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset. ICASSP 2020: 6984-6988
[c10]
persistent URL: https://dblp.org/rec/conf/interspeech/GengXLYHLM20
Mengzhe Geng, Xurong Xie, Shansong Liu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Meng:
Investigation of Data Augmentation Techniques for Disordered Speech Recognition. INTERSPEECH 2020: 696-700
[c9]
persistent URL: https://dblp.org/rec/conf/interspeech/LiuXYHGSZLM20
Shansong Liu, Xurong Xie, Jianwei Yu, Shoukang Hu, Mengzhe Geng, Rongfeng Su, Shi-Xiong Zhang, Xunying Liu, Helen Meng:
Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition. INTERSPEECH 2020: 711-715
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-2001-01656
Jianwei Yu, Shi-Xiong Zhang, Jian Wu, Shahram Ghorbani, Bo Wu, Shiyin Kang, Shansong Liu, Xunying Liu, Helen Meng, Dong Yu:
Audio-visual Recognition of Overlapped speech for the LRS2 dataset. CoRR abs/2001.01656 (2020)
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-2007-08818
Shoukang Hu, Xurong Xie, Shansong Liu, Mengzhe Geng, Xunying Liu, Helen Meng:
Neural Architecture Search for Speech Recognition. CoRR abs/2007.08818 (2020)

2010 – 2019

2019
[c8]
persistent URL: https://dblp.org/rec/conf/icassp/HuLXLYWLM19
Shoukang Hu, Max W. Y. Lam, Xurong Xie, Shansong Liu, Jianwei Yu, Xixin Wu, Xunying Liu, Helen Meng:
Bayesian and Gaussian Process Neural Networks for Large Vocabulary Continuous Speech Recognition. ICASSP 2019: 6555-6559
[c7]
persistent URL: https://dblp.org/rec/conf/interspeech/HuXLLYWLM19
Shoukang Hu, Xurong Xie, Shansong Liu, Max W. Y. Lam, Jianwei Yu, Xixin Wu, Xunying Liu, Helen Meng:
LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition. INTERSPEECH 2019: 2793-2797
[c6]
persistent URL: https://dblp.org/rec/conf/interspeech/HuLCGCCHYWLM19
Shoukang Hu, Shansong Liu, Heng Fai Chang, Mengzhe Geng, Jiani Chen, Lau Wing Chung, To Ka Hei, Jianwei Yu, Ka Ho Wong, Xunying Liu, Helen Meng:
The CUHK Dysarthric Speech Recognition Systems for English and Cantonese. INTERSPEECH 2019: 3669-3670
[c5]
persistent URL: https://dblp.org/rec/conf/interspeech/LiuHWYSLM19
Shansong Liu, Shoukang Hu, Yi Wang, Jianwei Yu, Rongfeng Su, Xunying Liu, Helen Meng:
Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition. INTERSPEECH 2019: 4120-4124
[c4]
persistent URL: https://dblp.org/rec/conf/interspeech/LiuHLM19
Shansong Liu, Shoukang Hu, Xunying Liu, Helen Meng:
On the Use of Pitch Features for Disordered Speech Recognition. INTERSPEECH 2019: 4130-4134
2018
[j2]
persistent URL: https://dblp.org/rec/journals/quanbio/LiuHCZ18
Shansong Liu, Kui Hua, Sijie Chen, Xuegong Zhang:
Comprehensive simulation of metagenomic sequencing data with non-uniform sampling distribution. Quant. Biol. 6(2): 175-185 (2018)
[c3]
persistent URL: https://dblp.org/rec/conf/icassp/LiuLSYXCM18
Xunying Liu, Shansong Liu, Jinze Sha, Jianwei Yu, Zhiyuan Xu, Xie Chen, Helen Meng:
Limited-Memory BFGS Optimization of Recurrent Neural Network Language Models for Speech Recognition. ICASSP 2018: 6114-6118
[c2]
persistent URL: https://dblp.org/rec/conf/interspeech/LamHXLYSLM18
Max W. Y. Lam, Shoukang Hu, Xurong Xie, Shansong Liu, Jianwei Yu, Rongfeng Su, Xunying Liu, Helen Meng:
Gaussian Process Neural Networks for Speech Recognition. INTERSPEECH 2018: 1778-1782
[c1]
persistent URL: https://dblp.org/rec/conf/interspeech/YuXLHLWWLM18
Jianwei Yu, Xurong Xie, Shansong Liu, Shoukang Hu, Max W. Y. Lam, Xixin Wu, Ka Ho Wong, Xunying Liu, Helen Meng:
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus. INTERSPEECH 2018: 2938-2942
2017
[j1]
persistent URL: https://dblp.org/rec/journals/pieee/ZhangLCC17
Xuegong Zhang, Shansong Liu, Hongfei Cui, Ting Chen:
Reading the Underlying Information From Massive Metagenomic Sequencing Data. Proc. IEEE 105(3): 459-473 (2017)
