default search action
12th ISCSLP 2021: Hong Kong, SAR, China
- 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021, Hong Kong, January 24-27, 2021. IEEE 2021, ISBN 978-1-7281-6994-1
- Jie Li, Zhiyun Fan, Xiaorui Wang, Yan Li:
Syllable-Based Acoustic Modeling With Lattice-Free MMI for Mandarin Speech Recognition. 1-5 - Jingyan Zhou, Xiaoying Zhang, Xiaohan Feng, King Keung Wu, Helen Meng:
Automatic Extraction of Semantic Patterns in Dialogs using Convex Polytopic Model. 1-5 - Taiyang Guo, Jianwu Dang, Gaoyan Zhang, Bin Zhao, Masashi Unoki:
Frequency-specific Brain Network Dynamics during Perceiving Real Words and Pseudowords. 1-5 - Qing Wang, Wei Rao, Pengcheng Guo, Lei Xie:
Adversarial Training for Multi-domain Speaker Recognition. 1-5 - Chenglong Wang, Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian:
Hierarchically Attending Time-Frequency and Channel Features for Improving Speaker Verification. 1-5 - Junyi Ao, Tom Ko:
Improving Attention-based End-to-end ASR by Incorporating an N-gram Neural Network. 1-5 - Murong Ma, Haiwei Wu, Xuyang Wang, Lin Yang, Junjie Wang, Ming Li:
Acoustic Word Embedding System for Code-Switching Query-by-example Spoken Term Detection. 1-5 - Fan Yang, Junfeng Li, Yonghong Yan:
A New Method for Improving Generative Adversarial Networks in Speech Enhancement. 1-5 - Xiong Cai, Zhiyong Wu, Kuo Zhong, Bin Su, Dongyang Dai, Helen Meng:
Unsupervised Cross-Lingual Speech Emotion Recognition Using Domain Adversarial Neural Network. 1-5 - Cunhang Fan, Bin Liu, Jianhua Tao, Jiangyan Yi, Zhengqi Wen, Leichao Song:
Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning. 1-5 - Weizhe Wang, Hongwu Yang:
Towards Realizing Sign Language to Emotional Speech Conversion by Deep Learning. 1-5 - Gan Huang, Aijun Li, Sichen Zhang, Liang Zhang:
Prosody and Dialogue Act: A Perceptual Study on Chinese Interrogatives. 1-5 - Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. 1-5 - Linkai Peng, Wang Dai, Dengfeng Ke, Jinsong Zhang:
Multi-Scale Model for Mandarin Tone Recognition. 1-5 - Tuo Zhao, Yunxin Zhao, Shaojun Wang, Mei Han:
UNet++-Based Multi-Channel Speech Dereverberation and Distant Speech Recognition. 1-5 - Zhaoqi Li, Long Wu, Ta Li, Yonghong Yan:
Improves Neural Acoustic Word Embeddings Query by Example Spoken Term Detection with Wav2vec Pretraining and Circle Loss. 1-5 - Lujia Yang, Hongwei Ding:
Comparing the Rhythm of Instrumental Music and Vocal Music in Mandarin and English. 1-5 - Yi-Hsuan Wang, Jia-Hao Hsu, Chung-Hsien Wu, Tsung-Hsien Yang:
Transformer-based Empathetic Response Generation Using Dialogue Situation and Advanced-Level Definition of Empathy. 1-5 - Disong Wang, Jianwei Yu, Xixin Wu, Lifa Sun, Xunying Liu, Helen Meng:
Improved End-to-End Dysarthric Speech Recognition via Meta-learning Based Model Re-initialization. 1-5 - Tao Li, Shan Yang, Liumeng Xue, Lei Xie:
Controllable Emotion Transfer For End-to-End Speech Synthesis. 1-5 - Shuwen Chen, Peggy Pik Ki Mok:
Articulatory and Acoustic Features of Mandarin /ɹ/: A Preliminary Study. 1-5 - Yun Feng, Yan Feng, Chenwei Xie, William Shi-Yuan Wang:
Age-Related Decline of Classifier Usage in Southwestern Mandarin. 1-5 - Yu-Tao Chang, Yuan-Hong Yang, Yu-Huai Peng, Syu-Siang Wang, Tai-Shih Chi, Yu Tsao, Hsin-Min Wang:
MoEVC: A Mixture of Experts Voice Conversion System With Sparse Gating Mechanism for Online Computation Acceleration. 1-5 - Yaru Wu, Lori Lamel, Martine Adda-Decker:
Tone Realization in Mandarin Speech: A Large Corpus Based Study of Disyllabic Words. 1-5 - Xiaoyan Zhang, Aijun Li, Zhiqiang Li:
Complex Patterns of Tonal Realization in Taifeng Chinese. 1-5 - Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Jianhua Tao, Ye Bai:
Rnn-transducer With Language Bias For End-to-end Mandarin-English Code-switching Speech Recognition. 1-5 - Yuqing Zhang, Zhu Li, Jinsong Zhang:
A Comparison Study on the Alignment of Prosodic and Semantic Units and Its Effects on F0 Shifting in L1 and L2 English Spontaneous Speech. 1-5 - Yuewen Cao, Songxiang Liu, Shiyin Kang, Na Hu, Peng Liu, Xunying Liu, Dan Su, Dong Yu, Helen Meng:
Exploring Cross-lingual Singing Voice Synthesis Using Speech Data. 1-5 - Meng Ge, Ruixiong Zhang, Wei Zou, Xiangang Li, Cheng Gong, Longbiao Wang, Jianwu Dang:
Order-aware Pairwise Intoxication Detection. 1-5 - Huan Lei, Jianwu Dang, Yu Chen:
An Eye-tracking Study of Transposed-letter Effect in English Word Recognition by Mandarin Speakers. 1-5 - Zheying Huang, Peng Li, Ji Xu, Pengyuan Zhang, Yonghong Yan:
Context-dependent Label Smoothing Regularization for Attention-based End-to-End Code-Switching Speech Recognition. 1-5 - Tingle Li, Jiawei Chen, Haowen Hou, Ming Li:
Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation. 1-5 - Yihan Guan, Bin Li:
Usability And Practicality of Speech Recording by Mobile Phones for Phonetic Analysis. 1-5 - Wenwei Xu, Peggy Mok:
The Acoustic Correlates and Time Span of the Non-modal Phonation in Kunshan Wu Chinese. 1-5 - Sean Shensheng Xu, Man-Wai Mak, Ka Ho Wong, Helen Meng, Timothy C. Y. Kwok:
Age-Invariant Speaker Embedding for Diarization of Cognitive Assessments. 1-5 - Keiichi Funaki:
On Adaptive LASSO-based Sparse Time-Varying Complex AR Speech Analysis. 1-5 - Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. 1-5 - Sixia Li, Jianwu Dang, Longbiao Wang:
Spoken Language Understanding with Sememe Knowledge as Domain Knowledge. 1-5 - Changjie Pan, Fei Chen:
Impact of Mismatched Spectral Amplitude Levels on Vowel Identification in Simulated Electric-acoustic Hearing. 1-5 - Jinru Zhu, Changchun Bao:
GAN-Based Inter-Channel Amplitude Ratio Decoding in Multi-Channel Speech Coding. 1-5 - Xun Gong, Zhengyang Chen, Yexin Yang, Shuai Wang, Lan Wang, Yanmin Qian:
Speaker Embedding Augmentation with Noise Distribution Matching. 1-5 - Yu-Chen Hung, Tzu-Hui Lin:
Rapid Word Learning of Children with Cochlear Implants: Phonological Structure and Mutual Exclusivity. 1-4 - Shanpeng Li, Wentao Gu:
Prosodic Profiles of the Mandarin Speech Conveying Ironic Compliment. 1-5 - Fengpeng Yue, Tom Ko:
An Investigation of Positional Encoding in Transformer-based End-to-end Speech Recognition. 1-5 - Xin Li, Yin Huang, Yunheng Xu, Linxin Yi, Yuming Yuan, Min Xiang:
Production of Tone 3 Sandhi by Advanced Korean Learners of Mandarin. 1-5 - Ying Zhang, Hao Che, Xiaorui Wang:
Non-parallel Sequence-to-Sequence Voice Conversion for Arbitrary Speakers. 1-5 - Feng Bao, Yuepeng Li, Shidong Shang:
Low-complexity Post-processing Method for Speech Enhancement. 1-5 - Shuai Wang, Yexin Yang, Yanmin Qian, Kai Yu:
Revisiting the Statistics Pooling Layer in Deep Speaker Embedding Learning. 1-5 - Guangyan Zhang, Shirong Qiu, Ying Qin, Tan Lee:
Estimating Mutual Information in Prosody Representation for Emotional Prosody Transfer in Speech Synthesis. 1-5 - Chunyu Qiang, Jianhua Tao, Ruibo Fu, Zhengqi Wen, Jiangyan Yi, Tao Wang, Shiming Wang:
Text Enhancement for Paragraph Processing in End-to-End Code-switching TTS. 1-5 - Anna Gutnyk, Oliver Niebuhr, Wentao Gu:
Speaker Charisma Analyzed through the Cultural Lens. 1-5 - Qiuyuan Li, Yuan Jia:
An Experimental Research on Tonal Errors in Monosyllables of Standard Spoken Chinese Language Produced by Uyghur Learners. 1-5 - Ying Qin, Yao Qian, Anastassia Loukina, Patrick L. Lange, Abhinav Misra, Keelan Evanini, Tan Lee:
Automatic Detection of Word-Level Reading Errors in Non-native English Speech Based on ASR Output. 1-5 - Mengfei Wu, Longbiao Wang, Yuke Si, Jianwu Dang:
Dialogue Act Recognition using Branch Architecture with Attention Mechanism for Imbalanced Data. 1-5 - Yu Gu, Xiang Yin, Yonghui Rao, Yuan Wan, Benlai Tang, Yang Zhang, Jitong Chen, Yuxuan Wang, Zejun Ma:
ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders. 1-5 - Wei Zhang, Meghan Clayards, Jinsong Zhang:
Effects of Mandarin Tones on Acoustic Cue Weighting Patterns for Prominence. 1-5 - Xinran Ren, Peggy Mok:
Consonantal Effects of Aspiration on Onset F0 in Cantonese. 1-5 - Wenjie Peng, Yingming Gao, Binghuai Lin, Jinsong Zhang:
A Practical Way to Improve Automatic Phonetic Segmentation Performance. 1-5 - Lingjun Zhao, Man-Wai Mak:
Channel Interdependence Enhanced Speaker Embeddings for Far-Field Speaker Verification. 1-5 - Kun Wei, Pengcheng Guo, Hang Lv, Zhen Tu, Lei Xie:
Context-aware RNNLM Rescoring for Conversational Speech Recognition. 1-5 - Zheng Lian, Rongxiu Zhong, Zhengqi Wen, Bin Liu, Jianhua Tao:
Towards Fine-Grained Prosody Control for Voice Conversion. 1-5 - Meidan Ouyang, Rohan Kumar Das, Jichen Yang, Haizhou Li:
Capsule Network based End-to-end System for Detection of Replay Attacks. 1-5 - Chang Liu, Yang Ai, Zhenhua Ling:
Phase Spectrum Recovery for Enhancing Low-Quality Speech Captured by Laser Microphones. 1-5 - Meet H. Soni, Ashish Panda:
LDA-based Speaker Verification in Multi-Enrollment Scenario using Expected Vector Approach. 1-5 - Zezheng Xu, Ting Jiang, Chao Li, JiaCheng Yu:
An Attention-augmented Fully Convolutional Neural Network for Monaural Speech Enhancement. 1-5 - Changfeng Gao, Gaofeng Cheng, Jun Zhou, Pengyuan Zhang, Yonghong Yan:
Non-autoregressive Deliberation-Attention based End-to-End ASR. 1-5 - Qing Wang, Huaxin Wu, Zijun Jing, Feng Ma, Yi Fang, Yuxuan Wang, Tairan Chen, Jia Pan, Jun Du, Chin-Hui Lee:
A Model Ensemble Approach for Sound Event Localization and Detection. 1-5 - Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Audio Caption in a Car Setting with a Sentence-Level Loss. 1-5 - Wai-Sum Lee, Irene Ching-Yin Tsoi:
Acoustical Characteristics of the Cantonese Vowels and Tones Produced by Hearing Impaired Speakers. 1-5 - Siyuan Zheng, Jun Du, Hengshun Zhou, Xue Bai, Chin-Hui Lee, Shipeng Li:
Speech Emotion Recognition Based on Acoustic Segment Model. 1-5 - Zhichao Wang, Wenshuo Ge, Xiong Wang, Shan Yang, Wendong Gan, Haitao Chen, Hai Li, Lei Xie, Xiulin Li:
Accent and Speaker Disentanglement in Many-to-many Voice Conversion. 1-5 - Guolei Jiang, Chunhong Liao, Kun Li, Pengfei Liu, Linying Jiang, Helen Meng:
Automatic Speaker-level Pronunciation Assessment of L2 Speech Using Posterior Probabilities from Multiple Utterances. 1-5
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.