default search action
Shigeki Matsuda
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2010 – 2019
- 2017
- [j15]Shigeki Matsuda, Teruaki Hayashi, Yutaka Ashikari, Yoshinori Shiga, Hidenori Kashioka, Keiji Yasuda, Hideo Okuma, Masao Uchiyama, Eiichiro Sumita, Hisashi Kawai, Satoshi Nakamura:
Development of the "VoiceTra" Multi-Lingual Speech Translation System. IEICE Trans. Inf. Syst. 100-D(4): 621-632 (2017) - [c46]Tsubasa Ochiai, Shigeki Matsuda, Hideyuki Watanabe, Shigeru Katagiri:
Automatic node selection for Deep Neural Networks using Group Lasso regularization. ICASSP 2017: 5485-5489 - 2016
- [j14]Tsubasa Ochiai, Shigeki Matsuda, Hideyuki Watanabe, Xugang Lu, Chiori Hori, Hisashi Kawai, Shigeru Katagiri:
Speaker Adaptive Training Localizing Speaker Modules in DNN for Hybrid DNN-HMM Speech Recognizers. IEICE Trans. Inf. Syst. 99-D(10): 2431-2443 (2016) - [c45]Tsubasa Ochiai, Shigeki Matsuda, Hideyuki Watanabe, Xugang Lu, Hisashi Kawai, Shigeru Katagiri:
Bottleneck linear transformation network adaptation for speaker adaptive training-based hybrid DNN-HMM speech recognizer. ICASSP 2016: 5015-5019 - [i1]Tsubasa Ochiai, Shigeki Matsuda, Hideyuki Watanabe, Shigeru Katagiri:
Automatic Node Selection for Deep Neural Networks using Group Lasso Regularization. CoRR abs/1611.05527 (2016) - 2015
- [c44]Tsubasa Ochiai, Shigeki Matsuda, Hideyuki Watanabe, Xugang Lu, Chiori Hori, Shigeru Katagiri:
Speaker adaptive training for deep neural networks embedding linear transformation networks. ICASSP 2015: 4605-4609 - 2014
- [j13]Yu Tsao, Xugang Lu, Paul R. Dixon, Ting-Yao Hu, Shigeki Matsuda, Chiori Hori:
Incorporating local information of the acoustic environments to MAP-based feature compensation and acoustic model adaptation. Comput. Speech Lang. 28(3): 709-726 (2014) - [j12]Yu Tsao, Shigeki Matsuda, Chiori Hori, Hideki Kashioka, Chin-Hui Lee:
A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 22(2): 403-416 (2014) - [j11]Hideyuki Watanabe, Tsukasa Ohashi, Shigeru Katagiri, Miho Ohsaki, Shigeki Matsuda, Hideki Kashioka:
Robust and Efficient Pattern Classification using Large Geometric Margin Minimum Classification Error Training. J. Signal Process. Syst. 74(3): 297-310 (2014) - [j10]Hideyuki Watanabe, Jun'ichi Tokuno, Tsukasa Ohashi, Shigeru Katagiri, Miho Ohsaki, Shigeki Matsuda, Hideki Kashioka:
Minimum Classification Error Training Incorporating Automatic Loss Smoothness Determination. J. Signal Process. Syst. 74(3): 311-322 (2014) - [c43]Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori:
Sparse representation based on a bag of spectral exemplars for acoustic event detection. ICASSP 2014: 6255-6259 - [c42]Tsubasa Ochiai, Shigeki Matsuda, Xugang Lu, Chiori Hori, Shigeru Katagiri:
Speaker Adaptive Training using Deep Neural Networks. ICASSP 2014: 6349-6353 - [c41]Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori:
Ensemble modeling of denoising autoencoder for speech spectrum restoration. INTERSPEECH 2014: 885-889 - 2013
- [j9]Xinhui Hu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Collecting Colloquial and Spontaneous-like Sentences from Web Resources for Constructing Chinese Language Models of Speech Recognition. Inf. Media Technol. 8(2): 449-456 (2013) - [j8]Xinhui Hu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Collecting Colloquial and Spontaneous-like Sentences from Web Resources for Constructing Chinese Language Models of Speech Recognition. J. Inf. Process. 21(2): 168-175 (2013) - [j7]Xugang Lu, Masashi Unoki, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Controlling Tradeoff Between Approximation Accuracy and Complexity of a Smooth Function in a Reproducing Kernel Hilbert Space for Noise Reduction. IEEE Trans. Signal Process. 61(3): 601-610 (2013) - [c40]Chien-Lin Huang, Shigeki Matsuda, Chiori Hori:
Feature normalization using MVAW processing for spoken language recognition. APSIPA 2013: 1-4 - [c39]Shigeki Matsuda, Xugang Lu, Hideki Kashioka:
Automatic localization of a language-independent sub-network on deep neural networks trained by multi-lingual speech. ICASSP 2013: 7359-7362 - [c38]Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori:
Speech enhancement based on deep denoising autoencoder. INTERSPEECH 2013: 436-440 - [c37]Xugang Lu, Shigeki Matsuda, Chiori Hori:
Speech spectrum restoration based on conditional restricted boltzmann machine. INTERSPEECH 2013: 3259-3263 - [c36]Masahiro Saiko, Shigeki Matsuda, Ken Hanazawa, Ryosuke Isotani, Chiori Hori:
Cross-lingual acoustic model adaptation based on transfer vector field smoothing with MAP. INTERSPEECH 2013: 3322-3326 - [c35]Chien-Lin Huang, Paul R. Dixon, Shigeki Matsuda, Youzheng Wu, Xugang Lu, Masahiro Saiko, Chiori Hori:
The NICT ASR system for IWSLT 2013. IWSLT (Evaluation Campaign) 2013 - [c34]Shigeki Matsuda, Xinhui Hu, Yoshinori Shiga, Hideki Kashioka, Chiori Hori, Keiji Yasuda, Hideo Okuma, Masao Uchiyama, Eiichiro Sumita, Hisashi Kawai, Satoshi Nakamura:
Multilingual Speech-to-Speech Translation System: VoiceTra. MDM (2) 2013: 229-233 - 2012
- [j6]Sakriani Sakti, Michael Paul, Andrew M. Finch, Xinhui Hu, Jinfu Ni, Noriyuki Kimura, Shigeki Matsuda, Chiori Hori, Yutaka Ashikari, Hisashi Kawai, Hideki Kashioka, Eiichiro Sumita, Satoshi Nakamura:
Distributed speech translation technologies for multiparty multilingual communication. ACM Trans. Speech Lang. Process. 9(2): 4:1-4:27 (2012) - [c33]Youzheng Wu, Xugang Lu, Hitoshi Yamamoto, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Factored Language Model based on Recurrent Neural Network. COLING 2012: 2835-2850 - [c32]Tsukasa Ohashi, Hideyuki Watanabe, Jun'ichi Tokuno, Shigeru Katagiri, Miho Ohsaki, Shigeki Matsuda, Hideki Kashioka:
Increasing virtual samples through loss smoothness determination in large geometric margin minimum classification error training. ICASSP 2012: 2081-2084 - [c31]Yu Tsao, Chien-Lin Huang, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
A linear projection approach to environment modeling for robust speech recognition. ICASSP 2012: 4329-4332 - [c30]Hitoshi Yamamoto, Paul R. Dixon, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Tied-State Mixture Language Model for WFST-based Speech Recognition. INTERSPEECH 2012: 174-177 - [c29]Xugang Lu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Speech restoration based on deep learning autoencoder with layer-wised pretraining. INTERSPEECH 2012: 1504-1507 - [c28]Shigeki Matsuda, Naoya Ito, Kosuke Tsujino, Hideki Kashioka, Shigeki Sagayama:
Speaker-Dependent Voice Activity Detection Robust to Background Speech Noise. INTERSPEECH 2012: 2626-2629 - [c27]Xinhui Hu, Youzheng Wu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Collecting sentences from web resources for constructing spontaneous Chinese language model. ISCSLP 2012: 197-200 - [c26]Xugang Lu, Masashi Unoki, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Controlling the tradeoff property in a regularization framework for noise reduction. ISCSLP 2012: 201-205 - [c25]Xugang Lu, Yu Tsao, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Acoustic space partition based on broad phonetic class for ensemble acoustic modeling. ISCSLP 2012: 311-314 - [c24]Teruhisa Misu, Shigeki Matsuda, Etsuo Mizukami, Hideki Kashioka, Haizhou Li:
Efficient Language Model Construction for Spoken Dialog Systems by Inducting Language Resources of Different Languages. IWSDS 2012: 101-110 - [c23]Hitoshi Yamamoto, Youzheng Wu, Chien-Lin Huang, Xugang Lu, Paul R. Dixon, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
The NICT ASR system for IWSLT2012. IWSLT 2012: 34-37 - [c22]Youzheng Wu, Hitoshi Yamamoto, Xugang Lu, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
Factored recurrent neural network language model in TED lecture transcription. IWSLT 2012: 222-228 - 2011
- [j5]Xugang Lu, Shigeki Matsuda, Masashi Unoki, Satoshi Nakamura:
Temporal modulation normalization for robust speech feature extraction and recognition. Multim. Tools Appl. 52(1): 187-199 (2011) - [c21]Yu Tsao, Shigeki Matsuda, Shinsuke Sakai, Ryosuke Isotani, Hisashi Kawai, Satoshi Nakamura:
A sampling-based environment population projection approach for rapid acoustic model adaptation. ICASSP 2011: 5504-5507 - [c20]István Varga, Kiyonori Ohtake, Kentaro Torisawa, Stijn De Saeger, Teruhisa Misu, Shigeki Matsuda, Jun'ichi Kazama:
Similarity Based Language Model Construction for Voice Activated Open-Domain Question Answering. IJCNLP 2011: 536-544 - [c19]Kazuhiko Abe, Youzheng Wu, Chien-Lin Huang, Paul R. Dixon, Shigeki Matsuda, Chiori Hori, Hideki Kashioka:
The NICT ASR system for IWSLT2011. IWSLT 2011: 28-33 - 2010
- [j4]Xiang Zuo, Naoto Iwahashi, Kotaro Funakoshi, Mikio Nakano, Ryo Taguchi, Shigeki Matsuda, Komei Sugiura, Natsuki Oka:
Detecting Robot-Directed Speech by Situated Understanding in Physical Interaction. Inf. Media Technol. 5(4): 1314-1326 (2010) - [j3]Xugang Lu, Shigeki Matsuda, Masashi Unoki, Satoshi Nakamura:
Temporal contrast normalization and edge-preserved smoothing of temporal modulation structures of speech for robust speech recognition. Speech Commun. 52(1): 1-11 (2010) - [c18]Satoshi Tamura, Chiyomi Miyajima, Norihide Kitaoka, Takeshi Yamada, Satoru Tsuge, Tetsuya Takiguchi, Kazumasa Yamamoto, Takanobu Nishiura, Masato Nakayama, Yuki Denda, Masakiyo Fujimoto, Shigeki Matsuda, Tetsuji Ogawa, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura:
CENSREC-1-AV: an audio-visual corpus for noisy bimodal speech recognition. AVSP 2010: 6 - [c17]Xiang Zuo, Naoto Iwahashi, Ryo Taguchi, Shigeki Matsuda, Komei Sugiura, Kotaro Funakoshi, Mikio Nakano, Natsuki Oka:
Robot-directed speech detection using Multimodal Semantic Confidence based on speech, image, and motion. ICASSP 2010: 2458-2461 - [c16]Xiang Zuo, Naoto Iwahashi, Ryo Taguchi, Kotaro Funakoshi, Mikio Nakano, Shigeki Matsuda, Komei Sugiura, Natsuki Oka:
Detecting robot-directed speech by situated understanding in object manipulation tasks. RO-MAN 2010: 608-613
2000 – 2009
- 2009
- [c15]Yu Tsao, Shigeki Matsuda, Satoshi Nakamura, Chin-Hui Lee:
MAP estimation of online mapping parameters in ensemble speaker and speaking environment modeling. ASRU 2009: 271-275 - [c14]Xugang Lu, Shigeki Matsuda, Masashi Unoki, Tohru Shimizu, Satoshi Nakamura:
Temporal contrast normalization and edge-preserved smoothing on temporal modulation structure for robust speech recognition. ICASSP 2009: 4573-4576 - [c13]Shizuka Nakamura, Shigeki Matsuda, Hiroaki Kato, Minoru Tsuzaki, Yoshinori Sagisaka:
Objective evaluation of English learners' timing control based on a measure reflecting perceptual characteristics. ICASSP 2009: 4837-4840 - [c12]Shigeki Matsuda, Yu Tsao, Jinyu Li, Satoshi Nakamura, Chin-Hui Lee:
A study on soft margin estimation of linear regression parameters for speaker adaptation. INTERSPEECH 2009: 1603-1606 - 2008
- [j2]Carlos Toshinori Ishi, Shigeki Matsuda, Takayuki Kanda, Takatoshi Jitsuhiro, Hiroshi Ishiguro, Satoshi Nakamura, Norihiro Hagita:
A Robust Speech Recognition System for Communication Robots in Noisy Environments. IEEE Trans. Robotics 24(3): 759-763 (2008) - [c11]Michael Paul, Hideo Okuma, Hirofumi Yamamoto, Eiichiro Sumita, Shigeki Matsuda, Tohru Shimizu, Satoshi Nakamura:
Multilingual Mobile-Phone Translation Services for World Travelers. COLING (Demos) 2008: 165-168 - [c10]Masato Nakayama, Takanobu Nishiura, Yuki Denda, Norihide Kitaoka, Kazumasa Yamamoto, Takeshi Yamada, Satoru Tsuge, Chiyomi Miyajima, Masakiyo Fujimoto, Tetsuya Takiguchi, Satoshi Tamura, Tetsuji Ogawa, Shigeki Matsuda, Shingo Kuroiwa, Kazuya Takeda, Satoshi Nakamura:
CENSREC-4: development of evaluation framework for distant-talking speech recognition under reverberant environments. INTERSPEECH 2008: 968-971 - [c9]Xugang Lu, Shigeki Matsuda, Tohru Shimizu, Satoshi Nakamura:
Noise Reduction Based Random Matrix Theory. ISCSLP 2008: 285-288 - [c8]Xugang Lu, Shigeki Matsuda, Tohru Shimizu, Satoshi Nakamura:
Normalization on Temporal Modulation Transfer Function for Robust Speech Recognition. ISUC 2008: 16-23 - 2006
- [j1]Shigeki Matsuda, Takatoshi Jitsuhiro, Konstantin Markov, Satoshi Nakamura:
ATR Parallel Decoding Based Speech Recognition System Robust to Noise and Speaking Styles. IEICE Trans. Inf. Syst. 89-D(3): 989-997 (2006) - [c7]Carlos Toshinori Ishi, Shigeki Matsuda, Takayuki Kanda, Takatoshi Jitsuhiro, Hiroshi Ishiguro, Satoshi Nakamura, Norihiro Hagita:
Robust Speech Recognition System for Communication Robots in Real Environments. Humanoids 2006: 340-345 - 2005
- [c6]Takatoshi Jitsuhiro, Shigeki Matsuda, Yutaka Ashikari, Satoshi Nakamura, Ikuko Eguchi Yairi, Seiji Igi:
Spoken dialog system and its evaluation of geographic information system for elderly persons' mobility support. INTERSPEECH 2005: 197-200 - [c5]Shigeki Matsuda, Wolfgang Herbordt, Satoshi Nakamura:
Outlier detection for acoustic model training using robust statistics. INTERSPEECH 2005: 3337-3340 - 2004
- [c4]Shigeki Matsuda, Takatoshi Jitsuhiro, Konstantin Markov, Satoshi Nakamura:
Speech recognition system robust to noise and speaking styles. INTERSPEECH 2004: 2817-2820 - 2002
- [c3]Junko Tokuno, Nobuhito Inami, Shigeki Matsuda, Mitsuru Nakai, Hiroshi Shimodaira, Shigeki Sagayama:
Context-dependent substroke model for HMM-based on-line handwriting recognition. IWFHR 2002: 78-83 - 2000
- [c2]Shigeki Matsuda, Mitsuru Nakai, Hiroshi Shimodaira, Shigeki Sagayama:
Asynchronous-transition HMM. ICASSP 2000: 1005-1008 - [c1]Shigeki Matsuda, Mitsuru Nakai, Hiroshi Shimodaira, Shigeki Sagayama:
Feature-dependent allophone clustering. INTERSPEECH 2000: 413-416
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:25 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint