Akinobu Lee
2020 – today
- 2024
- [j8] Sei Ueno, Akinobu Lee, Tatsuya Kawahara:
Refining Synthesized Speech Using Speaker Information and Phone Masking for Data Augmentation of Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3924-3933 (2024)
- 2023
- [c50] Iago Lourenço Correa, Sei Ueno, Akinobu Lee:
Accent-Preserving Voice Conversion between Native-Nonnative Speakers for Second Language Learning. APSIPA ASC 2023: 1179-1186
- 2020
- [j7] Ryota Tanaka, Akihide Ozeki, Shugo Kato, Akinobu Lee:
Context and knowledge aware conversational model and system combination for grounded response generation. Comput. Speech Lang. 62: 101070 (2020)
- [i2] Ryota Tanaka, Akinobu Lee:
Fact-based Dialogue Generation with Convergent and Divergent Decoding. CoRR abs/2005.03174 (2020)
2010 – 2019
- 2019
- [i1] Ryota Tanaka, Akihide Ozeki, Shugo Kato, Akinobu Lee:
An Ensemble Dialogue System for Facts-Based Sentence Generation. CoRR abs/1902.01529 (2019)
- 2017
- [p2] Keiichi Tokuda, Akinobu Lee, Yoshihiko Nankaku, Keiichiro Oura, Kei Hashimoto, Daisuke Yamamoto, Ichi Takumi, Takahiro Uchiya, Shuhei Tsutsumi, Steve Renals, Junichi Yamagishi:
User Generated Dialogue Systems: uDialogue. Human-Harmonized Information Technology (2) 2017: 77-114
- 2015
- [c49] Siva Reddy Gangireddy, Steve Renals, Yoshihiko Nankaku, Akinobu Lee:
Prosodically-enhanced recurrent neural network language models. INTERSPEECH 2015: 2390-2394
- 2014
- [c48] Daisuke Yamamoto, Keiichiro Oura, Ryota Nishimura, Takahiro Uchiya, Akinobu Lee, Ichi Takumi, Keiichi Tokuda:
Voice interaction system with 3D-CG virtual agent for stand-alone smartphones. HAI 2014: 323-330
- 2013
- [c47] Akinobu Lee, Keiichiro Oura, Keiichi Tokuda:
MMDAgent - A fully open-source toolkit for voice interaction systems. ICASSP 2013: 8382-8385
- 2011
- [j6] Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Bayesian Context Clustering Using Cross Validation for Speech Recognition. IEICE Trans. Inf. Syst. 94-D(3): 668-678 (2011)
- [c46] Naoaki Ito, Yoshihiko Nankaku, Akinobu Lee:
Evaluation of Tree-Trellis Based Decoding in Over-Million LVCSR. INTERSPEECH 2011: 1937-1940
- 2010
- [j5] Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
A Covariance-Tying Technique for HMM-Based Speech Synthesis. IEICE Trans. Inf. Syst. 93-D(3): 595-601 (2010)
- [c45] Toyohiro Hayashi, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Speaker adaptation based on nonlinear spectral transform for speech recognition. INTERSPEECH 2010: 542-545
- [c44] Akira Saito, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Voice activity detection based on conditional random fields using multiple features. INTERSPEECH 2010: 2086-2089
2000 – 2009
- 2009
- [c43] Kaori Yutani, Yosuke Uto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Voice conversion based on simultaneous modelling of spectrum and F0. ICASSP 2009: 3897-3900
- [c42] Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Tying covariance matrices to reduce the footprint of HMM-based speech synthesis systems. INTERSPEECH 2009: 1759-1762
- 2008
- [j4] Tobias Cincarek, Hiromichi Kawanami, Ryuichi Nisimura, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Development, Long-Term Operation and Portability of a Real-Environment Speech-Oriented Guidance System. IEICE Trans. Inf. Syst. 91-D(3): 576-587 (2008)
- [j3] Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
A Fully Consistent Hidden Semi-Markov Model-Based Speech Recognition System. IEICE Trans. Inf. Syst. 91-D(11): 2693-2700 (2008)
- [c41] Yoshitaka Yoshimi, Ryota Kakitsuba, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Probabilistic answer selection based on conditional random fields for spoken dialog system. INTERSPEECH 2008: 215-218
- [c40] Sayaka Shiota, Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Acoustic modeling based on model structure annealing for speech recognition. INTERSPEECH 2008: 932-935
- [c39] Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Bayesian context clustering using cross valid prior distribution for HMM-based speech recognition. INTERSPEECH 2008: 936-939
- [c38] Tatsuya Ito, Kei Hashimoto, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Speaker recognition based on variational Bayesian method. INTERSPEECH 2008: 1417-1420
- 2007
- [c37] Tobias Cincarek, Ryuichi Nisimura, Akinobu Lee, Kiyohiro Shikano:
Insights Gained from Development and Long-Term Operation of a Real-Environment Speech-Oriented Guidance System. ICASSP (4) 2007: 157-160
- [c36] Hiroaki Kokubo, Nobuo Hataoka, Akinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano:
Real-Time Continuous Speech Recognition System on SH-4A Microprocessor. MMSP 2007: 35-38
- [c35] Hiroyuki Sakai, Tobias Cincarek, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano, Akinobu Lee:
Voice activity detection applied to hands-free spoken dialogue robot based on decoding using acoustic and language model. ROBOCOMM 2007: 16
- 2006
- [j2] Randy Gomez, Akinobu Lee, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
Improving Rapid Unsupervised Speaker Adaptation Based on HMM-Sufficient Statistics in Noisy Environments Using Multi-Template Models. IEICE Trans. Inf. Syst. 89-D(3): 998-1005 (2006)
- [j1] Hiroshi Saruwatari, Toshiya Kawamura, Tsuyoki Nishikawa, Akinobu Lee, Kiyohiro Shikano:
Blind source separation based on a fast-convergence algorithm combining ICA and beamforming. IEEE Trans. Speech Audio Process. 14(2): 666-678 (2006)
- [c34] Keiichiro Oura, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
Hidden Semi-Markov Model Based Speech Recognition System using Weighted Finite-State Transducer. ICASSP (1) 2006: 33-36
- [c33] Tomohiro Hakamata, Akinobu Lee, Yoshihiko Nankaku, Keiichi Tokuda:
Reducing computation on parallel decoding using frame-wise confidence scores. INTERSPEECH 2006
- [c32] Keijiro Saino, Heiga Zen, Yoshihiko Nankaku, Akinobu Lee, Keiichi Tokuda:
An HMM-based singing voice synthesis system. INTERSPEECH 2006
- [c31] Yosuke Uto, Yoshihiko Nankaku, Tomoki Toda, Akinobu Lee, Keiichi Tokuda:
Voice conversion based on mixtures of factor analyzers. INTERSPEECH 2006
- [c30] Hiroaki Kokubo, Nobuo Hataoka, Akinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano:
Embedded Julius: Continuous Speech Recognition Software for Microprocessor. MMSP 2006: 378-381
- 2005
- [c29] Hiroshi Saruwatari, Katsuyuki Sawai, Tsuyoki Nishikawa, Akinobu Lee, Kiyohiro Shikano, Atsunobu Kaminuma, Masao Sakata, Daisuke Saitoh:
Speech Enhancement Based on Blind Source Separation in Car Environments. ICDE Workshops 2005: 1205
- [c28] Randy Gomez, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Rapid unsupervised speaker adaptation based on multi-template HMM sufficient statistics in noisy environments. INTERSPEECH 2005: 293-296
- [c27] Ryuichi Nisimura, Akinobu Lee, Masashi Yamada, Kiyohiro Shikano:
Operating a public spoken guidance system in real environment. INTERSPEECH 2005: 845-848
- [c26] Daisuke Saitoh, Atsunobu Kaminuma, Hiroshi Saruwatari, Tsuyoki Nishikawa, Akinobu Lee:
Speech extraction in a car interior using frequency-domain ICA with rapid filter adaptations. INTERSPEECH 2005: 2301-2304
- [c25] Yasuaki Ohashi, Tsuyoki Nishikawa, Hiroshi Saruwatari, Akinobu Lee, Kiyohiro Shikano:
Noise-robust hands-free speech recognition based on spatial subtraction array and known noise superimposition. IROS 2005: 2328-2332
- 2004
- [c24] Panikos Heracleous, Yoshitaka Nakajima, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Audible (normal) speech and inaudible murmur recognition using NAM microphone. EUSIPCO 2004: 329-332
- [c23] Ryuichi Nisimura, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Public speech-oriented guidance system with adult and child discrimination capability. ICASSP (1) 2004: 433-436
- [c22] Akinobu Lee, Kiyohiro Shikano, Tatsuya Kawahara:
Real-time word confidence scoring using local posterior probabilities on tree trellis search. ICASSP (1) 2004: 793-796
- [c21] Akinobu Lee, Keisuke Nakamura, Ryuichi Nisimura, Hiroshi Saruwatari, Kiyohiro Shikano:
Noise robust real world spoken dialogue system using GMM based rejection of unintended inputs. INTERSPEECH 2004: 173-176
- [c20] Panikos Heracleous, Yoshitaka Nakajima, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Non-audible murmur (NAM) speech recognition using a stethoscopic NAM microphone. INTERSPEECH 2004: 1469-1472
- [c19] Randy Gomez, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Robust speech recognition with spectral subtraction in low SNR. INTERSPEECH 2004: 2077-2080
- [c18] Tatsuya Kawahara, Akinobu Lee, Kazuya Takeda, Katsunobu Itou, Kiyohiro Shikano:
Recent progress of open-source LVCSR engine Julius and Japanese model repository. INTERSPEECH 2004: 3069-3072
- [p1] Shinichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta, Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo Morishima, Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi Yamashita, Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose, Nobuaki Minematsu, Atsushi Yamada, Yasuharu Den, Takehito Utsuro, Shigeki Sagayama:
Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents. Life-like characters 2004: 187-212
- 2003
- [c17] Shingo Yamade, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Unsupervised speaker adaptation based on HMM sufficient statistics in various noisy environments. INTERSPEECH 2003: 1493-1496
- 2002
- [c16] Shingo Yamade, Kanako Matsunami, Akira Baba, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Spectral subtraction in noisy environments applied to speaker adaptation based on HMM sufficient statistics. INTERSPEECH 2002: 1045-1048
- [c15] Hiroshi Saruwatari, Katsuyuki Sawai, Akinobu Lee, Kiyohiro Shikano, Atsunobu Kaminuma, Masao Sakata:
Speech enhancement in car environment using blind source separation. INTERSPEECH 2002: 1781-1784
- [c14] Akinobu Lee, Yuichiro Mera, Hiroshi Saruwatari, Kiyohiro Shikano:
Selective multi-path acoustic model based on database likelihoods. INTERSPEECH 2002: 2661-2664
- [c13] Ryuichi Nisimura, Takashi Uchida, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano, Yoshio Matsumoto:
ASKA: receptionist robot with speech dialogue system. IROS 2002: 1314-1319
- [c12] Akinobu Lee, Tatsuya Kawahara, Kazuya Takeda, Masato Mimura, Atsushi Yamada, Akinori Ito, Katsunobu Itou, Kiyohiro Shikano:
Continuous Speech Recognition Consortium an Open Repository for CSR Tools and Models. LREC 2002
- 2001
- [c11] Akinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano:
Gaussian mixture selection using context-independent HMM. ICASSP 2001: 69-72
- [c10] Miichi Yamada, Akira Baba, Shinichi Yoshizawa, Yuichiro Mera, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Unsupervised noisy environment adaptation algorithm using MLLR and speaker selection. INTERSPEECH 2001: 869-872
- [c9] Shinichi Yoshizawa, Akira Baba, Kanako Matsunami, Yuichiro Mera, Miichi Yamada, Akinobu Lee, Kiyohiro Shikano:
Evaluation on unsupervised speaker adaptation based on sufficient HMM statistics of selected speakers. INTERSPEECH 2001: 1219-1222
- [c8] Akira Baba, Shinichi Yoshizawa, Miichi Yamada, Akinobu Lee, Kiyohiro Shikano:
Elderly acoustic model for large vocabulary continuous speech recognition. INTERSPEECH 2001: 1657-1660
- [c7] Akinobu Lee, Tatsuya Kawahara, Kiyohiro Shikano:
Julius - an open source real-time large vocabulary recognition engine. INTERSPEECH 2001: 1691-1694
- [c6] Ryuichi Nisimura, Kumiko Komatsu, Yuka Kuroda, Kentaro Nagatomo, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Automatic n-gram language model creation from web resources. INTERSPEECH 2001: 2127-2130
- 2000
- [c5] Akinobu Lee, Tatsuya Kawahara, Kazuya Takeda, Kiyohiro Shikano:
A new phonetic tied-mixture model for efficient decoding. ICASSP 2000: 1269-1272
- [c4] Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Shigeki Sagayama, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano:
Free software toolkit for Japanese large vocabulary continuous speech recognition. INTERSPEECH 2000: 476-479
- [c3] Hiroaki Nanjo, Akinobu Lee, Tatsuya Kawahara:
Automatic diagnosis of recognition errors in large vocabulary continuous speech recognition systems. INTERSPEECH 2000: 1027-1030
- [c2] Katsunobu Itou, Kiyohiro Shikano, Tatsuya Kawahara, Kazuya Takeda, Atsushi Yamada, Akinori Ito, Takehito Utsuro, Tetsunori Kobayashi, Nobuaki Minematsu, Mikio Yamamoto, Shigeki Sagayama, Akinobu Lee:
IPA Japanese Dictation Free Software Project. LREC 2000
1990 – 1999
- 1998
- [c1] Akinobu Lee, Tatsuya Kawahara, Shuji Doshita:
An efficient two-pass search algorithm using word trellis index. ICSLP 1998
last updated on 2024-10-07 21:18 CEST by the dblp team
all metadata released as open data under CC0 1.0 license