default search action
Kenji Nagamatsu
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2021
- [c33]Koichiro Ito, Masaaki Yamamoto, Kenji Nagamatsu:
Audio-Visual Speech Enhancement Method Conditioned in the Lip Motion and Speaker-Discriminative Embeddings. ICASSP 2021: 6668-6672 - [c32]Shota Horiguchi, Paola García
, Yusuke Fujita, Shinji Watanabe
, Kenji Nagamatsu:
End-To-End Speaker Diarization as Post-Processing. ICASSP 2021: 7188-7192 - [c31]Yuki Takashima, Yusuke Fujita, Shota Horiguchi, Shinji Watanabe
, Leibny Paola García-Perera, Kenji Nagamatsu:
Semi-Supervised Training with Pseudo-Labeling for End-To-End Neural Diarization. Interspeech 2021: 3096-3100 - [c30]Yawen Xue, Shota Horiguchi, Yusuke Fujita, Yuki Takashima, Shinji Watanabe
, Leibny Paola García-Perera, Kenji Nagamatsu:
Online Streaming End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers. Interspeech 2021: 3116-3120 - [c29]Koichiro Ito, Takuya Fujioka, Qinghua Sun, Kenji Nagamatsu:
Audio-Visual Speech Emotion Recognition by Disentangling Emotion and Identity Attributes. Interspeech 2021: 4493-4497 - [c28]Shota Horiguchi, Yusuke Fujita, Kenji Nagamatsu:
Block-Online Guided Source Separation. SLT 2021: 236-242 - [c27]Yawen Xue, Shota Horiguchi, Yusuke Fujita, Shinji Watanabe
, Paola García, Kenji Nagamatsu:
Online End-To-End Neural Diarization with Speaker-Tracing Buffer. SLT 2021: 841-848 - [c26]Yuki Takashima, Yusuke Fujita, Shinji Watanabe
, Shota Horiguchi, Paola García, Kenji Nagamatsu:
End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection. SLT 2021: 849-856 - [i19]Yawen Xue, Shota Horiguchi, Yusuke Fujita, Yuki Takashima, Shinji Watanabe, Paola García, Kenji Nagamatsu:
Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers. CoRR abs/2101.08473 (2021) - [i18]Yuki Takashima, Yusuke Fujita, Shinji Watanabe, Shota Horiguchi, Paola García, Kenji Nagamatsu:
End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection. CoRR abs/2106.04078 (2021) - [i17]Yuki Takashima, Yusuke Fujita, Shota Horiguchi, Shinji Watanabe, Paola García, Kenji Nagamatsu:
Semi-Supervised Training with Pseudo-Labeling for End-to-End Neural Diarization. CoRR abs/2106.04764 (2021) - [i16]Takeshi Homma, Qinghua Sun, Takuya Fujioka, Ryuta Takawaki, Eriko Ankyu, Kenji Nagamatsu, Daichi Sugawara, Etsuko T. Harada:
Emotional Speech Synthesis for Companion Robot to Imitate Professional Caregiver Speech. CoRR abs/2109.12787 (2021) - [i15]Atsuki Yamaguchi, Gaku Morio, Hiroaki Ozaki, Ken-ichi Yokote, Kenji Nagamatsu:
Team Hitachi @ AutoMin 2021: Reference-free Automatic Minuting Pipeline with Argument Structure Construction over Topic-based Summarization. CoRR abs/2112.02741 (2021) - 2020
- [c25]Koichiro Ito, Quan Kong, Shota Horiguchi, Takashi Sumiyoshi, Kenji Nagamatsu:
Anticipating the Start of User Interaction for Service Robot in the Wild. ICRA 2020: 9687-9693 - [c24]Shota Horiguchi, Yusuke Fujita, Shinji Watanabe
, Yawen Xue, Kenji Nagamatsu:
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors. INTERSPEECH 2020: 269-273 - [c23]Shota Horiguchi, Yusuke Fujita, Kenji Nagamatsu:
Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones. INTERSPEECH 2020: 344-348 - [c22]Takuya Fujioka, Takeshi Homma
, Kenji Nagamatsu:
Meta-Learning for Speech Emotion Recognition Considering Ambiguity of Emotion Labels. INTERSPEECH 2020: 2332-2336 - [c21]Amalia Istiqlali Adiba, Takeshi Homma
, Dario Bertero, Takashi Sumiyoshi, Kenji Nagamatsu:
Delay Mitigation for Backchannel Prediction in Spoken Dialog System. IWSDS 2020: 129-143 - [i14]Yusuke Fujita, Shinji Watanabe, Shota Horiguchi, Yawen Xue, Kenji Nagamatsu:
End-to-End Neural Diarization: Reformulating Speaker Diarization as Simple Multi-label Classification. CoRR abs/2003.02966 (2020) - [i13]Shota Horiguchi, Yusuke Fujita, Shinji Watanabe, Yawen Xue, Kenji Nagamatsu:
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors. CoRR abs/2005.09921 (2020) - [i12]Yusuke Fujita, Shinji Watanabe, Shota Horiguchi, Yawen Xue, Jing Shi, Kenji Nagamatsu:
Neural Speaker Diarization with Speaker-Wise Chain Rule. CoRR abs/2006.01796 (2020) - [i11]Yawen Xue, Shota Horiguchi, Yusuke Fujita, Shinji Watanabe, Kenji Nagamatsu:
Online End-to-End Neural Diarization with Speaker-Tracing Buffer. CoRR abs/2006.02616 (2020) - [i10]Shota Horiguchi, Yusuke Fujita, Kenji Nagamatsu:
Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones. CoRR abs/2007.15868 (2020) - [i9]Shota Horiguchi, Yusuke Fujita, Kenji Nagamatsu:
Block-Online Guided Source Separation. CoRR abs/2011.07791 (2020) - [i8]Shota Horiguchi, Paola García, Yusuke Fujita, Shinji Watanabe, Kenji Nagamatsu:
End-to-End Speaker Diarization as Post-Processing. CoRR abs/2012.10055 (2020) - [i7]Qinghua Sun, Kenji Nagamatsu:
Building Multi lingual TTS using Cross Lingual Voice Conversion. CoRR abs/2012.14039 (2020)
2010 – 2019
- 2019
- [c20]Naoyuki Kanda, Shota Horiguchi, Yusuke Fujita, Yawen Xue, Kenji Nagamatsu, Shinji Watanabe
:
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models. ASRU 2019: 31-38 - [c19]Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, Yawen Xue, Kenji Nagamatsu, Shinji Watanabe
:
End-to-End Neural Speaker Diarization with Self-Attention. ASRU 2019: 296-303 - [c18]Naoyuki Kanda, Yusuke Fujita, Shota Horiguchi, Rintaro Ikeshita, Kenji Nagamatsu, Shinji Watanabe
:
Acoustic Modeling for Distant Multi-talker Speech Recognition with Single- and Multi-channel Branches. ICASSP 2019: 6630-6634 - [c17]Naoyuki Kanda, Shota Horiguchi, Ryoichi Takashima, Yusuke Fujita, Kenji Nagamatsu, Shinji Watanabe
:
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition. INTERSPEECH 2019: 236-240 - [c16]Naoyuki Kanda, Christoph Böddeker, Jens Heitkaemper, Yusuke Fujita, Shota Horiguchi, Kenji Nagamatsu, Reinhold Haeb-Umbach:
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. INTERSPEECH 2019: 1248-1252 - [c15]Shota Horiguchi, Naoyuki Kanda, Kenji Nagamatsu:
Multimodal Response Obligation Detection with Unsupervised Online Domain Adaptation. INTERSPEECH 2019: 4180-4184 - [c14]Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, Kenji Nagamatsu, Shinji Watanabe
:
End-to-End Neural Speaker Diarization with Permutation-Free Objectives. INTERSPEECH 2019: 4300-4304 - [i6]Naoyuki Kanda, Christoph Böddeker, Jens Heitkaemper, Yusuke Fujita, Shota Horiguchi, Kenji Nagamatsu, Reinhold Haeb-Umbach:
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. CoRR abs/1905.12230 (2019) - [i5]Naoyuki Kanda, Shota Horiguchi, Ryoichi Takashima, Yusuke Fujita, Kenji Nagamatsu, Shinji Watanabe:
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition. CoRR abs/1906.10876 (2019) - [i4]Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, Kenji Nagamatsu, Shinji Watanabe:
End-to-End Neural Speaker Diarization with Permutation-Free Objectives. CoRR abs/1909.05952 (2019) - [i3]Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, Yawen Xue, Kenji Nagamatsu, Shinji Watanabe:
End-to-End Neural Speaker Diarization with Self-attention. CoRR abs/1909.06247 (2019) - [i2]Naoyuki Kanda, Shota Horiguchi, Yusuke Fujita, Yawen Xue, Kenji Nagamatsu, Shinji Watanabe:
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models. CoRR abs/1909.08103 (2019) - [i1]Takuya Fujioka, Dario Bertero, Takeshi Homma, Kenji Nagamatsu:
Addressing Ambiguity of Emotion Labels Through Meta-Learning. CoRR abs/1911.02216 (2019) - 2018
- [c13]Naoyuki Kanda, Yusuke Fujita, Kenji Nagamatsu:
Sequence Distillation for Purely Sequence Trained Acoustic Models. ICASSP 2018: 5964-5968 - [c12]Naoyuki Kanda, Yusuke Fujita, Kenji Nagamatsu:
Lattice-free State-level Minimum Bayes Risk Training of Acoustic Models. INTERSPEECH 2018: 2923-2927 - [c11]Rintaro Ikeshita, Yohei Kawaguchi
, Kenji Nagamatsu:
Fast Multichannel Nonnegative Matrix Factorization with Constraints on Active Source Candidates. IWAENC 2018: 520-524 - [c10]Shota Horiguchi, Naoyuki Kanda, Kenji Nagamatsu:
Face-Voice Matching using Cross-modal Embeddings. ACM Multimedia 2018: 1011-1019 - 2017
- [c9]Naoyuki Kanda, Yusuke Fujita, Kenji Nagamatsu:
Investigation of lattice-free maximum mutual information-based acoustic models with sequence-level Kullback-Leibler divergence. ASRU 2017: 69-76 - [c8]Rintaro Ikeshita, Yohei Kawaguchi
, Masahito Togami, Yusuke Fujita, Kenji Nagamatsu:
Independent vector analysis with frequency range division and prior switching. EUSIPCO 2017: 2329-2333 - [c7]Rintaro Ikeshita, Masahito Togami, Yohei Kawaguchi
, Yusuke Fujita, Kenji Nagamatsu:
Local Gaussian model with source-set constraints in audio source separation. MLSP 2017: 1-6 - 2013
- [c6]Iacopo Gentilini, Kenji Nagamatsu, Kenji Shimada:
Cycle time based multi-goal path optimization for redundant robotic systems. IROS 2013: 1786-1792 - 2010
- [j1]Hongjian Liu, Defeng Guo, Quan Zhou, Kenji Nagamatsu, Qinghua Sun:
A Pre-Identification Method for Chinese Named Entity Recognition. J. Softw. 5(1): 73-80 (2010)
2000 – 2009
- 2009
- [c5]Quan Zhou, Pan Deng, Hongjian Liu, Defeng Guo, Kenji Nagamatsu:
A Hybrid Method of Chinese Prosodic Word Tagging Based on Keyword Anchor and Hidden Markov Model. IALP 2009: 71-75 - [c4]Hongjian Liu, Defeng Guo, Quan Zhou, Kenji Nagamatsu, Qinghua Sun:
Cascade Chinese Potential Name Recognition. IFITA (3) 2009: 329-333 - 2006
- [c3]Nobuo Nukaga, Ryota Kamoshida, Kenji Nagamatsu, Yoshinori Kitahara:
Scalable Implementation Of Unit Selection Based Text-To-Speech System For Embedded Solutions. ICASSP (1) 2006: 849-852 - 2004
- [c2]Nobuo Nukaga, Ryota Kamoshida, Kenji Nagamatsu:
Unit selection using pitch synchronous cross correlation for Japanese concatenative speech synthesis. SSW 2004: 43-48
1990 – 1999
- 1996
- [c1]Kenji Nagamatsu, Hidehiko Tanaka:
Estimating Point-of-View-based Similarity Using POV Reinforcement and Similarity Propagation. PACLIC 1996: 373-382
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-01 20:13 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint