Nobukatsu Hojo
2020 – today
- 2024
- [j7] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Shogo Seki:
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion With Annealed Langevin Dynamics. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2213-2226 (2024)
- [c26] Saki Mizuno, Nobukatsu Hojo, Kazutoshi Shinoda, Keita Suzuki, Mana Ihori, Hiroshi Sato, Tomohiro Tanaka, Naotaka Kawata, Satoshi Kobashikawa, Ryo Masumura:
Talking Face Generation for Impression Conversion Considering Speech Semantics. ICASSP 2024: 8411-8415
- 2023
- [c25] Nobukatsu Hojo, Saki Mizuno, Satoshi Kobashikawa, Ryo Masumura:
Modeling Lead-Lag Structure in Facial Expression Synchrony for Social-Psychological Outcome Prediction from Negotiation Interaction. ICASSP Workshops 2023: 1-5
- [c24] Saki Mizuno, Nobukatsu Hojo, Satoshi Kobashikawa, Ryo Masumura:
Next-Speaker Prediction Based on Non-Verbal Information in Multi-Party Video Conversation. ICASSP 2023: 1-5
- [c23] Mana Ihori, Hiroshi Sato, Tomohiro Tanaka, Ryo Masumura, Saki Mizuno, Nobukatsu Hojo:
Transcribing Speech as Spoken and Written Dual Text Using an Autoregressive Model. INTERSPEECH 2023: 461-465
- [c22] Hiroshi Sato, Ryo Masumura, Tsubasa Ochiai, Marc Delcroix, Takafumi Moriya, Takanori Ashihara, Kentaro Shinayama, Saki Mizuno, Mana Ihori, Tomohiro Tanaka, Nobukatsu Hojo:
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss. INTERSPEECH 2023: 854-858
- [c21] Nobukatsu Hojo, Saki Mizuno, Satoshi Kobashikawa, Ryo Masumura, Mana Ihori, Hiroshi Sato, Tomohiro Tanaka:
Audio-Visual Praise Estimation for Conversational Video based on Synchronization-Guided Multimodal Transformer. INTERSPEECH 2023: 2663-2667
- [c20] Ryo Masumura, Naoki Makishima, Taiga Yamane, Yoshihiko Yamazaki, Saki Mizuno, Mana Ihori, Mihiro Uchida, Keita Suzuki, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando:
End-to-End Joint Target and Non-Target Speakers ASR. INTERSPEECH 2023: 2903-2907
- [i15] Hiroshi Sato, Ryo Masumura, Tsubasa Ochiai, Marc Delcroix, Takafumi Moriya, Takanori Ashihara, Kentaro Shinayama, Saki Mizuno, Mana Ihori, Tomohiro Tanaka, Nobukatsu Hojo:
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss. CoRR abs/2305.14723 (2023)
- [i14] Ryo Masumura, Naoki Makishima, Taiga Yamane, Yoshihiko Yamazaki, Saki Mizuno, Mana Ihori, Mihiro Uchida, Keita Suzuki, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando:
End-to-End Joint Target and Non-Target Speakers ASR. CoRR abs/2306.02273 (2023)
- 2022
- [c19] Ryo Masumura, Yoshihiro Yamazaki, Saki Mizuno, Naoki Makishima, Mana Ihori, Mihiro Uchida, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Shota Orihashi, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando:
End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training. INTERSPEECH 2022: 3218-3222
- [c18] Nobukatsu Hojo, Satoshi Kobashikawa, Saki Mizuno, Ryo Masumura:
Multimodal Negotiation Corpus with Various Subjective Assessments for Social-Psychological Outcome Prediction from Non-Verbal Cues. LREC 2022: 6794-6801
- 2021
- [j6] Katsuki Inoue, Sunao Hara, Masanobu Abe, Nobukatsu Hojo, Yusuke Ijima:
Model architectures to extrapolate emotional expressions in DNN-based text-to-speech. Speech Commun. 126: 35-43 (2021)
- [j5] Hirokazu Kameoka, Wen-Chin Huang, Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Tomoki Toda:
Many-to-Many Voice Transformer Network. IEEE ACM Trans. Audio Speech Lang. Process. 29: 656-670 (2021)
- [c17] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
Maskcyclegan-VC: Learning Non-Parallel Voice Conversion with Filling in Frames. ICASSP 2021: 5919-5923
- [i13] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames. CoRR abs/2102.12841 (2021)
- 2020
- [j4] Hirokazu Kameoka, Kou Tanaka, Damian Kwasny, Takuhiro Kaneko, Nobukatsu Hojo:
ConvS2S-VC: Fully Convolutional Sequence-to-Sequence Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1849-1863 (2020)
- [j3] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
Nonparallel Voice Conversion With Augmented Classifier Star Generative Adversarial Networks. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2982-2995 (2020)
- [c16] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-Spectrogram Conversion. INTERSPEECH 2020: 2017-2021
- [i12] Hirokazu Kameoka, Wen-Chin Huang, Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Tomoki Toda:
Many-to-Many Voice Transformer Network. CoRR abs/2005.08445 (2020)
- [i11] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Shogo Seki:
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics. CoRR abs/2010.02977 (2020)
- [i10] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion. CoRR abs/2010.11672 (2020)
2010 – 2019
- 2019
- [j2] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
ACVAE-VC: Non-Parallel Voice Conversion With Auxiliary Classifier Variational Autoencoder. IEEE ACM Trans. Audio Speech Lang. Process. 27(9): 1432-1443 (2019)
- [c15] Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo:
ATTS2S-VC: Sequence-to-sequence Voice Conversion with Attention and Context Preservation Mechanisms. ICASSP 2019: 6805-6809
- [c14] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
Cyclegan-VC2: Improved Cyclegan-based Non-parallel Voice Conversion. ICASSP 2019: 6820-6824
- [c13] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion. INTERSPEECH 2019: 679-683
- [c12] Nobukatsu Hojo, Noboru Miyazaki:
Evaluating Intention Communication by TTS Using Explicit Definitions of Illocutionary Act Performance. INTERSPEECH 2019: 1536-1540
- [i9] Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo:
WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation. CoRR abs/1904.02892 (2019)
- [i8] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion. CoRR abs/1904.04631 (2019)
- [i7] Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion. CoRR abs/1907.12279 (2019)
- 2018
- [j1] Nobukatsu Hojo, Yusuke Ijima, Hideyuki Mizuno:
DNN-Based Speech Synthesis Using Speaker Codes. IEICE Trans. Inf. Syst. 101-D(2): 462-472 (2018)
- [c11] Nobukatsu Hojo, Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko:
Automatic Speech Pronunciation Correction with Dynamic Frequency Warping-Based Spectral Conversion. EUSIPCO 2018: 2310-2314
- [c10] Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Hiroyasu Ando:
Generative adversarial network-based approach to signal reconstruction from magnitude spectrogram. EUSIPCO 2018: 2514-2518
- [c9] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC: non-parallel many-to-many Voice Conversion Using Star Generative Adversarial Networks. SLT 2018: 266-273
- [c8] Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Hirokazu Kameoka:
Synthetic-to-Natural Speech Waveform Conversion Using Cycle-Consistent Adversarial Networks. SLT 2018: 632-639
- [i6] Keisuke Oyamada, Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo, Hiroyasu Ando:
Generative adversarial network-based approach to signal reconstruction from magnitude spectrograms. CoRR abs/1804.02181 (2018)
- [i5] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks. CoRR abs/1806.02169 (2018)
- [i4] Hirokazu Kameoka, Takuhiro Kaneko, Kou Tanaka, Nobukatsu Hojo:
ACVAE-VC: Non-parallel many-to-many voice conversion with auxiliary classifier variational autoencoder. CoRR abs/1808.05092 (2018)
- [i3] Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Hirokazu Kameoka:
WaveCycleGAN: Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks. CoRR abs/1809.10288 (2018)
- [i2] Hirokazu Kameoka, Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo:
ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion. CoRR abs/1811.01609 (2018)
- [i1] Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko, Nobukatsu Hojo:
AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms. CoRR abs/1811.04076 (2018)
- 2017
- [c7] Katsuki Inoue, Sunao Hara, Masanobu Abe, Nobukatsu Hojo, Yusuke Ijima:
An investigation to transplant emotional expressions in DNN-based TTS synthesis. APSIPA 2017: 1253-1258
- [c6] Takuhiro Kaneko, Hirokazu Kameoka, Nobukatsu Hojo, Yusuke Ijima, Kaoru Hiramatsu, Kunio Kashino:
Generative adversarial network-based postfilter for statistical parametric speech synthesis. ICASSP 2017: 4910-4914
- [c5] Yusuke Ijima, Nobukatsu Hojo, Ryo Masumura, Taichi Asami:
Prosody Aware Word-Level Encoder Based on BLSTM-RNNs for DNN-Based Speech Synthesis. INTERSPEECH 2017: 764-768
- [c4] Nobukatsu Hojo, Yasuhito Ohsugi, Yusuke Ijima, Hirokazu Kameoka:
DNN-SPACE: DNN-HMM-Based Generative Model of Voice F0 Contours for Statistical Phrase/Accent Command Estimation. INTERSPEECH 2017: 1074-1078
- 2016
- [c3] Nobukatsu Hojo, Yusuke Ijima, Hideyuki Mizuno:
An Investigation of DNN-Based Speech Synthesis Using Speaker Codes. INTERSPEECH 2016: 2278-2282
- 2014
- [c2] Kento Kadowaki, Tatsuma Ishihara, Nobukatsu Hojo, Hirokazu Kameoka:
Speech prosody generation for text-to-speech synthesis based on generative model of F0 contours. INTERSPEECH 2014: 2322-2326
- 2013
- [c1] Nobukatsu Hojo, Kota Yoshizato, Hirokazu Kameoka, Daisuke Saito, Shigeki Sagayama:
Text-to-speech synthesizer based on combination of composite wavelet and hidden Markov models. SSW 2013: 129-134
last updated on 2024-10-07 21:15 CEST by the dblp team
all metadata released as open data under CC0 1.0 license