Yuma Shirahata

2020 – today

2024
[c8]
persistent URL: https://dblp.org/rec/conf/icassp/ShimizuYKSDKT24
Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana:
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions. ICASSP 2024: 12672-12676
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-07969
Masaya Kawamura, Ryuichi Yamamoto, Yuma Shirahata, Takuya Hasumi, Kentaro Tachibana:
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning. CoRR abs/2406.07969 (2024)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2406-12194
Robin Scheibler, Yusuke Fujita, Yuma Shirahata, Tatsuya Komatsu:
Universal Score-based Speech Enhancement with High Content Preservation. CoRR abs/2406.12194 (2024)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-17452
Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana:
Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control. CoRR abs/2409.17452 (2024)
2023
[c7]
persistent URL: https://dblp.org/rec/conf/icassp/KawamuraSYT23
Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana:
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform. ICASSP 2023: 1-5
[c6]
persistent URL: https://dblp.org/rec/conf/icassp/ShirahataYSTKT23
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana:
Period VITS: Variational Inference with Explicit Pitch Modeling for End-To-End Emotional Speech Synthesis. ICASSP 2023: 1-5
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2309-08140
Reo Shimizu, Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana:
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions. CoRR abs/2309.08140 (2023)
2022
[c5]
persistent URL: https://dblp.org/rec/conf/interspeech/TerashimaYSSYKT22
Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana:
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. INTERSPEECH 2022: 3018-3022
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-10020
Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana:
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation. CoRR abs/2204.10020 (2022)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-15964
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana:
Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis. CoRR abs/2210.15964 (2022)
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-2210-15975
Masaya Kawamura, Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana:
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform. CoRR abs/2210.15975 (2022)
2021
[c4]
persistent URL: https://dblp.org/rec/conf/ococosda/WatanabeSRM21
Michiko Watanabe, Yuma Shirahata, Ralph Rose, Kikuo Maekawa:
How Do Speakers Pause and Hesitate in English and Japanese? - A Comparison Using Parallel Corpora of English and Japanese Presentation Speeches -. O-COCOSDA 2021: 164-167
2020
[c3]
persistent URL: https://dblp.org/rec/conf/interspeech/ShirahataSM20
Yuma Shirahata, Daisuke Saito, Nobuaki Minematsu:
Discriminative Method to Extract Coarse Prosodic Structure and its Application for Statistical Phrase/Accent Command Estimation. INTERSPEECH 2020: 4427-4431

2010 – 2019

2019
[c2]
persistent URL: https://dblp.org/rec/conf/blizzard/GotoSKSSM19
Shunsuke Goto, Yuma Shirahata, Gaku Kotani, Hitoshi Suda, Daisuke Saito, Nobuaki Minematsu:
The UTokyo speech synthesis system for Blizzard Challenge 2019. Blizzard Challenge 2019
[c1]
persistent URL: https://dblp.org/rec/conf/ssw/ShirahataSM19
Yuma Shirahata, Daisuke Saito, Nobuaki Minematsu:
Generative Modeling of F0 Contours Leveraged by Phrase Structure and Its Application to Statistical Focus Control. SSW 2019: 228-233