default search action
Gustav Eje Henter
Person information
- affiliation: KTH Royal Institute of Technology, Stockholm, Sweden
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j14]Taras Kucherenko, Pieter Wolfert, Youngwoo Yoon, Carla Viegas, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter:
Evaluating Gesture Generation in a Large-scale Open Challenge: The GENEA Challenge 2022. ACM Trans. Graph. 43(3): 32:1-32:28 (2024) - [c59]Shivam Mehta, Anna Deichler, Jim O'Regan, Birger Moëll, Jonas Beskow, Gustav Eje Henter, Simon Alexanderson:
Fake it to make it: Using synthetic data to remedy the data shortage in joint multi-modal speech-and-gesture synthesis. CVPR Workshops 2024: 1952-1964 - [c58]Shivam Mehta, Ruibo Tu, Simon Alexanderson, Jonas Beskow, Éva Székely, Gustav Eje Henter:
Unified Speech and Gesture Synthesis Using Flow Matching. ICASSP 2024: 8220-8224 - [c57]Shivam Mehta, Ruibo Tu, Jonas Beskow, Éva Székely, Gustav Eje Henter:
Matcha-TTS: A Fast TTS Architecture with Conditional Flow Matching. ICASSP 2024: 11341-11345 - [i48]Ulme Wennberg, Gustav Eje Henter:
Exploring Internal Numeracy in Language Models: A Case Study on ALBERT. CoRR abs/2404.16574 (2024) - [i47]Shivam Mehta, Anna Deichler, Jim O'Regan, Birger Moëll, Jonas Beskow, Gustav Eje Henter, Simon Alexanderson:
Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis. CoRR abs/2404.19622 (2024) - [i46]Shivam Mehta, Harm Lameris, Rajiv Punmiya, Jonas Beskow, Éva Székely, Gustav Eje Henter:
Should you use a probabilistic duration model in TTS? Probably! Especially for spontaneous speech. CoRR abs/2406.05401 (2024) - [i45]Ruibo Tu, Zineb Senane, Lele Cao, Cheng Zhang, Hedvig Kjellström, Gustav Eje Henter:
Causality for Tabular Data Synthesis: A High-Order Structure Causal Benchmark Framework. CoRR abs/2406.08311 (2024) - [i44]Lauri Juvela, Pablo Pérez Zarazaga, Gustav Eje Henter, Zofia Malisz:
HiFi-Glot: Neural Formant Synthesis with Differentiable Resonant Filters. CoRR abs/2409.14823 (2024) - [i43]Jacob J. Webber, Oliver Watts, Gustav Eje Henter, Jennifer Williams, Simon King:
Voice Conversion-based Privacy through Adversarial Information Hiding. CoRR abs/2409.14919 (2024) - [i42]Rajmund Nagy, Hendric Voß, Youngwoo Yoon, Taras Kucherenko, Teodor Nikolov, Thanh Hoang-Minh, Rachel McDonnell, Stefan Kopp, Michael Neff, Gustav Eje Henter:
Towards a GENEA Leaderboard - an Extended, Living Benchmark for Evaluating and Advancing Conversational Motion Synthesis. CoRR abs/2410.06327 (2024) - 2023
- [j13]Simbarashe Nyatsanga, Taras Kucherenko, Chaitanya Ahuja, Gustav Eje Henter, Michael Neff:
A Comprehensive Review of Data-Driven Co-Speech Gesture Generation. Comput. Graph. Forum 42(2): 569-596 (2023) - [j12]Simon Alexanderson, Rajmund Nagy, Jonas Beskow, Gustav Eje Henter:
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models. ACM Trans. Graph. 42(4): 44:1-44:20 (2023) - [c56]Harm Lameris, Shivam Mehta, Gustav Eje Henter, Joakim Gustafson, Éva Székely:
Prosody-Controllable Spontaneous TTS with Neural HMMS. ICASSP 2023: 1-5 - [c55]Siyang Wang, Gustav Eje Henter, Joakim Gustafson, Éva Székely:
A Comparative Study of Self-Supervised Speech Representations in Read and Spontaneous TTS. ICASSP Workshops 2023: 1-5 - [c54]Jacob J. Webber, Cassia Valentini-Botinhao, Evelyn Williams, Gustav Eje Henter, Simon King:
Autovocoder: Fast Waveform Generation from a Learned Speech Representation Using Differentiable Digital Signal Processing. ICASSP 2023: 1-5 - [c53]Pablo Pérez Zarazaga, Gustav Eje Henter, Zofia Malisz:
A Processing Framework to Access Large Quantities of Whispered Speech Found in ASMR. ICASSP 2023: 1-5 - [c52]Pieter Wolfert, Gustav Eje Henter, Tony Belpaeme:
"Am I listening?", Evaluating the Quality of Generated Data-driven Listening Motion. ICMI Companion 2023: 6-10 - [c51]Taras Kucherenko, Rajmund Nagy, Youngwoo Yoon, Jieyeon Woo, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter:
The GENEA Challenge 2023: A large-scale evaluation of gesture generation models in monadic and dyadic settings. ICMI 2023: 792-801 - [c50]Youngwoo Yoon, Taras Kucherenko, Jieyeon Woo, Pieter Wolfert, Rajmund Nagy, Gustav Eje Henter:
GENEA Workshop 2023: The 4th Workshop on Generation and Evaluation of Non-verbal Behaviour for Embodied Agents. ICMI 2023: 822-823 - [c49]Shivam Mehta, Ambika Kirkland, Harm Lameris, Jonas Beskow, Éva Székely, Gustav Eje Henter:
OverFlow: Putting flows on top of neural transducers for better TTS. INTERSPEECH 2023: 4279-4283 - [c48]Pablo Pérez Zarazaga, Zofia Malisz, Gustav Eje Henter, Lauri Juvela:
Speaker-independent neural formant synthesis. INTERSPEECH 2023: 5556-5560 - [c47]Ambika Kirkland, Shivam Mehta, Harm Lameris, Gustav Eje Henter, Éva Székely, Joakim Gustafson:
Stuck in the MOS pit: A critical analysis of MOS test methodology in TTS evaluation. SSW 2023: 41-47 - [c46]Shivam Mehta, Siyang Wang, Simon Alexanderson, Jonas Beskow, Éva Székely, Gustav Eje Henter:
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis. SSW 2023: 150-156 - [c45]Siyang Wang, Gustav Eje Henter, Joakim Gustafson, Éva Székely:
On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis. SSW 2023: 163-169 - [i41]Simbarashe Nyatsanga, Taras Kucherenko, Chaitanya Ahuja, Gustav Eje Henter, Michael Neff:
A Comprehensive Review of Data-Driven Co-Speech Gesture Generation. CoRR abs/2301.05339 (2023) - [i40]Carlos Puerto-Santana, Concha Bielza, Pedro Larrañaga, Gustav Eje Henter:
Context-specific kernel-based hidden Markov model for time series analysis. CoRR abs/2301.09870 (2023) - [i39]Siyang Wang, Gustav Eje Henter, Joakim Gustafson, Éva Székely:
A Comparative Study of Self-Supervised Speech Representations in Read and Spontaneous TTS. CoRR abs/2303.02719 (2023) - [i38]Pablo Pérez Zarazaga, Gustav Eje Henter, Zofia Malisz:
A processing framework to access large quantities of whispered speech found in ASMR. CoRR abs/2303.07442 (2023) - [i37]Taras Kucherenko, Pieter Wolfert, Youngwoo Yoon, Carla Viegas, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter:
Evaluating gesture-generation in a large-scale open challenge: The GENEA Challenge 2022. CoRR abs/2303.08737 (2023) - [i36]Shivam Mehta, Siyang Wang, Simon Alexanderson, Jonas Beskow, Éva Székely, Gustav Eje Henter:
Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis. CoRR abs/2306.09417 (2023) - [i35]Siyang Wang, Gustav Eje Henter, Joakim Gustafson, Éva Székely:
On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis. CoRR abs/2307.05132 (2023) - [i34]Taras Kucherenko, Rajmund Nagy, Youngwoo Yoon, Jieyeon Woo, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter:
The GENEA Challenge 2023: A large scale evaluation of gesture generation models in monadic and dyadic settings. CoRR abs/2308.12646 (2023) - [i33]Shivam Mehta, Ruibo Tu, Jonas Beskow, Éva Székely, Gustav Eje Henter:
Matcha-TTS: A fast TTS architecture with conditional flow matching. CoRR abs/2309.03199 (2023) - [i32]Shivam Mehta, Ruibo Tu, Simon Alexanderson, Jonas Beskow, Éva Székely, Gustav Eje Henter:
Unified speech and gesture synthesis using flow matching. CoRR abs/2310.05181 (2023) - 2022
- [j11]Jan G. De Gooijer, Gustav Eje Henter, Ao Yuan:
Kernel-based hidden Markov conditional densities. Comput. Stat. Data Anal. 169: 107431 (2022) - [c44]Taras Kucherenko, Rajmund Nagy, Michael Neff, Hedvig Kjellström, Gustav Eje Henter:
Multimodal Analysis of the Predictability of Hand-gesture Properties. AAMAS 2022: 770-779 - [c43]Gustavo Teodoro Döhler Beck, Ulme Wennberg, Zofia Malisz, Gustav Eje Henter:
Wavebender GAN: An Architecture for Phonetically Meaningful Speech Manipulation. ICASSP 2022: 6187-6191 - [c42]Shivam Mehta, Éva Székely, Jonas Beskow, Gustav Eje Henter:
Neural HMMS Are All You Need (For High-Quality Attention-Free TTS). ICASSP 2022: 7457-7461 - [c41]Youngwoo Yoon, Pieter Wolfert, Taras Kucherenko, Carla Viegas, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter:
The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation. ICMI 2022: 736-747 - [c40]Pieter Wolfert, Taras Kucherenko, Carla Viegas, Zerrin Yumak, Youngwoo Yoon, Gustav Eje Henter:
GENEA Workshop 2022: The 3rd Workshop on Generation and Evaluation of Non-verbal Behaviour for Embodied Agents. ICMI 2022: 799-800 - [c39]Cassia Valentini-Botinhao, Manuel Sam Ribeiro, Oliver Watts, Korin Richmond, Gustav Eje Henter:
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks. INTERSPEECH 2022: 471-475 - [c38]Jason Fong, Daniel Lyth, Gustav Eje Henter, Hao Tang, Simon King:
Speech Audio Corrector: using speech from non-target speakers for one-off correction of mispronunciations in grapheme-input text-to-speech. INTERSPEECH 2022: 1213-1217 - [i31]Gustavo Teodoro Döhler Beck, Ulme Wennberg, Zofia Malisz, Gustav Eje Henter:
Wavebender GAN: An architecture for phonetically meaningful speech manipulation. CoRR abs/2202.10973 (2022) - [i30]Youngwoo Yoon, Pieter Wolfert, Taras Kucherenko, Carla Viegas, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter:
The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation. CoRR abs/2208.10441 (2022) - [i29]Cassia Valentini-Botinhao, Manuel Sam Ribeiro, Oliver Watts, Korin Richmond, Gustav Eje Henter:
Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks. CoRR abs/2209.11003 (2022) - [i28]Shivam Mehta, Ambika Kirkland, Harm Lameris, Jonas Beskow, Éva Székely, Gustav Eje Henter:
OverFlow: Putting flows on top of neural transducers for better TTS. CoRR abs/2211.06892 (2022) - [i27]Jacob J. Webber, Cassia Valentini-Botinhao, Evelyn Williams, Gustav Eje Henter, Simon King:
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing. CoRR abs/2211.06989 (2022) - [i26]Simon Alexanderson, Rajmund Nagy, Jonas Beskow, Gustav Eje Henter:
Listen, denoise, action! Audio-driven motion synthesis with diffusion models. CoRR abs/2211.09707 (2022) - [i25]Harm Lameris, Shivam Mehta, Gustav Eje Henter, Joakim Gustafson, Éva Székely:
Prosody-controllable spontaneous TTS with neural HMMs. CoRR abs/2211.13533 (2022) - 2021
- [j10]Patrik Jonell, Birger Moëll, Krister Håkansson, Gustav Eje Henter, Taras Kucherenko, Olga Mikheeva, Göran Hagman, Jasper Holleman, Miia Kivipelto, Hedvig Kjellström, Joakim Gustafson, Jonas Beskow:
Multimodal Capture of Patient Behaviour for Improved Detection of Early Dementia: Clinical Feasibility and Preliminary Results. Frontiers Comput. Sci. 3: 642633 (2021) - [j9]Taras Kucherenko, Dai Hasegawa, Naoshi Kaneko, Gustav Eje Henter, Hedvig Kjellström:
Moving Fast and Slow: Analysis of Representations and Post-Processing in Speech-Driven Automatic Gesture Generation. Int. J. Hum. Comput. Interact. 37(14): 1300-1316 (2021) - [j8]Guillermo Valle Pérez, Gustav Eje Henter, Jonas Beskow, Andre Holzapfel, Pierre-Yves Oudeyer, Simon Alexanderson:
Transflower: probabilistic autoregressive dance generation with multimodal attention. ACM Trans. Graph. 40(6): 195:1-195:14 (2021) - [c37]Ulme Wennberg, Gustav Eje Henter:
The Case for Translation-Invariant Self-Attention in Transformer-Based Language Models. ACL/IJCNLP (2) 2021: 130-140 - [c36]Moein Sorkhei, Gustav Eje Henter, Hedvig Kjellström:
Full-Glow: Fully Conditional Glow for More Realistic Image Generation. GCPR 2021: 697-711 - [c35]Siyang Wang, Simon Alexanderson, Joakim Gustafson, Jonas Beskow, Gustav Eje Henter, Éva Székely:
Integrated Speech and Gesture Synthesis. ICMI 2021: 177-185 - [c34]Patrik Jonell, Youngwoo Yoon, Pieter Wolfert, Taras Kucherenko, Gustav Eje Henter:
HEMVIP: Human Evaluation of Multiple Videos in Parallel. ICMI 2021: 707-711 - [c33]Taras Kucherenko, Patrik Jonell, Youngwoo Yoon, Pieter Wolfert, Zerrin Yumak, Gustav Eje Henter:
GENEA Workshop 2021: The 2nd Workshop on Generation and Evaluation of Non-verbal Behaviour for Embodied Agents. ICMI 2021: 872-873 - [c32]Taras Kucherenko, Patrik Jonell, Youngwoo Yoon, Pieter Wolfert, Gustav Eje Henter:
A Large, Crowdsourced Evaluation of Gesture Generation Systems on Common Data: The GENEA Challenge 2020. IUI 2021: 11-21 - [c31]Taras Kucherenko, Rajmund Nagy, Patrik Jonell, Michael Neff, Hedvig Kjellström, Gustav Eje Henter:
Speech2Properties2Gestures: Gesture-Property Prediction as a Tool for Generating Representational Gestures from Speech. IVA 2021: 145-147 - [i24]Simon Alexanderson, Éva Székely, Gustav Eje Henter, Taras Kucherenko, Jonas Beskow:
Generating coherent spontaneous speech and gesture from text. CoRR abs/2101.05684 (2021) - [i23]Patrik Jonell, Youngwoo Yoon, Pieter Wolfert, Taras Kucherenko, Gustav Eje Henter:
HEMVIP: Human Evaluation of Multiple Videos in Parallel. CoRR abs/2101.11898 (2021) - [i22]Anubhab Ghosh, Antoine Honoré, Dong Liu, Gustav Eje Henter, Saikat Chatterjee:
Robust Classification using Hidden Markov Models and Mixtures of Normalizing Flows. CoRR abs/2102.07284 (2021) - [i21]Taras Kucherenko, Patrik Jonell, Youngwoo Yoon, Pieter Wolfert, Gustav Eje Henter:
A large, crowdsourced evaluation of gesture generation systems on common data: The GENEA Challenge 2020. CoRR abs/2102.11617 (2021) - [i20]Ulme Wennberg, Gustav Eje Henter:
The Case for Translation-Invariant Self-Attention in Transformer-Based Language Models. CoRR abs/2106.01950 (2021) - [i19]Guillermo Valle Pérez, Gustav Eje Henter, Jonas Beskow, Andre Holzapfel, Pierre-Yves Oudeyer, Simon Alexanderson:
Transflower: probabilistic autoregressive dance generation with multimodal attention. CoRR abs/2106.13871 (2021) - [i18]Taras Kucherenko, Rajmund Nagy, Patrik Jonell, Michael Neff, Hedvig Kjellström, Gustav Eje Henter:
Speech2Properties2Gestures: Gesture-Property Prediction as a Tool for Generating Representational Gestures from Speech. CoRR abs/2106.14736 (2021) - [i17]Anubhab Ghosh, Antoine Honoré, Dong Liu, Gustav Eje Henter, Saikat Chatterjee:
Normalizing Flow based Hidden Markov Models for Classification of Speech Phones with Explainability. CoRR abs/2107.00730 (2021) - [i16]Taras Kucherenko, Rajmund Nagy, Michael Neff, Hedvig Kjellström, Gustav Eje Henter:
Multimodal analysis of the predictability of hand-gesture properties. CoRR abs/2108.05762 (2021) - [i15]Siyang Wang, Simon Alexanderson, Joakim Gustafson, Jonas Beskow, Gustav Eje Henter, Éva Székely:
Integrated Speech and Gesture Synthesis. CoRR abs/2108.11436 (2021) - [i14]Shivam Mehta, Éva Székely, Jonas Beskow, Gustav Eje Henter:
Neural HMMs are all you need (for high-quality attention-free TTS). CoRR abs/2108.13320 (2021) - 2020
- [j7]Simon Alexanderson, Gustav Eje Henter, Taras Kucherenko, Jonas Beskow:
Style-Controllable Speech-Driven Gesture Synthesis Using Normalising Flows. Comput. Graph. Forum 39(2): 487-496 (2020) - [j6]Gustav Eje Henter, Simon Alexanderson, Jonas Beskow:
MoGlow: probabilistic and controllable motion synthesis using normalising flows. ACM Trans. Graph. 39(6): 236:1-236:14 (2020) - [c30]Éva Székely, Gustav Eje Henter, Jonas Beskow, Joakim Gustafson:
Breathing and Speech Planning in Spontaneous Speech Synthesis. ICASSP 2020: 7649-7653 - [c29]Taras Kucherenko, Patrik Jonell, Sanne van Waveren, Gustav Eje Henter, Simon Alexandersson, Iolanda Leite, Hedvig Kjellström:
Gesticulator: A framework for semantically-aware speech-driven gesture generation. ICMI 2020: 242-250 - [c28]Simon Alexanderson, Éva Székely, Gustav Eje Henter, Taras Kucherenko, Jonas Beskow:
Generating coherent spontaneous speech and gesture from text. IVA 2020: 1:1-1:3 - [c27]Patrik Jonell, Taras Kucherenko, Gustav Eje Henter, Jonas Beskow:
Let's Face It: Probabilistic Multi-modal Interlocutor-aware Generation of Facial Gestures in Dyadic Settings. IVA 2020: 31:1-31:8 - [c26]Anubhab Ghosh, Antoine Honoré, Dong Liu, Gustav Eje Henter, Saikat Chatterjee:
Robust Classification Using Hidden Markov Models and Mixtures of Normalizing Flows. MLSP 2020: 1-6 - [i13]Taras Kucherenko, Patrik Jonell, Sanne van Waveren, Gustav Eje Henter, Simon Alexanderson, Iolanda Leite, Hedvig Kjellström:
Gesticulator: A framework for semantically-aware speech-driven gesture generation. CoRR abs/2001.09326 (2020) - [i12]Simon Alexanderson, Gustav Eje Henter:
Robust model training and generalisation with Studentising flows. CoRR abs/2006.06599 (2020) - [i11]Patrik Jonell, Taras Kucherenko, Gustav Eje Henter, Jonas Beskow:
Let's face it: Probabilistic multi-modal interlocutor-aware generation of facial gestures in dyadic settings. CoRR abs/2006.09888 (2020) - [i10]Taras Kucherenko, Dai Hasegawa, Naoshi Kaneko, Gustav Eje Henter, Hedvig Kjellström:
Moving fast and slow: Analysis of representations and post-processing in speech-driven automatic gesture generation. CoRR abs/2007.09170 (2020) - [i9]Moein Sorkhei, Gustav Eje Henter, Hedvig Kjellström:
Full-Glow: Fully conditional Glow for more realistic image generation. CoRR abs/2012.05846 (2020)
2010 – 2019
- 2019
- [c25]Taras Kucherenko, Dai Hasegawa, Naoshi Kaneko, Gustav Eje Henter, Hedvig Kjellström:
On the Importance of Representations for Speech-Driven Gesture Generation. AAMAS 2019: 2072-2074 - [c24]Éva Székely, Gustav Eje Henter, Joakim Gustafson:
Casting to Corpus: Segmenting and Selecting Spontaneous Dialogue for Tts with a Cnn-lstm Speaker-dependent Breath Detector. ICASSP 2019: 6925-6929 - [c23]Éva Székely, Gustav Eje Henter, Jonas Beskow, Joakim Gustafson:
Off the Cuff: Exploring Extemporaneous Speech Delivery with TTS. INTERSPEECH 2019: 3687-3688 - [c22]Éva Székely, Gustav Eje Henter, Jonas Beskow, Joakim Gustafson:
Spontaneous Conversational Speech Synthesis from Found Data. INTERSPEECH 2019: 4435-4439 - [c21]Taras Kucherenko, Dai Hasegawa, Gustav Eje Henter, Naoshi Kaneko, Hedvig Kjellström:
Analyzing Input and Output Representations for Speech-Driven Gesture Generation. IVA 2019: 97-104 - [c20]Petra Wagner, Jonas Beskow, Simon Betz, Jens Edlund, Joakim Gustafson, Gustav Eje Henter, Sébastien Le Maguer, Zofia Malisz, Éva Székely, Christina Tånnander, Jana Voße:
Speech Synthesis Evaluation - State-of-the-Art Assessment and Suggestion for a Novel Research Program. SSW 2019: 105-110 - [c19]Oliver Watts, Gustav Eje Henter, Jason Fong, Cassia Valentini-Botinhao:
Where do the improvements come from in sequence-to-sequence neural TTS? SSW 2019: 217-222 - [c18]Éva Székely, Gustav Eje Henter, Jonas Beskow, Joakim Gustafson:
How to train your fillers: uh and um in spontaneous speech synthesis. SSW 2019: 245-250 - [i8]Taras Kucherenko, Dai Hasegawa, Gustav Eje Henter, Naoshi Kaneko, Hedvig Kjellström:
Analyzing Input and Output Representations for Speech-Driven Gesture Generation. CoRR abs/1903.03369 (2019) - [i7]Gustav Eje Henter, Simon Alexanderson, Jonas Beskow:
MoGlow: Probabilistic and controllable motion synthesis using normalising flows. CoRR abs/1905.06598 (2019) - [i6]Seyyed Saeed Sarfjoo, Xin Wang, Gustav Eje Henter, Jaime Lorenzo-Trueba, Shinji Takaki, Junichi Yamagishi:
Transformation of low-quality device-recorded speech to high-quality speech using improved SEGAN model. CoRR abs/1911.03952 (2019) - 2018
- [j5]Jaime Lorenzo-Trueba, Gustav Eje Henter, Shinji Takaki, Junichi Yamagishi, Yosuke Morino, Yuta Ochiai:
Investigating different representations for modeling and controlling multiple emotions in DNN-based speech synthesis. Speech Commun. 99: 135-143 (2018) - [c17]Gustav Eje Henter, Jaime Lorenzo-Trueba, Xin Wang, Mariko Kondo, Junichi Yamagishi:
Cyborg Speech: Deep Multilingual Speech Synthesis for Generating Segmental Foreign Accent with Natural Prosody. ICASSP 2018: 4799-4803 - [i5]Gustav Eje Henter, Simon King, Thomas Merritt, Gilles Degottex:
Analysing Shortcomings of Statistical Parametric Speech Synthesis. CoRR abs/1807.10941 (2018) - [i4]Gustav Eje Henter, Arne Leijon, W. Bastiaan Kleijn:
Kernel Density Estimation-Based Markov Models with Hidden State. CoRR abs/1807.11320 (2018) - [i3]Gustav Eje Henter, Xin Wang, Junichi Yamagishi:
Deep Encoder-Decoder Models for Unsupervised Learning of Controllable Speech Synthesis. CoRR abs/1807.11470 (2018) - 2017
- [c16]Hieu-Thi Luong, Shinji Takaki, Gustav Eje Henter, Junichi Yamagishi:
Adapting and controlling DNN-based speech synthesis using input codes. ICASSP 2017: 4905-4909 - [c15]Jaime Lorenzo-Trueba, Cassia Valentini-Botinhao, Gustav Eje Henter, Junichi Yamagishi:
Misperceptions of the Emotional Content of Natural and Vocoded Speech in a Car. INTERSPEECH 2017: 606-610 - [c14]Gustav Eje Henter, Jaime Lorenzo-Trueba, Xin Wang, Junichi Yamagishi:
Principles for Learning Controllable TTS from Annotated and Latent Variation. INTERSPEECH 2017: 3956-3960 - [i2]Sang Phan, Gustav Eje Henter, Yusuke Miyao, Shin'ichi Satoh:
Consensus-based Sequence Training for Video Captioning. CoRR abs/1712.09532 (2017) - 2016
- [j4]Gustav Eje Henter, W. Bastiaan Kleijn:
Minimum Entropy Rate Simplification of Stochastic Processes. IEEE Trans. Pattern Anal. Mach. Intell. 38(12): 2487-2500 (2016) - [j3]Leijon Leijon, Gustav Eje Henter, Martin Dahlquist:
Bayesian Analysis of Phoneme Confusion Matrices. IEEE ACM Trans. Audio Speech Lang. Process. 24(3): 469-482 (2016) - [c13]Gustav Eje Henter, Srikanth Ronanki, Oliver Watts, Mirjam Wester, Zhizheng Wu, Simon King:
Robust TTS duration modelling using DNNS. ICASSP 2016: 5130-5134 - [c12]Rasmus Dall, Sandrine Brognaux, Korin Richmond, Cassia Valentini-Botinhao, Gustav Eje Henter, Julia Hirschberg, Junichi Yamagishi, Simon King:
Testing the consistency assumption: Pronunciation variant forced alignment in read and spontaneous speech synthesis. ICASSP 2016: 5155-5159 - [c11]Oliver Watts, Gustav Eje Henter, Thomas Merritt, Zhizheng Wu, Simon King:
From HMMS to DNNS: Where do the improvements come from? ICASSP 2016: 5505-5509 - [c10]Takenori Yoshimura, Gustav Eje Henter, Oliver Watts, Mirjam Wester, Junichi Yamagishi, Keiichi Tokuda:
A Hierarchical Predictor of Synthetic Speech Naturalness Using Neural Networks. INTERSPEECH 2016: 342-346 - [c9]Srikanth Ronanki, Gustav Eje Henter, Zhizheng Wu, Simon King:
A Template-Based Approach for Speech Synthesis Intonation Generation Using LSTMs. INTERSPEECH 2016: 2463-2467 - [c8]Srikanth Ronanki, Oliver Watts, Simon King, Gustav Eje Henter:
Median-based generation of synthetic speech durations using a non-parametric approach. SLT 2016: 686-692 - [i1]Srikanth Ronanki, Oliver Watts, Simon King, Gustav Eje Henter:
Median-Based Generation of Synthetic Speech Durations using a Non-Parametric Approach. CoRR abs/1608.06134 (2016) - 2015
- [c7]Mirjam Wester, Cassia Valentini-Botinhao, Gustav Eje Henter:
Are we using enough listeners? no! - an empirically-supported critique of interspeech 2014 TTS evaluations. INTERSPEECH 2015: 3476-3480 - 2014
- [c6]Matthew P. Aylett, Rasmus Dall, Arnab Ghoshal, Gustav Eje Henter, Thomas Merritt:
A flexible front-end for HTS. INTERSPEECH 2014: 1283-1287 - [c5]Gustav Eje Henter, Thomas Merritt, Matt Shannon, Catherine Mayo, Simon King:
Measuring the perceptual effects of modelling assumptions in speech synthesis using stimuli constructed from repeated natural speech. INTERSPEECH 2014: 1504-1508 - 2013
- [b1]Gustav Eje Henter:
Probabilistic Sequence Models with Speech and Language Applications. Royal Institute of Technology, Stockholm, Sweden, 2013 - [j2]Gustav Eje Henter, W. Bastiaan Kleijn:
Picking up the pieces: Causal states in noisy data, and how to recover them. Pattern Recognit. Lett. 34(5): 587-594 (2013) - [j1]Petko Nikolov Petkov, Gustav Eje Henter, W. Bastiaan Kleijn:
Maximizing Phoneme Recognition Accuracy for Enhanced Speech Intelligibility in Noise. IEEE Trans. Speech Audio Process. 21(5): 1035-1045 (2013) - 2012
- [c4]Gustav Eje Henter, Marcus R. Frean, W. Bastiaan Kleijn:
Gaussian process dynamical models for nonparametric speech representation and synthesis. ICASSP 2012: 4505-4508 - [c3]Petko Nikolov Petkov, W. Bastiaan Kleijn, Gustav Eje Henter:
Enhancing Subjective Speech Intelligibility Using a Statistical Model of Speech. INTERSPEECH 2012: 166-169 - 2011
- [c2]Gustav Eje Henter, W. Bastiaan Kleijn:
Intermediate-State HMMs to Capture Continuously-Changing Signal Features. INTERSPEECH 2011: 1817-1820 - 2010
- [c1]Gustav Eje Henter, W. Bastiaan Kleijn:
Simplified probability models for generative tasks: A rate-distortion approach. EUSIPCO 2010: 1159-1163
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-18 20:47 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint