Daisuke Saito
2020 – today
- 2024
- [j15] Yui Ono, Daisuke Saito, Hironori Washizaki, Yoshiaki Fukazawa:
Measuring Complexity in Visual Programming for Elementary School Students. J. Inf. Process. 32: 103-112 (2024)
- [c113] Yui Ono, Daisuke Saito, Hironori Washizaki:
Evaluating Preschoolers' Block Programming Using Complexity and Personality Traits. CSEE&T 2024: 1-5
- [c112] Shinnosuke Takamichi, Hiroki Maeda, Joonyong Park, Daisuke Saito, Hiroshi Saruwatari:
Do Learned Speech Symbols Follow Zipf's Law? ICASSP 2024: 12526-12530
- [c111] Daisuke Saito, Yojiro Mori, Kohei Hosokawa, Shigeyuki Yanagimachi, Hiroshi Hasegawa:
Cost-Effective Capacity Enhancement of Survivable Optical Networks by Supplemental Band Expansion and Backup Resource Sharing. OFC 2024: 1-3
- [c110] Ruochen Tian, Daisuke Saito, Hironori Washizaki, Yoshiaki Fukazawa, Hiroshi Kobayashi, Ayumi Tsuji:
Enhancing Programming Education through Game-Based Learning: Design and Implementation of a Puyo Puyo-Inspired Teaching Tool. SIGCSE (2) 2024: 1838-1839
- [i8] Kentaro Onda, Joonyong Park, Nobuaki Minematsu, Daisuke Saito:
A Pilot Study of GSLM-based Simulation of Foreign Accentuation Only Using Native Speech Corpora. CoRR abs/2407.11370 (2024)
- [i7] Haopeng Geng, Daisuke Saito, Nobuaki Minematsu:
Simulating Native Speaker Shadowing for Nonnative Speech Assessment with Latent Speech Representations. CoRR abs/2409.11742 (2024)
- [i6] Haopeng Geng, Daisuke Saito, Nobuaki Minematsu:
A Pilot Study of Applying Sequence-to-Sequence Voice Conversion to Evaluate the Intelligibility of L2 Speech Using a Native Speaker's Shadowings. CoRR abs/2410.02239 (2024)
- 2023
- [j14] Naotake Masuda, Daisuke Saito:
Quality-diversity for Synthesizer Sound Matching. J. Inf. Process. 31: 220-228 (2023)
- [j13] Naotake Masuda, Daisuke Saito:
Improving Semi-Supervised Differentiable Synthesizer Sound Matching for Practical Applications. IEEE ACM Trans. Audio Speech Lang. Process. 31: 863-875 (2023)
- [c109] Daisuke Saito, Ruochen Tian, Hironori Washizaki, Yoshiaki Fukazawa:
Programming Education for Young People using the Falling-Puzzle Game, "Puyo Puyo". EDUCON 2023: 1-5
- [c108] Yurun He, Nobuaki Minematsu, Daisuke Saito:
Multiple Acoustic Features Speech Emotion Recognition Using Cross-Attention Transformer. ICASSP 2023: 1-5
- [c107] Yingxiang Gao, Jaehyun Choi, Nobuaki Minematsu, Noriko Nakanishi, Daisuke Saito:
Automatic Prediction of Language Learners' Listenability Using Speech and Text Features Extracted from Listening Drills. INTERSPEECH 2023: 979-983
- [c106] Daisuke Saito, Yojiro Mori, Kohei Hosokawa, Shigeyuki Yanagimachi, Hiroshi Hasegawa:
Cost-effective Network Capacity Enhancement with Multi-band Virtual Bypass Links. OFC 2023: 1-3
- [c105] Rose Niousha, Daisuke Saito, Hironori Washizaki, Yoshiaki Fukazawa:
Gender Characteristics and Computational Thinking in Scratch. SIGCSE (2) 2023: 1344
- [c104] Chihiro Shoda, Yingxiang Gao, Yurun He, Nobuaki Minematsu, Noriko Nakanishi, Daisuke Saito:
Learners' Prosodic Control in the Task of Expressive Storytelling and Predicted Native Listeners' Impressions of the Learners' Speech. SLaTE 2023: 46-50
- [c103] Yusuke Shozui, Nobuaki Minematsu, Noriko Nakanishi, Daisuke Saito:
Density and Entropy of Spoken Syllables in American English and Japanese English Estimated with Acoustic Word Embeddings. SLaTE 2023: 131-135
- [c102] Daisuke Saito, Hironori Washizaki, Yui Ono, Yoshiaki Fukazawa, Mio Ezure:
Work-in-Progress: Relating Logical Thinking Skills to Program Complexity in Children's Programming Education. TALE 2023: 1-4
- [i5] Shinnosuke Takamichi, Hiroki Maeda, Joonyong Park, Daisuke Saito, Hiroshi Saruwatari:
Do learned speech symbols follow Zipf's law? CoRR abs/2309.09690 (2023)
- 2022
- [j12] Hitoshi Suda, Gaku Kotani, Daisuke Saito:
INmfCA Algorithm for Training of Nonparallel Voice Conversion Systems Based on Non-Negative Matrix Factorization. IEICE Trans. Inf. Syst. 105-D(6): 1196-1210 (2022)
- [j11] Hitoshi Suda, Daisuke Saito, Satoru Fukayama, Tomoyasu Nakano, Masataka Goto:
Singer Diarization for Polyphonic Music With Unison Singing. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1531-1545 (2022)
- [j10] Gaku Kotani, Daisuke Saito, Nobuaki Minematsu:
Voice Conversion Based on Deep Neural Networks for Time-Variant Linear Transformations. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2981-2992 (2022)
- [c101] Eisuke Konno, Daisuke Saito, Nobuaki Minematsu:
Quantifying Discriminability between NMF Bases. ICASSP 2022: 691-695
- [c100] Takeru Gorai, Daisuke Saito, Nobuaki Minematsu:
Text-to-speech synthesis using spectral modeling based on non-negative autoencoder. INTERSPEECH 2022: 1621-1625
- [c99] Takuya Kunihara, Chuanbo Zhu, Daisuke Saito, Nobuaki Minematsu, Noriko Nakanishi:
Detection of Learners' Listening Breakdown with Oral Dictation and Its Use to Model Listening Skill Improvement Exclusively Through Shadowing. INTERSPEECH 2022: 4461-4465
- [c98] Zhuo Gong, Daisuke Saito, Longfei Yang, Takahiro Shinozaki, Sheng Li, Hisashi Kawai, Nobuaki Minematsu:
Self-Adaptive Multilingual ASR Rescoring with Language Identification and Unified Language Model. Odyssey 2022: 415-420
- [c97] Chuanbo Zhu, Takuya Kunihara, Daisuke Saito, Nobuaki Minematsu, Noriko Nakanishi:
Automatic Prediction of Intelligibility of Words and Phonemes Produced Orally by Japanese Learners of English. SLT 2022: 1029-1036
- [c96] Rose Niousha, Daisuke Saito, Hironori Washizaki, Yoshiaki Fukazawa:
Scratch Project Analysis: Relationship Between Gender and Computational Thinking Skill. TALE 2022: 567-571
- 2021
- [c95] Ruiyan Chen, Tazuko Nishimura, Nobuaki Minematsu, Daisuke Saito:
Acoustic Simulation of Body-conducted Speech and Its Use to Convert One's Recorded Voices to One's Own Voices. APSIPA ASC 2021: 821-828
- [c94] Chuanbo Zhu, Ryo Hakoda, Daisuke Saito, Nobuaki Minematsu, Noriko Nakanishi, Tazuko Nishimura:
Multi-Granularity Annotation of Instantaneous Intelligibility of Learners' Utterances Based on Shadowing Techniques. ASRU 2021: 1071-1078
- [c93] Koki Miura, Daisuke Saito, Hironori Washizaki, Yoshiaki Fukazawa:
Automated Educational Program Mapping on Learning Standards in Computer Science. COMPSAC 2021: 1405-1406
- [c92] Yasuhiro Watanabe, Hironori Washizaki, Kazunori Sakamoto, Daisuke Saito, Kiyoshi Honda, Naohiko Tsuda, Yoshiaki Fukazawa, Nobukazu Yoshioka:
Preliminary Literature Review of Machine Learning System Development Practices. COMPSAC 2021: 1407-1408
- [c91] Naotake Masuda, Daisuke Saito:
Quality Diversity for Synthesizer Sound Matching. DAFx 2021: 300-307
- [c90] Shota Kaieda, Daisuke Saito, Hironori Washizaki, Yoshiaki Fukazawa:
Work-in-Progress: Analysis of the use of Mentoring with Online Mob Programming. EDUCON 2021: 1424-1428
- [c89] Shintaro Ando, Nobuaki Minematsu, Daisuke Saito:
Lexical Density Analysis of Word Productions in Japanese English Using Acoustic Word Embeddings. Interspeech 2021: 4433-4437
- [c88] Naotake Masuda, Daisuke Saito:
Synthesizer Sound Matching with Differentiable DSP. ISMIR 2021: 428-434
- [c87] Yang Shen, Ayano Yasukagawa, Daisuke Saito, Nobuaki Minematsu, Kazuya Saito:
Optimized Prediction of Fluency of L2 English Based on Interpretable Network Using Quantity of Phonation and Quality of Pronunciation. SLT 2021: 698-704
- [c86] Daisuke Saito, Kazunori Sakamoto, Hironori Washizaki, Yoshiaki Fukazawa, Shuichi Uchiyama, Ramzi Ramzi:
Development of a Game to Foster Programming Thinking for Learning through Reading Program. TALE 2021: 1-6
- [c85] Daisuke Saito, Toshiyuki Kobayashi, Hiroki Koga, Nicolo Ronchi, Kaustuv Banerjee, Yusuke Shuto, Jun Okuno, Kenta Konishi, Luca Di Piazza, Arindam Mallik, Jan Van Houdt, Masanori Tsukamoto, Kazunobu Ohkuri, Taku Umebayashi, Takayuki Ezaki:
Analog In-memory Computing in FeFET-based 1T1R Array for Edge AI Applications. VLSI Circuits 2021: 1-2
- [c84] Makoto Shiraishi, Hironori Washizaki, Daisuke Saito, Yoshiaki Fukazawa:
Comparing Participants' Brainwaves During Solo, Pair, and Mob Programming. XP 2021: 200-209
- 2020
- [j9] Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Tensor Factor Analysis for Arbitrary Speaker Conversion. IEICE Trans. Inf. Syst. 103-D(6): 1395-1405 (2020)
- [j8] Daisuke Saito, Shota Kaieda, Hironori Washizaki, Yoshiaki Fukazawa:
Rubric for Measuring and Visualizing the Effects of Learning Computer Programming for Elementary School Students. J. Inf. Technol. Educ. Innov. Pract. 19: 203-227 (2020)
- [c83] Tatsuma Ishihara, Daisuke Saito:
Attention-Based Speaker Embeddings for One-Shot Voice Conversion. INTERSPEECH 2020: 806-810
- [c82] Zhenchao Lin, Ryo Takashima, Daisuke Saito, Nobuaki Minematsu, Noriko Nakanishi:
Shadowability Annotation with Fine Granularity on L2 Utterances and its Improvement with Native Listeners' Script-Shadowing. INTERSPEECH 2020: 3865-3869
- [c81] Yuma Shirahata, Daisuke Saito, Nobuaki Minematsu:
Discriminative Method to Extract Coarse Prosodic Structure and its Application for Statistical Phrase/Accent Command Estimation. INTERSPEECH 2020: 4427-4431
- [c80] Hitoshi Suda, Gaku Kotani, Daisuke Saito:
Nonparallel Training of Exemplar-Based Voice Conversion System Using INCA-Based Alignment Technique. INTERSPEECH 2020: 4681-4685
- [c79] Masaki Okamoto, Daisuke Saito:
Interpretable Driver Models Discovery in Data. ITSC 2020: 1-7
- [c78] Daisuke Saito, Shota Kaieda, Risei Yajima, Hironori Washizaki, Yoshiaki Fukazawa, Hidetoshi Omiya, Misaki Onodera, Idumi Sato:
Assessing Elementary School Students' Programming Thinking Skills using Rubrics. TALE 2020: 181-188
2010 – 2019
- 2019
- [j7] Tetsuya Hashimoto, Daisuke Saito, Nobuaki Minematsu:
Many-to-Many and Completely Parallel-Data-Free Voice Conversion Based on Eigenspace DNN. IEEE ACM Trans. Audio Speech Lang. Process. 27(2): 332-341 (2019)
- [c77] Gaku Kotani, Hitoshi Suda, Daisuke Saito, Nobuaki Minematsu:
Experimental investigation on the efficacy of Affine-DTW in the quality of voice conversion. APSIPA 2019: 119-124
- [c76] Shunsuke Goto, Daisuke Saito, Nobuaki Minematsu:
DNN-based Statistical Parametric Speech Synthesis Incorporating Non-negative Matrix Factorization. APSIPA 2019: 148-153
- [c75] Daisuke Saito, So Suzuki, Nobuaki Minematsu:
Speech representation based on tensor factor analysis and its application to speaker recognition and language identification. APSIPA 2019: 402-406
- [c74] Shunsuke Goto, Yuma Shirahata, Gaku Kotani, Hitoshi Suda, Daisuke Saito, Nobuaki Minematsu:
The UTokyo speech synthesis system for Blizzard Challenge 2019. Blizzard Challenge 2019
- [c73] Yusaku Korematsu, Daisuke Saito, Nobuaki Minematsu:
Cooking State Recognition based on Acoustic Event Detection. CEA@ICMR 2019: 41-44
- [c72] Tasavat Trisitichoke, Shintaro Ando, Daisuke Saito, Nobuaki Minematsu:
Analysis of Native Listeners' Facial Microexpressions While Shadowing Non-Native Speech - Potential of Shadowers' Facial Expressions for Comprehensibility Prediction. INTERSPEECH 2019: 1861-1865
- [c71] Shintaro Ando, Zhenchao Lin, Tasavat Trisitichoke, Yusuke Inoue, Fuki Yoshizawa, Daisuke Saito, Nobuaki Minematsu:
A Large Collection of Sentences Read Aloud by Vietnamese Learners of Japanese and Native Speaker's Reverse Shadowings. O-COCOSDA 2019: 1-6
- [c70] Daisuke Saito, Hironori Washizaki, Yoshiaki Fukazawa, Mariko Tamura, Yuki Sakuragi:
Rubric to Evaluate Programming Learning of Elementary School Students. SIGCSE 2019: 1280
- [c69] Zhenchao Lin, Yusuke Inoue, Tasavat Trisitichoke, Shintaro Ando, Daisuke Saito, Nobuaki Minematsu:
Native Listeners' Shadowing of Non-native Utterances as Spoken Annotation Representing Comprehensibility of the Utterances. SLaTE 2019: 43-47
- [c68] Hitoshi Suda, Daisuke Saito, Nobuaki Minematsu:
Voice Conversion without Explicit Separation of Source and Filter Components Based on Non-negative Matrix Factorization. SSW 2019: 69-74
- [c67] Gaku Kotani, Daisuke Saito:
Voice conversion based on full-covariance mixture density networks for time-variant linear transformations. SSW 2019: 75-80
- [c66] Yuma Shirahata, Daisuke Saito, Nobuaki Minematsu:
Generative Modeling of F0 Contours Leveraged by Phrase Structure and Its Application to Statistical Focus Control. SSW 2019: 228-233
- [c65] Daisuke Saito, Hironori Washizaki, Yoshiaki Fukazawa, Tetusya Yoshida, Isumu Kaneko, Hirotaka Kamo:
Learning Effects in Programming Learning Using Python and Raspberry Pi: Case Study with Elementary School Students. TALE 2019: 1-8
- [i4] Yasuhiro Watanabe, Hironori Washizaki, Kazunori Sakamoto, Daisuke Saito, Kiyoshi Honda, Naohiko Tsuda, Yoshiaki Fukazawa, Nobukazu Yoshioka:
Preliminary Systematic Literature Review of Machine Learning System Development Process. CoRR abs/1910.05528 (2019)
- 2018
- [j6] Yi Zhao, Shinji Takaki, Hieu-Thi Luong, Junichi Yamagishi, Daisuke Saito, Nobuaki Minematsu:
Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder. IEEE Access 6: 60478-60488 (2018)
- [c64] Hitoshi Suda, Gaku Kotani, Shinnosuke Takamichi, Daisuke Saito:
A Revisit to Feature Handling for High-quality Voice Conversion Based on Gaussian Mixture Model. APSIPA 2018: 816-822
- [c63] Kenta Nezu, Yuki Sato, Mitsuru Shinagawa, Daisuke Saito, Ken Seo, Kyoji Oohashi:
Analysis of Unintentional Signal Propagation in Intra-Body Communication. GCCE 2018: 101-104
- [c62] Kenta Nezu, Rikuma Ashizawa, Mitsuru Shinagawa, Daisuke Saito, Ken Seo, Kyoji Oohashi:
Analysis of Transient Signal Due to Person Movement in Gate System Using Intra-Body Communication. ICST 2018: 363-366
- [c61] Yasuhito Ohsugi, Daisuke Saito, Nobuaki Minematsu:
A Comparative Study of Statistical Conversion of Face to Voice Based on Their Subjective Impressions. INTERSPEECH 2018: 1001-1005
- [c60] Yusuke Inoue, Suguru Kabashima, Daisuke Saito, Nobuaki Minematsu, Kumi Kanamura, Yutaka Yamauchi:
A Study of Objective Measurement of Comprehensibility through Native Speakers' Shadowing of Learners' Utterances. INTERSPEECH 2018: 1651-1655
- [c59] Tomi Kinnunen, Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Zhen-Hua Ling:
A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment. Odyssey 2018: 187-194
- [c58] Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Tomi Kinnunen, Zhen-Hua Ling:
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods. Odyssey 2018: 195-202
- [c57] Suguru Kabashima, Yusuke Inoue, Daisuke Saito, Nobuaki Minematsu:
DNN-Based Scoring of Language Learners' Proficiency Using Learners' Shadowings and Native Listeners' Responsive Shadowings. SLT 2018: 971-978
- [c56] Yutaro Toyoshima, Yoshiki Matsui, Ryota Kato, Kenta Nezu, Mitsuru Shinagawa, Daisuke Saito, Ken Seo, Kyoji Oohashi:
Noise Reduction Method for Intra-Body Communication by Using Compensation Electrode. TENCON 2018: 1733-1736
- [i3] Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Tomi Kinnunen, Zhen-Hua Ling:
The Voice Conversion Challenge 2018: Promoting Development of Parallel and Nonparallel Methods. CoRR abs/1804.04262 (2018)
- [i2] Tomi Kinnunen, Jaime Lorenzo-Trueba, Junichi Yamagishi, Tomoki Toda, Daisuke Saito, Fernando Villavicencio, Zhen-Hua Ling:
A Spoofing Benchmark for the 2018 Voice Conversion Challenge: Leveraging from Spoofing Countermeasures for Speech Artifact Assessment. CoRR abs/1804.08438 (2018)
- [i1] Yi Zhao, Shinji Takaki, Hieu-Thi Luong, Junichi Yamagishi, Daisuke Saito, Nobuaki Minematsu:
Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder. CoRR abs/1807.11679 (2018)
- 2017
- [j5] Daisuke Saito, Hironori Washizaki, Yoshiaki Fukazawa:
Comparison of Text-Based and Visual-Based Programming Input Methods for First-Time Learners. J. Inf. Technol. Educ. Res. 16: 209-226 (2017)
- [c55] Gaku Kotani, Daisuke Saito, Nobuaki Minematsu:
Voice conversion based on deep neural networks for time-variant linear transformations. APSIPA 2017: 1259-1262
- [c54] Shinnosuke Takamichi, Daisuke Saito, Hiroshi Saruwatari, Nobuaki Minematsu:
The UTokyo speech synthesis system for Blizzard Challenge 2017. Blizzard Challenge 2017
- [c53] Shohei Toyama, Daisuke Saito, Nobuaki Minematsu:
Use of Global and Acoustic Features Associated with Contextual Factors to Adapt Language Models for Spontaneous Speech Recognition. INTERSPEECH 2017: 543-547
- [c52] Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu:
Acoustic-to-Articulatory Mapping Based on Mixture of Probabilistic Canonical Correlation Analysis. INTERSPEECH 2017: 989-993
- [c51] Tetsuya Hashimoto, Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu:
Parallel-Data-Free Many-to-Many Voice Conversion Based on DNN Integrated with Eigenspace Using a Non-Parallel Speech Corpus. INTERSPEECH 2017: 1278-1282
- [c50] Junwei Yue, Fumiya Shiozawa, Shohei Toyama, Yutaka Yamauchi, Kayoko Ito, Daisuke Saito, Nobuaki Minematsu:
Automatic Scoring of Shadowing Speech Based on DNN Posteriors and Their DTW. INTERSPEECH 2017: 1422-1426
- [c49] Nobuaki Minematsu, Daisuke Saito:
New Features and Effectiveness of Suzuki-kun, the First and Only Prosodic Reading Tutor of Tokyo Japanese. SLaTE 2017: 188
- [c48] Junwei Yue, Daisuke Saito, Nobuaki Minematsu, Yutaka Yamauchi, Kayoko Ito:
Development and Maintenance of Practical and In-service Systems for Recording Shadowing Utterances and Their Assessment. SLaTE 2017: 189
- [c47] Daisuke Saito, Ayana Sasaki, Hironori Washizaki, Yoshiaki Fukazawa, Yusuke Muto:
Quantitative learning effect evaluation of programming learning tools. TALE 2017: 209-216
- 2016
- [j4] Zhizheng Wu, Phillip L. De Leon, Cenk Demiroglu, Ali Khodabakhsh, Simon King, Zhen-Hua Ling, Daisuke Saito, Bryan Stewart, Tomoki Toda, Mirjam Wester, Junichi Yamagishi:
Anti-Spoofing for Text-Independent Speaker Verification: An Initial Database, Comparison of Countermeasures, and Human Performance. IEEE ACM Trans. Audio Speech Lang. Process. 24(4): 768-783 (2016)
- [c46] Tetsuya Hashimoto, Daisuke Saito, Nobuaki Minematsu:
Arbitrary speaker conversion based on speaker space bases constructed by deep neural networks. APSIPA 2016: 1-4
- [c45] Yi Zhao, Xiu You, Daisuke Saito, Nobuaki Minematsu:
The UTokyo System for Blizzard Challenge 2016. Blizzard Challenge 2016
- [c44] Yosuke Kashiwagi, Congying Zhang, Daisuke Saito, Nobuaki Minematsu:
Divergence estimation based on deep neural networks and its use for language identification. ICASSP 2016: 5435-5439
- [c43] Yi Yang, Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu:
Voice Conversion Based on Matrix Variate Gaussian Mixture Model Using Multiple Frame Features. INTERSPEECH 2016: 302-306
- [c42] Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu:
Prediction of the Articulatory Movements of Unseen Phonemes of a Speaker Using the Speech Structure of Another Speaker. INTERSPEECH 2016: 450-454
- [c41] Tomoki Toda, Ling-Hui Chen, Daisuke Saito, Fernando Villavicencio, Mirjam Wester, Zhizheng Wu, Junichi Yamagishi:
The Voice Conversion Challenge 2016. INTERSPEECH 2016: 1632-1636
- [c40] Yi Zhao, Daisuke Saito, Nobuaki Minematsu:
Speaker Representations for Speaker Adaptation in Multiple Speakers' BLSTM-RNN-Based Speech Synthesis. INTERSPEECH 2016: 2268-2272
- [c39] Shuju Shi, Yosuke Kashiwagi, Shohei Toyama, Junwei Yue, Yutaka Yamauchi, Daisuke Saito, Nobuaki Minematsu:
Automatic Assessment and Error Detection of Shadowing Speech: Case of English Spoken by Japanese Learners. INTERSPEECH 2016: 3142-3146
- [c38] Daisuke Saito, Hironori Washizaki, Yoshiaki Fukazawa:
Influence of the Programming Environment on Programming Education. ITiCSE 2016: 354
- [c37] Fumiya Shiozawa, Daisuke Saito, Nobuaki Minematsu:
Improved prediction of the accent gap between speakers of English for individual-based clustering of World Englishes. SLT 2016: 129-135
- [c36] Nobuaki Minematsu, Daisuke Saito, Nobuyuki Nishizawa:
Prosodic Reading Tutor of Japanese, Suzuki-kun: The first and only educational tool to teach the formal Japanese. SSW 2016: 122
- 2015
- [c35] Zhizheng Wu, Ali Khodabakhsh, Cenk Demiroglu, Junichi Yamagishi, Daisuke Saito, Tomoki Toda, Simon King:
SAS: A speaker verification spoofing database containing diverse attacks. ICASSP 2015: 4440-4444
- [c34] Tianze Shi, Shun Kasahara, Teeraphon Pongkittiphan, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose:
A measure of phonetic similarity to quantify pronunciation variation by using ASR technology. ICPhS 2015
- [c33] Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Statistical acoustic-to-articulatory mapping unified with speaker normalization based on voice conversion. INTERSPEECH 2015: 588-592
- [c32] Yuichi Sato, Yosuke Kashiwagi, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose:
Noise-robust and stress-free visualization of pronunciation diversity of World Englishes using a learner's self-centered viewpoint. O-COCOSDA/CASLRE 2015: 1-6
- [c31] Teeraphon Pongkittiphan, Nobuaki Minematsu, Takehiko Makino, Daisuke Saito, Keikichi Hirose:
Automatic prediction of intelligibility of English words spoken with Japanese accents - comparative study of features and models used for prediction. SLaTE 2015: 19-22
- [c30] Nobuaki Minematsu, Hiroya Hashimoto, Hiroko Hirano, Daisuke Saito:
Development of a prosodic reading tutor of Japanese - effective use of TTS and F0 contour modeling techniques for CALL. SLaTE 2015: 189
- [c29] Daisuke Saito, Hironori Washizaki, Yoshiaki Fukazawa:
Work in progress: A comparison of programming way: Illustration-based programming and text-based programming. TALE 2015: 220-223
- 2014
- [c28] Daisuke Saito, Toshiyuki Murakami:
A turning control of electric wheeled walker device by PSD camera information. AMC 2014: 616-620
- [c27] Yi Luan, Daisuke Saito, Yosuke Kashiwagi, Nobuaki Minematsu, Keikichi Hirose:
Semi-supervised noise dictionary adaptation for exemplar-based noise robust speech recognition. ICASSP 2014: 1745-1748
- [c26] Shun Kasahara, S. Kitahara, Nobuaki Minematsu, Han-Ping Shen, Takehiko Makino, Daisuke Saito, K. Hiorse:
Improved and robust prediction of pronunciation distance for individual-basis clustering of World Englishes pronunciation. ICASSP 2014: 3216-3220
- [c25] Daisuke Saito, Hidenobu Doi, Nobuaki Minematsu, Keikichi Hirose:
Application of matrix variate Gaussian mixture model to statistical voice conversion. INTERSPEECH 2014: 2504-2508
- [c24] Daisuke Saito, Akira Takebayashi, Tsuneo Yamaura:
Minecraft-based preparatory training for software development project. IPCC 2014: 1-9
- [c23] Yuji Kawase, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose:
Visualization of pronunciation diversity of world Englishes from a speaker's self-centered viewpoint. O-COCOSDA 2014: 1-5
- [c22] Nobuaki Minematsu, Shun Kasahara, Takehiko Makino, Daisuke Saito, Keikichi Hirose:
Speaker-basis Accent Clustering Using Invariant Structure Analysis and the Speech Accent Archive. Odyssey 2014: 158-165
- 2013
- [j3] Daisuke Saito, Tsuneo Yamaura:
A New Approach to Programming Language Education for Beginners with Top-Down Learning. Int. J. Eng. Pedagog. 3(S4): 16-21 (2013)
- [c21] Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition. ASRU 2013: 350-355
- [c20] Yinghui Zhou, Daisuke Saito, Lei Jing:
Adaptive template adjustment for personalized gesture recognition based on a finger-worn device. iCAST/UMEDIA 2013: 610-614
- [c19] Tatsuma Ishihara, Hirokazu Kameoka, Kota Yoshizato, Daisuke Saito, Shigeki Sagayama:
Probabilistic speech F0 contour model incorporating statistical vocabulary model of phrase-accent command sequence. INTERSPEECH 2013: 1017-1021
- [c18] Nobukatsu Hojo, Kota Yoshizato, Hirokazu Kameoka, Daisuke Saito, Shigeki Sagayama:
Text-to-speech synthesizer based on combination of composite wavelet and hidden Markov models. SSW 2013: 129-134
- 2012
- [j2] Daisuke Saito, Shinji Watanabe, Atsushi Nakamura, Nobuaki Minematsu:
Statistical Voice Conversion Based on Noisy Channel Model. IEEE Trans. Speech Audio Process. 20(6): 1784-1794 (2012)
- [c17] Masao Inui, Masaru Kato, Keitaro Inomata, Machiko Sato, Yoshihiko Azuma, Daisuke Saito, Tota Mizuno, Takao Kikuchi, Sakuji Yoshimura:
Correcting for non-uniform illumination when photographing the mural in the royal tomb of Amenophis III (III) Correcting mural images. CGIV 2012: 92-96
- [c16] Miquel Espi, Masakiyo Fujimoto, Daisuke Saito, Nobutaka Ono, Shigeki Sagayama:
A tandem connectionist model using combination of multi-scale spectro-temporal features for acoustic event detection. ICASSP 2012: 4293-4296
- [c15] Satoru Fukayama, Daisuke Saito, Shigeki Sagayama:
Assistance for Novice Users on Creating Songs from Japanese Lyrics. ICMC 2012
- [c14] Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Effects of Speaker Adaptive Training on Tensor-based Arbitrary Speaker Conversion. INTERSPEECH 2012: 98-101
- [c13] Kota Yoshizato, Hirokazu Kameoka, Daisuke Saito, Shigeki Sagayama:
Hidden Markov Convolutive Mixture Model for Pitch Contour Analysis of Speech. INTERSPEECH 2012: 390-393
- 2011
- [c12] Daisuke Saito, Shinji Watanabe, Atsushi Nakamura, Nobuaki Minematsu:
High accurate model-integration-based voice conversion using dynamic features and model structure optimization. ICASSP 2011: 4576-4579
- [c11] Daisuke Saito, Keisuke Yamamoto, Nobuaki Minematsu, Keikichi Hirose:
One-to-Many Voice Conversion Based on Tensor Representation of Speaker Space. INTERSPEECH 2011: 653-656
- [c10] Keikichi Hirose, Keiko Ochi, Ryusuke Mihara, Hiroya Hashimoto, Daisuke Saito, Nobuaki Minematsu:
Adaptation of Prosody in Speech Synthesis by Changing Command Values of the Generation Process Model of Fundamental Frequency. INTERSPEECH 2011: 2793-2796
- [c9] Aki Kunikoshi, Yu Qiao, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Gesture Design of Hand-to-Speech Converter Derived from Speech-to-Hand Converter Based on Probabilistic Integration Model. INTERSPEECH 2011: 3025-3028
- 2010
- [c8] Yu Qiao, Daisuke Saito, Nobuaki Minematsu:
HMM-based sequence-to-frame mapping for voice conversion. ICASSP 2010: 4830-4833
- [c7] Daisuke Saito, Shinji Watanabe, Atsushi Nakamura, Nobuaki Minematsu:
Probabilistic integration of joint density model and speaker model for voice conversion. INTERSPEECH 2010: 1728-1731
- [c6] Miaomiao Wang, Miaomiao Wen, Daisuke Saito, Keikichi Hirose, Nobuaki Minematsu:
Improved generation of prosodic features in HMM-based Mandarin speech synthesis. SSW 2010: 359-364
2000 – 2009
- 2009
- [j1] Kenji Imadera, Yasuaki Kishimoto, Daisuke Saito, Jiquan Li, Takayuki Utsumi:
A numerical method for solving the Vlasov-Poisson equation based on the conservative IDO scheme. J. Comput. Phys. 228(23): 8919-8943 (2009)
- [c5] Daisuke Saito, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
Optimal event search using a structural cost function - improvement of structure to speech conversion. INTERSPEECH 2009: 2047-2050
- 2008
- [c4] Daisuke Saito, Ryo Matsuura, Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose:
Directional dependency of cepstrum on vocal tract length. ICASSP 2008: 4485-4488
- [c3] Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Decomposition of rotational distortion caused by VTL difference using eigenvalues of its transformation matrix. INTERSPEECH 2008: 1361-1364
- [c2] Daisuke Saito, Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose:
Structure to speech conversion - speech generation based on infant-like vocal imitation. INTERSPEECH 2008: 1837-1840
- 2006
- [c1] Daisuke Saito, Keiichi Saito, Kazuhiro Notomi, Masao Saito:
The effect of Age on Web-safe Color Visibility for a White Background. EMBC 2006: 5145-5148
last updated on 2024-11-08 21:33 CET by the dblp team
all metadata released as open data under CC0 1.0 license