Roger Hsiao
2020 – today
- 2024
- [j5] Inhee Lee, Roger Hsiao, Gordy Carichner, Chin-Wei Hsu, Mingyu Yang, Sara Shoouri, Katherine Ernst, Tess Carichner, Yuyang Li, Jaechan Lim, Cole R. Julick, Eunseong Moon, Yi Sun, Jamie Phillips, Kristi L. Montooth, Delbert A. Green II, Hun-Seok Kim, David T. Blaauw:
mSAIL: Milligram-Scale Multi-Modal Sensor Platform for Monarch Butterfly Migration Tracking. Commun. ACM 67(6): 93-101 (2024)
- [i9] Roger Hsiao, Liuhui Deng, Erik McDermott, Ruchir Travadi, Xiaodan Zhuang:
Optimizing Byte-level Representation for End-to-end ASR. CoRR abs/2406.09676 (2024)
- 2023
- [c42] Stefan Braun, Erik McDermott, Roger Hsiao:
Neural Transducer Training: Reduced Memory Consumption with Sample-Wise Computation. ICASSP 2023: 1-5
- [c41] Pawel Swietojanski, Stefan Braun, Dogan Can, Thiago Fraga da Silva, Arnab Ghoshal, Takaaki Hori, Roger Hsiao, Henry Mason, Erik McDermott, Honza Silovsky, Ruchir Travadi, Xiaodan Zhuang:
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition. ICASSP 2023: 1-5
- [i8] Jan Silovský, Liuhui Deng, Arturo Argueta, Tresi Arvizo, Roger Hsiao, Sasha Kuznietsov, Yiu-Chang Lin, Xiaoqiang Xiao, Yuanyuan Zhang:
Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers. CoRR abs/2305.13652 (2023)
- 2022
- [j4] Inhee Lee, Roger Hsiao, Gordy Carichner, Chin-Wei Hsu, Mingyu Yang, Sara Shoouri, Katherine Ernst, Tess Carichner, Yuyang Li, Jaechan Lim, Cole R. Julick, Eunseong Moon, Yi Sun, Jamie Phillips, Kristi L. Montooth, Delbert A. Green II, Hun-Seok Kim, David T. Blaauw:
Tracking the Migration of the Monarch Butterflies with the World's Smallest Computer. GetMobile Mob. Comput. Commun. 26(1): 25-29 (2022)
- [c40] Liuhui Deng, Roger Hsiao, Arnab Ghoshal:
Bilingual End-to-End ASR with Byte-Level Subwords. ICASSP 2022: 6417-6421
- [i7] Liuhui Deng, Roger Hsiao, Arnab Ghoshal:
Bilingual End-to-End ASR with Byte-Level Subwords. CoRR abs/2205.00485 (2022)
- [i6] Thien Nguyen, Nathalie Tran, Liuhui Deng, Thiago Fraga da Silva, Matthew Radzihovsky, Roger Hsiao, Henry Mason, Stefan Braun, Erik McDermott, Dogan Can, Pawel Swietojanski, Lyan Verwimp, Sibel Oyman, Tresi Arvizo, Honza Silovsky, Arnab Ghoshal, Mathieu Martel, Bharat Ram Ambati, Mohamed Ali:
Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation. CoRR abs/2210.12214 (2022)
- [i5] Pawel Swietojanski, Stefan Braun, Dogan Can, Thiago Fraga da Silva, Arnab Ghoshal, Takaaki Hori, Roger Hsiao, Henry Mason, Erik McDermott, Honza Silovsky, Ruchir Travadi, Xiaodan Zhuang:
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition. CoRR abs/2211.01438 (2022)
- [i4] Stefan Braun, Erik McDermott, Roger Hsiao:
Neural Transducer Training: Reduced Memory Consumption with Sample-wise Computation. CoRR abs/2211.16270 (2022)
- 2021
- [c39] Inhee Lee, Roger Hsiao, Gordy Carichner, Chin-Wei Hsu, Mingyu Yang, Sara Shoouri, Katherine Ernst, Tess Carichner, Yuyang Li, Jaechan Lim, Cole R. Julick, Eunseong Moon, Yi Sun, Jamie Phillips, Kristi L. Montooth, Delbert A. Green II, Hun-Seok Kim, David T. Blaauw:
mSAIL: milligram-scale multi-modal sensor platform for monarch butterfly migration tracking. MobiCom 2021: 517-530
- 2020
- [j3] Roger Hsiao, Dogan Can, Tim Ng, Ruchir Travadi, Arnab Ghoshal:
Online Automatic Speech Recognition With Listen, Attend and Spell Model. IEEE Signal Process. Lett. 27: 1889-1893 (2020)
- [c38] Mingyu Yang, Roger Hsiao, Gordy Carichner, Katherine Ernst, Jaechan Lim, Delbert A. Green II, Inhee Lee, David T. Blaauw, Hun-Seok Kim:
Migrating Monarch Butterfly Localization Using Multi-Modal Sensor Fusion Neural Networks. EUSIPCO 2020: 1792-1796
- [c37] Andrew Titus, Jan Silovský, Nanxin Chen, Roger Hsiao, Mary Young, Arnab Ghoshal:
Improving Language Identification for Multilingual Speakers. ICASSP 2020: 8284-8288
- [i3] Andrew Titus, Jan Silovský, Nanxin Chen, Roger Hsiao, Mary Young, Arnab Ghoshal:
Improving Language Identification for Multilingual Speakers. CoRR abs/2001.11019 (2020)
- [i2] Roger Hsiao, Dogan Can, Tim Ng, Ruchir Travadi, Arnab Ghoshal:
Online Automatic Speech Recognition with Listen, Attend and Spell Model. CoRR abs/2008.05514 (2020)
2010 – 2019
- 2019
- [i1] Mingyu Yang, Roger Hsiao, Gordy Carichner, Katherine Ernst, Jaechan Lim, Delbert A. Green II, Inhee Lee, David T. Blaauw, Hun-Seok Kim:
Migrating Monarch Butterfly Localization Using Multi-Sensor Fusion Neural Networks. CoRR abs/1912.06907 (2019)
- 2017
- [c36] Roger Hsiao, Tim Ng, Man-Hung Siu:
Unsupervised adaptation for deep neural networks using Alternating Direction Method of Multipliers. ICASSP 2017: 5180-5184
- [c35] William Hartmann, Roger Hsiao, Stavros Tsakalidis:
Alternative networks for monolingual bottleneck features. ICASSP 2017: 5290-5294
- [c34] Tanel Alumäe, Damianos G. Karakos, William Hartmann, Roger Hsiao, Le Zhang, Long Nguyen, Stavros Tsakalidis, Richard M. Schwartz:
The 2016 BBN Georgian telephone speech keyword spotting system. ICASSP 2017: 5755-5759
- [c33] William Hartmann, Damianos G. Karakos, Roger Hsiao, Le Zhang, Tanel Alumäe, Stavros Tsakalidis, Richard M. Schwartz:
Analysis of keyword spotting performance across IARPA babel languages. ICASSP 2017: 5765-5769
- [c32] William Hartmann, Roger Hsiao, Tim Ng, Jeff Z. Ma, Francis Keith, Man-Hung Siu:
Improved Single System Conversational Telephone Speech Recognition with VGG Bottleneck Features. INTERSPEECH 2017: 112-116
- 2016
- [c31] William Hartmann, Le Zhang, Kerri Barnes, Roger Hsiao, Stavros Tsakalidis, Richard M. Schwartz:
Comparison of Multiple System Combination Techniques for Keyword Spotting. INTERSPEECH 2016: 1913-1917
- [c30] William Hartmann, Tim Ng, Roger Hsiao, Stavros Tsakalidis, Richard M. Schwartz:
Two-Stage Data Augmentation for Low-Resourced Speech Recognition. INTERSPEECH 2016: 2378-2382
- [c29] Roger Hsiao, Ralf Meermeier, Tim Ng, Zhongqiang Huang, Maxwell Jordan, Enoch Kan, Tanel Alumäe, Jan Silovský, William Hartmann, Francis Keith, Omer Lang, Man-Hung Siu, Owen Kimball:
Sage: The New BBN Speech Processing Platform. INTERSPEECH 2016: 3022-3026
- 2015
- [c28] Roger Hsiao, Jeff Z. Ma, William Hartmann, Martin Karafiát, Frantisek Grézl, Lukás Burget, Igor Szöke, Jan Cernocký, Shinji Watanabe, Zhuo Chen, Sri Harish Reddy Mallidi, Hynek Hermansky, Stavros Tsakalidis, Richard M. Schwartz:
Robust speech recognition in unknown reverberant and noisy conditions. ASRU 2015: 533-538
- [c27] Le Zhang, Damianos G. Karakos, William Hartmann, Roger Hsiao, Richard M. Schwartz, Stavros Tsakalidis:
Enhancing low resource keyword spotting with automatically retrieved web documents. INTERSPEECH 2015: 839-843
- [c26] Roger Hsiao, Tim Ng, Stavros Tsakalidis, Long Nguyen, Richard M. Schwartz:
Unsupervised adaptation for deep neural network using linear least square method. INTERSPEECH 2015: 2887-2891
- 2014
- [c25] Stavros Tsakalidis, Roger Hsiao, Damianos G. Karakos, Tim Ng, Shivesh Ranjan, Guruprasad Saikumar, Le Zhang, Long Nguyen, Richard M. Schwartz, John Makhoul:
The 2013 BBN Vietnamese telephone speech keyword spotting system. ICASSP 2014: 7829-7833
- [c24] Tim Ng, Roger Hsiao, Le Zhang, Damianos G. Karakos, Sri Harish Reddy Mallidi, Martin Karafiát, Karel Veselý, Igor Szöke, Bing Zhang, Long Nguyen, Richard M. Schwartz:
Progress in the BBN keyword search system for the DARPA RATS program. INTERSPEECH 2014: 959-963
- [c23] Roger Hsiao, Tim Ng, Le Zhang, Shivesh Ranjan, Stavros Tsakalidis, Long Nguyen, Richard M. Schwartz:
Improving semi-supervised deep neural network for keyword search in low resource languages. INTERSPEECH 2014: 1088-1091
- 2013
- [c22] Damianos G. Karakos, Richard M. Schwartz, Stavros Tsakalidis, Le Zhang, Shivesh Ranjan, Tim Ng, Roger Hsiao, Guruprasad Saikumar, Ivan Bulyko, Long Nguyen, John Makhoul, Frantisek Grézl, Mirko Hannemann, Martin Karafiát, Igor Szöke, Karel Veselý, Lori Lamel, Viet Bac Le:
Score normalization and system combination for improved keyword spotting. ASRU 2013: 210-215
- [c21] Roger Hsiao, Tim Ng, Frantisek Grézl, Damianos G. Karakos, Stavros Tsakalidis, Long Nguyen, Richard M. Schwartz:
Discriminative semi-supervised training for keyword search in low resource languages. ASRU 2013: 440-445
- 2012
- [c20] Roger Hsiao, Tanja Schultz:
Towards single pass discriminative training for speech recognition. ICASSP 2012: 4093-4096
- [c19] Stavros Tsakalidis, Xiaodan Zhuang, Roger Hsiao, Shuang Wu, Pradeep Natarajan, Rohit Prasad, Prem Natarajan:
Robust Event Detection From Spoken Content In Consumer Domain Videos. INTERSPEECH 2012: 2101-2104
- 2011
- [c18] Roger Hsiao, Tanja Schultz:
Generalized Baum-Welch Algorithm and its Implication to a New Extended Baum-Welch Algorithm. INTERSPEECH 2011: 773-776
- 2010
- [c17] Roger Hsiao, Florian Metze, Tanja Schultz:
Improvements to generalized discriminative feature transformation for speech recognition. INTERSPEECH 2010: 1361-1364
- [c16] Florian Metze, Roger Hsiao, Qin Jin, Udhyakumar Nallasamy, Tanja Schultz:
The 2010 CMU GALE speech-to-text system. INTERSPEECH 2010: 1501-1504
2000 – 2009
- 2009
- [c15] Hassan Al-Haj, Roger Hsiao, Ian R. Lane, Alan W. Black, Alex Waibel:
Pronunciation modeling for dialectal arabic speech recognition. ASRU 2009: 525-528
- [c14] Roger Hsiao, Yik-Cheung Tam, Tanja Schultz:
Generalized Baum-Welch algorithm for discriminative training on large vocabulary continuous speech recognition system. ICASSP 2009: 3769-3772
- [c13] Roger Hsiao, Tanja Schultz:
Generalized discriminative feature transformation for speech recognition. INTERSPEECH 2009: 664-667
- [c12] Nguyen Bach, Roger Hsiao, Matthias Eck, Paisarn Charoenpornsawat, Stephan Vogel, Tanja Schultz, Ian R. Lane, Alex Waibel, Alan W. Black:
Incremental Adaptation of Speech-to-Speech Translation. HLT-NAACL (Short Papers) 2009: 149-152
- 2008
- [c11] Roger Hsiao, Mark C. Fuhs, Yik-Cheung Tam, Qin Jin, Tanja Schultz:
The CMU-interACT 2008 Mandarin transcription system. INTERSPEECH 2008: 1445-1448
- 2007
- [j2] Brian Kan-Wing Mak, Roger Wend-Huu Hsiao:
Kernel Eigenspace-Based MLLR Adaptation. IEEE Trans. Speech Audio Process. 15(3): 784-795 (2007)
- [c10] Brian Kan-Wing Mak, Roger Wend-Huu Hsiao:
Robustness of several kernel-based fast adaptation methods on noisy LVCSR. INTERSPEECH 2007: 266-269
- 2006
- [j1] Brian Kan-Wing Mak, Roger Wend-Huu Hsiao, Simon Ka-Lung Ho, James T. Kwok:
Embedded kernel eigenvoice speaker adaptation and its implication to reference speaker weighting. IEEE Trans. Speech Audio Process. 14(4): 1267-1280 (2006)
- [c9] Brian Mak, Tsz-Chung Lai, Roger Wend-Huu Hsiao:
Improving Reference Speaker Weighting Adaptation by the Use of Maximum-Likelihood Reference Speakers. ICASSP (1) 2006: 229-232
- [c8] Man-Wai Mak, Roger Wend-Huu Hsiao, Brian Mak:
A Comparison of Various Adaptation Methods for Speaker Verification With Limited Enrollment Data. ICASSP (1) 2006: 929-932
- [c7] Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, Andreas Zollmann, Stephan Vogel, Alan W. Black, Tanja Schultz, Alex Waibel:
Optimizing components for handheld two-way speech translation for an English-iraqi Arabic system. INTERSPEECH 2006
- 2005
- [c6] Roger Wend-Huu Hsiao, Brian Kan-Wing Mak:
Kernel Eigenspace-based MLLR Adaptation Using Multiple Regression Classes. ICASSP (1) 2005: 985-988
- [c5] Roger Wend-Huu Hsiao, Brian Kan-Wing Mak:
A comparative study of two kernel eigenspace-based speaker adaptation methods on large vocabulary continuous speech recognition. INTERSPEECH 2005: 1797-1800
- 2004
- [c4] Roger Hsiao, Brian Mak:
Discriminative feature transformation by guided discriminative training. ICASSP (1) 2004: 897-900
- [c3] Brian Kan-Wing Mak, Roger Wend-Huu Hsiao:
Improving eigenspace-based MLLR adaptation by kernel PCA. INTERSPEECH 2004: 13-16
- 2003
- [c2] M. S. Lin, Ling Chen, J. Y. Lee, H. T. Liu, C. K. Chou, K. H. Wan, H. M. Chen, Kevin Chou, Roger Hsiao, Eric Lin:
A new IC interconnection scheme and design architecture for high performance ICs at very low fabrication cost - post passivation interconnection. CICC 2003: 533-536
- [c1] Brian Mak, Yik-Cheung Tam, Roger Hsiao:
Discriminative training of auditory filters of different shapes for robust speech recognition. ICASSP (2) 2003: 45-48
last updated on 2024-11-15 20:38 CET by the dblp team
all metadata released as open data under CC0 1.0 license