default search action
Arnab Ghoshal
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Journal Articles
- 2020
- [j6]Roger Hsiao, Dogan Can, Tim Ng, Ruchir Travadi, Arnab Ghoshal:
Online Automatic Speech Recognition With Listen, Attend and Spell Model. IEEE Signal Process. Lett. 27: 1889-1893 (2020) - 2014
- [j5]Pawel Swietojanski, Arnab Ghoshal, Steve Renals:
Convolutional Neural Networks for Distant Speech Recognition. IEEE Signal Process. Lett. 21(9): 1120-1124 (2014) - [j4]Liang Lu, Arnab Ghoshal, Steve Renals:
Cross-Lingual Subspace Gaussian Mixture Models for Low-Resource Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(1): 17-27 (2014) - 2013
- [j3]Liang Lu, K. K. Chin, Arnab Ghoshal, Stephen Renals:
Joint Uncertainty Decoding for Noise Robust Subspace Gaussian Mixture Models. IEEE Trans. Speech Audio Process. 21(9): 1791-1804 (2013) - 2011
- [j2]Daniel Povey, Lukás Burget, Mohit Agarwal, Pinar Akyazi, Kai Feng, Arnab Ghoshal, Ondrej Glembek, Nagendra K. Goel, Martin Karafiát, Ariya Rastrow, Richard C. Rose, Petr Schwarz, Samuel Thomas:
The subspace Gaussian mixture model - A structured model for speech recognition. Comput. Speech Lang. 25(2): 404-439 (2011) - [j1]Liang Lu, Arnab Ghoshal, Steve Renals:
Regularized Subspace Gaussian Mixture Models for Speech Recognition. IEEE Signal Process. Lett. 18(7): 419-422 (2011)
Conference and Workshop Papers
- 2023
- [c32]Pawel Swietojanski, Stefan Braun, Dogan Can, Thiago Fraga da Silva, Arnab Ghoshal, Takaaki Hori, Roger Hsiao, Henry Mason, Erik McDermott, Honza Silovsky, Ruchir Travadi, Xiaodan Zhuang:
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition. ICASSP 2023: 1-5 - 2022
- [c31]Liuhui Deng, Roger Hsiao, Arnab Ghoshal:
Bilingual End-to-End ASR with Byte-Level Subwords. ICASSP 2022: 6417-6421 - 2020
- [c30]Andrew Titus, Jan Silovský, Nanxin Chen, Roger Hsiao, Mary Young, Arnab Ghoshal:
Improving Language Identification for Multilingual Speakers. ICASSP 2020: 8284-8288 - 2017
- [c29]Xiaodan Zhuang, Arnab Ghoshal, Antti-Veikko Rosti, Matthias Paulik, Daben Liu:
Improving DNN Bluetooth Narrowband Acoustic Models by Cross-Bandwidth and Cross-Lingual Initialization. INTERSPEECH 2017: 2148-2152 - 2014
- [c28]Matthew P. Aylett, Rasmus Dall, Arnab Ghoshal, Gustav Eje Henter, Thomas Merritt:
A flexible front-end for HTS. INTERSPEECH 2014: 1283-1287 - 2013
- [c27]Pawel Swietojanski, Arnab Ghoshal, Steve Renals:
Hybrid acoustic models for distant and multichannel large vocabulary speech recognition. ASRU 2013: 285-290 - [c26]Liang Lu, Arnab Ghoshal, Steve Renals:
Acoustic data-driven pronunciation lexicon for large vocabulary speech recognition. ASRU 2013: 374-379 - [c25]Pawel Swietojanski, Arnab Ghoshal, Steve Renals:
Revisiting hybrid and GMM-HMM system combination techniques. ICASSP 2013: 6744-6748 - [c24]Arnab Ghoshal, Pawel Swietojanski, Steve Renals:
Multilingual training of deep neural networks. ICASSP 2013: 7319-7323 - [c23]Karel Veselý, Arnab Ghoshal, Lukás Burget, Daniel Povey:
Sequence-discriminative training of deep neural networks. INTERSPEECH 2013: 2345-2349 - [c22]Liang Lu, Arnab Ghoshal, Steve Renals:
Noise adaptive training for subspace Gaussian mixture models. INTERSPEECH 2013: 3492-3496 - 2012
- [c21]Daniel Povey, Mirko Hannemann, Gilles Boulianne, Lukás Burget, Arnab Ghoshal, Milos Janda, Martin Karafiát, Stefan Kombrink, Petr Motlícek, Yanmin Qian, Korbinian Riedhammer, Karel Veselý, Ngoc Thang Vu:
Generating exact lattices in the WFST framework. ICASSP 2012: 4213-4216 - [c20]Korbinian Riedhammer, Tobias Bocklet, Arnab Ghoshal, Daniel Povey:
Revisiting semi-continuous hidden Markov models. ICASSP 2012: 4721-4724 - [c19]Liang Lu, Arnab Ghoshal, Steve Renals:
Maximum a posteriori adaptation of subspace Gaussian mixture models for cross-lingual speech recognition. ICASSP 2012: 4877-4880 - [c18]Liang Lu, Arnab Ghoshal, Steve Renals:
Joint uncertainty decoding with unscented transform for noise robust subspace Gaussian mixture models. SAPA@INTERSPEECH 2012: 40-45 - [c17]Liang Lu, K. K. Chin, Arnab Ghoshal, Steve Renals:
Noise Compensation for Subspace Gaussian Mixture Models. INTERSPEECH 2012: 306-309 - [c16]Eva Hasler, Peter Bell, Arnab Ghoshal, Barry Haddow, Philipp Koehn, Fergus McInnes, Steve Renals, Pawel Swietojanski:
The UEDIN systems for the IWSLT 2012 evaluation. IWSLT 2012: 46-53 - [c15]Pawel Swietojanski, Arnab Ghoshal, Steve Renals:
Unsupervised cross-lingual knowledge transfer in DNN-based LVCSR. SLT 2012: 246-251 - 2011
- [c14]Liang Lu, Arnab Ghoshal, Steve Renals:
Regularized subspace Gaussian mixture models for cross-lingual speech recognition. ASRU 2011: 365-370 - [c13]Daniel Povey, Martin Karafiát, Arnab Ghoshal, Petr Schwarz:
A symmetrization of the Subspace Gaussian Mixture Model. ICASSP 2011: 4504-4507 - 2010
- [c12]Ken'ichi Kumatani, Liang Lu, John W. McDonough, Arnab Ghoshal, Dietrich Klakow:
Maximum negentropy beamforming with superdirectivity. EUSIPCO 2010: 2067-2071 - [c11]Arnab Ghoshal, Daniel Povey, Mohit Agarwal, Pinar Akyazi, Lukás Burget, Kai Feng, Ondrej Glembek, Nagendra Goel, Martin Karafiát, Ariya Rastrow, Richard C. Rose, Petr Schwarz, Samuel Thomas:
A novel estimation of feature-space MLLR for full-covariance models. ICASSP 2010: 4310-4313 - [c10]Daniel Povey, Lukás Burget, Mohit Agarwal, Pinar Akyazi, Kai Feng, Arnab Ghoshal, Ondrej Glembek, Nagendra K. Goel, Martin Karafiát, Ariya Rastrow, Richard C. Rose, Petr Schwarz, Samuel Thomas:
Subspace Gaussian Mixture Models for speech recognition. ICASSP 2010: 4330-4333 - [c9]Lukás Burget, Petr Schwarz, Mohit Agarwal, Pinar Akyazi, Kai Feng, Arnab Ghoshal, Ondrej Glembek, Nagendra K. Goel, Martin Karafiát, Daniel Povey, Ariya Rastrow, Richard C. Rose, Samuel Thomas:
Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models. ICASSP 2010: 4334-4337 - [c8]Nagendra Goel, Samuel Thomas, Mohit Agarwal, Pinar Akyazi, Lukás Burget, Kai Feng, Arnab Ghoshal, Ondrej Glembek, Martin Karafiát, Daniel Povey, Ariya Rastrow, Richard C. Rose, Petr Schwarz:
Approaches to automatic lexicon learning with limited training examples. ICASSP 2010: 5094-5097 - 2009
- [c7]Arnab Ghoshal, Sanjeev Khudanpur, Dietrich Klakow:
Impact of novel sources on content-based image and video retrieval. ICASSP 2009: 1937-1940 - [c6]Arnab Ghoshal, Martin Jansche, Sanjeev Khudanpur, Michael Riley, Morgan Ulinski:
WEB-derived pronunciations. ICASSP 2009: 4289-4292 - [c5]Dogan Can, Erica Cooper, Arnab Ghoshal, Martin Jansche, Sanjeev Khudanpur, Bhuvana Ramabhadran, Michael Riley, Murat Saraclar, Abhinav Sethy, Morgan Ulinski, Christopher M. White:
Web derived pronunciations for spoken term detection. SIGIR 2009: 83-90 - 2006
- [c4]Arnab Ghoshal, Sanjeev Khudanpur:
Source Adaptation for Improved Content-Based Video Retrieval. ICASSP (2) 2006: 133-136 - [c3]Arnab Ghoshal, Sanjeev Khudanpur, João Magalhães, Simon E. Overell, Stefan M. Rüger, Alexei Yavlinsky:
Imperial College and Johns Hopkins University at TRECVID. TRECVID 2006 - 2005
- [c2]Arnab Ghoshal, Pavel Ircing, Sanjeev Khudanpur:
Hidden Markov models for automatic annotation and content-based retrieval of images and video. SIGIR 2005: 544-551 - [c1]Brock Pytlik, Arnab Ghoshal, Damianos G. Karakos, Sanjeev Khudanpur:
TRECVID 2005 Experiment at Johns Hopkins University: Using Hidden Markov Models for Video Retrieval. TRECVID 2005
Informal and Other Publications
- 2022
- [i5]Liuhui Deng, Roger Hsiao, Arnab Ghoshal:
Bilingual End-to-End ASR with Byte-Level Subwords. CoRR abs/2205.00485 (2022) - [i4]Thien Nguyen, Nathalie Tran, Liuhui Deng, Thiago Fraga da Silva, Matthew Radzihovsky, Roger Hsiao, Henry Mason, Stefan Braun, Erik McDermott, Dogan Can, Pawel Swietojanski, Lyan Verwimp, Sibel Oyman, Tresi Arvizo, Honza Silovsky, Arnab Ghoshal, Mathieu Martel, Bharat Ram Ambati, Mohamed Ali:
Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation. CoRR abs/2210.12214 (2022) - [i3]Pawel Swietojanski, Stefan Braun, Dogan Can, Thiago Fraga da Silva, Arnab Ghoshal, Takaaki Hori, Roger Hsiao, Henry Mason, Erik McDermott, Honza Silovsky, Ruchir Travadi, Xiaodan Zhuang:
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition. CoRR abs/2211.01438 (2022) - 2020
- [i2]Andrew Titus, Jan Silovský, Nanxin Chen, Roger Hsiao, Mary Young, Arnab Ghoshal:
Improving Language Identification for Multilingual Speakers. CoRR abs/2001.11019 (2020) - [i1]Roger Hsiao, Dogan Can, Tim Ng, Ruchir Travadi, Arnab Ghoshal:
Online Automatic Speech Recognition with Listen, Attend and Spell Model. CoRR abs/2008.05514 (2020)
Coauthor Index
aka: Stephen Renals
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:25 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint