default search action
Vishwa Gupta
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c56]Vishwa Gupta, Gilles Boulianne:
Improvements in Language Modeling, Voice Activity Detection, and Lexicon in OpenASR21 Low Resource Languages. SPECOM (2) 2023: 73-86 - 2022
- [c55]Vishwa Gupta, Gilles Boulianne:
Progress in Multilingual Speech Recognition for Low Resource Languages Kurmanji Kurdish, Cree and Inuktut. LREC 2022: 6420-6428 - [c54]Vishwa Gupta, Gilles Boulianne:
CRIM's Speech Recognition System for OpenASR21 Evaluation with Conformer and Voice Activity Detector Embeddings. SPECOM 2022: 238-251 - 2020
- [c53]Roland Kuhn, Fineen Davis, Alain Désilets, Eric Joanis, Anna Kazantseva, Rebecca Knowles, Patrick Littell, Delaney Lothian, Aidan Pine, Caroline Running Wolf, Eddie Antonio Santos, Darlene A. Stewart, Gilles Boulianne, Vishwa Gupta, Brian Maracle Owennatékha, Akwiratékha' Martin, Christopher Cox, Marie-Odile Junker, Olivia Sammons, Delasie Torkornoo, Nathan Thanyehténhas Brinklow, Sara Child, Benoit Farley, David Huggins-Daines, Daisy Rosenblum, Heather Souter:
The Indigenous Languages Technology project at NRC Canada: An empowerment-oriented approach to developing language software. COLING 2020: 5866-5878 - [c52]Vishwa Gupta, Gilles Boulianne:
Automatic Transcription Challenges for Inuktitut, a Low-Resource Polysynthetic Language. LREC 2020: 2521-2527 - [c51]Vishwa Gupta, Gilles Boulianne:
Speech Transcription Challenges for Resource Constrained Indigenous Language Cree. SLTU-CCURL@LREC 2020: 362-367
2010 – 2019
- 2019
- [c50]Vishwa Gupta, Lise Rebout, Gilles Boulianne, Pierre André Ménard, Jahangir Alam:
CRIM's Speech Transcription and Call Sign Detection System for the ATC Airbus Challenge Task. INTERSPEECH 2019: 3018-3022 - 2018
- [c49]Vishwa Gupta, Gilles Boulianne:
CRIM's System for the MGB-3 English Multi-Genre Broadcast Media Transcription. INTERSPEECH 2018: 2653-2657 - [c48]Gautam Bhattacharya, Jahangir Alam, Vishwa Gupta, Patrick Kenny:
Deeply Fused Speaker Embeddings for Text-Independent Speaker Verification. INTERSPEECH 2018: 3588-3592 - 2017
- [c47]Chahid Ouali, Pierre Dumouchel, Vishwa Gupta:
Robust video fingerprints using positions of salient regions. ICASSP 2017: 3041-3045 - 2016
- [j9]Chahid Ouali, Pierre Dumouchel, Vishwa Gupta:
A spectrogram-based audio fingerprinting system for content-based copy detection. Multim. Tools Appl. 75(15): 9145-9165 (2016) - [j8]Chahid Ouali, Pierre Dumouchel, Vishwa Gupta:
Fast Audio Fingerprinting System Using GPU and a Clustering-Based Technique. IEEE ACM Trans. Audio Speech Lang. Process. 24(6): 1106-1118 (2016) - [c46]Md. Jahangir Alam, Patrick Kenny, Vishwa Gupta:
Tandem Features for Text-Dependent Speaker Verification on the RedDots Corpus. INTERSPEECH 2016: 420-424 - [c45]Patrick Kenny, Themos Stafylakis, Jahangir Alam, Vishwa Gupta, Marcel Kockmann:
Uncertainty Modeling Without Subspace Methods For Text-Dependent Speaker Recognition. Odyssey 2016: 16-23 - [c44]Md. Jahangir Alam, Patrick Kenny, Vishwa Gupta, Themos Stafylakis:
Spoofing Detection on the ASVspoof2015 Challenge Corpus Employing Deep Neural Networks. Odyssey 2016: 270-276 - [c43]Themos Stafylakis, Patrick Kenny, Vishwa Gupta, Jahangir Alam, Marcel Kockmann:
Compensation for phonetic nuisance variability in speaker recognition using DNNs. Odyssey 2016: 340-345 - [c42]Gautam Bhattacharya, Jahangir Alam, Patrick Kenny, Vishwa Gupta:
Modelling speaker and channel variability using deep neural networks for robust speaker verification. SLT 2016: 192-198 - 2015
- [j7]Md. Jahangir Alam, Vishwa Gupta, Patrick Kenny, Pierre Dumouchel:
Speech recognition in reverberant and noisy environments employing multiple feature extractors and i-vector speaker adaptation. EURASIP J. Adv. Signal Process. 2015: 50 (2015) - [c41]Vishwa Gupta, Paul Deléglise, Gilles Boulianne, Yannick Estève, Sylvain Meignier, Anthony Rousseau:
CRIM and LIUM approaches for multi-genre broadcast media transcription. ASRU 2015: 681-686 - [c40]Chahid Ouali, Pierre Dumouchel, Vishwa Gupta:
GPU implementation of an audio fingerprints similarity search algorithm. CBMI 2015: 1-6 - [c39]Chahid Ouali, Pierre Dumouchel, Vishwa Gupta:
Efficient spectrogram-based binary image feature for audio copy detection. ICASSP 2015: 1792-1796 - [c38]Vishwa Gupta:
Speaker change point detection using deep neural nets. ICASSP 2015: 4420-4424 - [c37]Chahid Ouali, Pierre Dumouchel, Vishwa Gupta:
Content-Based Multimedia Copy Detection. ISM 2015: 597-600 - 2014
- [c36]Chahid Ouali, Pierre Dumouchel, Vishwa Gupta:
A robust audio fingerprinting method for content-based copy detection. CBMI 2014: 1-6 - [c35]Vishwa Gupta, Patrick Kenny, Pierre Ouellet, Themos Stafylakis:
I-vector-based speaker adaptation of deep neural networks for French broadcast audio transcription. ICASSP 2014: 6334-6338 - [c34]Chahid Ouali, Pierre Dumouchel, Vishwa Gupta:
Robust features for content-based audio copy detection. INTERSPEECH 2014: 2395-2399 - [c33]Patrick Kenny, Themos Stafylakis, Pierre Ouellet, Vishwa Gupta, Md. Jahangir Alam:
Deep Neural Networks for extracting Baum-Welch statistics for Speaker Recognition. Odyssey 2014: 293-298 - [c32]Anthony Rousseau, Gilles Boulianne, Paul Deléglise, Yannick Estève, Vishwa Gupta, Sylvain Meignier:
LIUM and CRIM ASR System Combination for the REPERE Evaluation Campaign. TSD 2014: 441-448 - 2013
- [c31]Themos Stafylakis, Patrick Kenny, Vishwa Gupta, Pierre Dumouchel:
Compensation for inter-frame correlations in speaker diarization and recognition. ICASSP 2013: 7731-7735 - [c30]Vishwa Gupta, Gilles Boulianne:
Comparing computation in Gaussian mixture and neural network based large-vocabulary speech recognition. INTERSPEECH 2013: 617-621 - 2012
- [j6]Vishwa Gupta, Gilles Boulianne, Patrick Cardinal:
CRIM's content-based audio copy detection system for TRECVID 2009. Multim. Tools Appl. 60(2): 371-387 (2012) - [c29]Vishwa Gupta, Parisa Darvish Zadeh Varcheie, Langis Gagnon, Gilles Boulianne:
Content-based video copy detection using nearest-neighbor mapping. ISSPA 2012: 918-923 - 2011
- [c28]Vishwa Gupta, Parisa Darvish Zadeh Varcheie, Langis Gagnon, Gilles Boulianne:
CRIM AT TRECVID-2011: Content-Based Copy Detection using Nearest-Neighbor Mapping. TRECVID 2011 - 2010
- [c27]Vishwa Gupta, Gilles Boulianne, Patrick Cardinal:
Crim's content-based audio copy detection system for TRECVID 2009. CBMI 2010: 1-6 - [c26]Langis Gagnon, Claude Chapdelaine, David Byrns, Samuel Foucher, Maguelonne Héritier, Vishwa Gupta:
A computer-vision-assisted system for Videodescription scripting. CVPR Workshops 2010: 41-48 - [c25]Vishwa Gupta, Gilles Boulianne, Patrick Cardinal:
Content-based audio copy detection using nearest-neighbor mapping. ICASSP 2010: 261-264 - [c24]Richard C. Rose, Atta Norouzian, Aarthi M. Reddy, André Coy, Vishwa Gupta, Martin Karafiát:
Subword-based spoken term detection in audio course lectures. ICASSP 2010: 5282-5285 - [c23]Patrick Cardinal, Vishwa Gupta, Gilles Boulianne:
Content-based advertisement detection. INTERSPEECH 2010: 2214-2217
2000 – 2009
- 2009
- [c22]Maguelonne Héritier, Vishwa Gupta, Langis Gagnon, Gilles Boulianne, Samuel Foucher, Patrick Cardinal:
CRIM´s Content-Based Copy Detection System for TRECVID. TRECVID 2009 - 2008
- [j5]Patrick Kenny, Pierre Ouellet, Najim Dehak, Vishwa Gupta, Pierre Dumouchel:
A Study of Interspeaker Variability in Speaker Verification. IEEE Trans. Speech Audio Process. 16(5): 980-988 (2008) - [c21]Vishwa Gupta, Gilles Boulianne, Patrick Kenny, Pierre Ouellet, Pierre Dumouchel:
Speaker diarization of French broadcast news. ICASSP 2008: 4365-4368 - [c20]Patrick Kenny, Najim Dehak, Pierre Ouellet, Vishwa Gupta, Pierre Dumouchel:
Development of the primary CRIM system for the NIST 2008 speaker recognition evaluation. INTERSPEECH 2008: 1401-1404 - [c19]Vishwa Gupta, Gilles Boulianne, Patrick Kenny, Pierre Dumouchel:
Advertisement detection in French broadcast news using acoustic repetition and Gaussian mixture models. INTERSPEECH 2008: 2538-2541 - [c18]Patrick Kenny, Najim Dehak, Réda Dehak, Vishwa Gupta, Pierre Dumouchel:
The role of speaker factors in the NIST extended data task. Odyssey 2008: 11 - 2007
- [j4]Vishwa Gupta, Patrick Kenny, Pierre Ouellet, Gilles Boulianne, Pierre Dumouchel:
Combining Gaussianized/Non-Gaussianized Features to Improve Speaker Diarization of Telephone Conversations. IEEE Signal Process. Lett. 14(12): 1040-1043 (2007) - [c17]Vishwa Gupta, Patrick Kenny, Pierre Ouellet, Gilles Boulianne, Pierre Dumouchel:
Multiple feature combination to improve speaker diarization of telephone conversations. ASRU 2007: 705-710 - 2006
- [c16]Patrick Kenny, Vishwa Gupta, Gilles Boulianne, Pierre Ouellet, Pierre Dumouchel:
Feature normalization using smoothed mixture transformations. INTERSPEECH 2006 - 2000
- [j3]Vishwa Gupta, Serge Robillard, Claude Pelletier:
Automation of locality recognition in ADAS plus. Speech Commun. 31(4): 321-328 (2000)
1990 – 1999
- 1999
- [c15]Jian-Xiong Wu, Vishwa Gupta:
Application of simultaneous decoding algorithms to automatic transcription of known and unknown words. ICASSP 1999: 589-592 - 1996
- [c14]Rivarol Vergin, Douglas D. O'Shaughnessy, Vishwa Gupta:
Compensated mel frequency cepstrum coefficients. ICASSP 1996: 323-326 - 1993
- [j2]Patrick Kenny, Rene Hollan, Vishwa Gupta, Matthew Lennig, Paul Mermelstein, Douglas D. O'Shaughnessy:
A*-admissible heuristics for rapid lexical access. IEEE Trans. Speech Audio Process. 1(1): 49-58 (1993) - 1992
- [c13]Yan Ming Cheng, Douglas D. O'Shaughnessy, Vishwa Gupta, Patrick Kenny, Matthew Lennig, Paul Mermelstein, Sarangarajan Parthasarathy:
Hybrid segmental-LVQ/HMM for large vocabulary speech recognition. ICASSP 1992: 593-596 - [c12]Matthew Lennig, Douglas Sharp, Patrick Kenny, Vishwa Gupta, Kristin Precoda:
Flexible vocabulary recognition of speech. ICSLP 1992: 93-96 - 1991
- [j1]Li Deng, Patrick Kenny, Matthew Lennig, Vishwa Gupta, Franz Seitz, Paul Mermelstein:
Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition. IEEE Trans. Signal Process. 39(7): 1677-1681 (1991) - [c11]Vishwa Gupta, Matthew Lennig, Paul Mermelstein, Patrick Kenny, Franz Seitz, Douglas D. O'Shaughnessy:
Using phoneme duration and energy contour information to improve large vocabulary isolated-word recognition. ICASSP 1991: 341-344 - [c10]Patrick Kenny, Rene Hollan, Vishwa Gupta, Matthew Lennig, Paul Mermelstein, Douglas D. O'Shaughnessy:
A*-admissible heuristics for rapid lexical access. ICASSP 1991: 689-692 - [c9]Patrick Kenny, Sarangarajan Parthasarathy, Vishwa Gupta, Matthew Lennig, Paul Mermelstein, Douglas D. O'Shaughnessy:
Energy, duration and Markov models. EUROSPEECH 1991: 655-658 - 1990
- [c8]Li Deng, Vishwa Gupta, Matthew Lennig, Patrick Kenny, Paul Mermelstein:
Acoustic recognition component of an 86000-word speech recognizer. ICASSP 1990: 741-744 - [c7]Matthew Lennig, Vishwa Gupta, Patrick Kenny, Paul Mermelstein, Douglas D. O'Shaughnessy:
An 86, 000-Word Recognizer Based on Phonemic Models. HLT 1990
1980 – 1989
- 1989
- [c6]Li Deng, Patrick Kenny, Matthew Lennig, Vishwa Gupta, Paul Mermelstein:
A locus model of coarticulation in an HMM speech recognizer. ICASSP 1989: 97-100 - 1988
- [c5]Li Deng, Matthew Lennig, Vishwa Gupta, Paul Mermelstein:
Modeling acoustic-phonetic detail in an HMM-based large vocabulary speech recognizer. ICASSP 1988: 509-512 - [c4]Pierre Dumouchel, Vishwa Gupta, Matthew Lennig, Paul Mermelstein:
Three probabilistic language models for a large-vocabulary speech recognizer. ICASSP 1988: 513-516 - 1987
- [c3]Vishwa Gupta, Matthew Lennig, Paul Mermelstein:
Integration of acoustic information in a large vocabulary word recognizer. ICASSP 1987: 697-700 - 1984
- [c2]Vishwa Gupta, Matthew Lennig, Paul Mermelstein:
Decision rules for speaker-independent isolated word recognition. ICASSP 1984: 336-339
1970 – 1979
- 1978
- [c1]Vishwa Gupta, J. Kent Bryan, John N. Gowdy:
Speaker-independent vowel indetification in continuous speech. ICASSP 1978: 546-548
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-07-31 21:42 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint