default search action
Jahangir Alam 0001
Person information
- affiliation: Computer Research Institute of Montreal, CRIM, Quebec, Canada
- affiliation (PhD 2014): University of Quebec, Institut National de la Recherche Scientifique, QC, Canada
Other persons with the same name
- Jahangir Alam 0002 — Aligarh Muslim University, Faculty of Engineering and Technology, India
- Jahangir Alam 0003 — North South University, Department of Electrical and Computer Engineering, Dhaka, Bangladesh
- Jahangir Alam 0004 — University of Dhaka, Department of Applied Mathematics, Bangladesh
- Jahangir Alam 0006 — University of Duisburg-Essen, Institute of Digital Signal Processing, Faculty of Engineering, Germany
- Md. Jahangir Alam (aka: Md Jahangir Alam) — disambiguation page
- Md. Jahangir Alam 0004 — Multimedia University, Faculty of Information Technology, Selangor, Malaysia
- Md. Jahangir Alam 0005 — Powerfront Pty. Ltd., Richmond, VIC, Australia (and 1 more)
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j17]R. Gnana Praveen, Jahangir Alam:
Incongruity-Aware Cross-Modal Attention for Audio-Visual Fusion in Dimensional Emotion Recognition. IEEE J. Sel. Top. Signal Process. 18(3): 444-458 (2024) - [j16]Abderrahim Fathan, Jahangir Alam:
An analytic study on clustering driven self-supervised speaker verification. Pattern Recognit. Lett. 179: 80-86 (2024) - [c101]R. Gnana Praveen, Jahangir Alam:
Recursive Joint Cross-Modal Attention for Multimodal Fusion in Dimensional Emotion Recognition. CVPR Workshops 2024: 4803-4813 - [c100]R. Gnana Praveen, Jahangir Alam:
Dynamic Cross Attention for Audio-Visual Person Verification. FG 2024: 1-5 - [c99]R. Gnana Praveen, Jahangir Alam:
Audio-Visual Person Verification Based on Recursive Fusion of Joint Cross-Attention. FG 2024: 1-5 - [c98]Abderrahim Fathan, Jahangir Alam:
Self-Supervised Speaker Verification Employing A Novel Clustering Algorithm. ICASSP 2024: 12597-12601 - [c97]R. Gnana Praveen, Jahangir Alam:
Cross-Attention is not always needed: Dynamic Cross-Attention for Audio-Visual Dimensional Emotion Recognition. ICME 2024: 1-6 - [c96]Abderrahim Fathan, Xiaolin Zhu, Jahangir Alam:
An investigative study of the effect of several regularization techniques on label noise robustness of self-supervised speaker verification systems. Odyssey 2024: 43-50 - [c95]Gnana Praveen Rajasekhar, Jahangir Alam:
Cross-Modal Transformers for Audio-Visual Person Verification. Odyssey 2024: 240-246 - [i13]R. Gnana Praveen, Jahangir Alam:
Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention. CoRR abs/2403.04654 (2024) - [i12]R. Gnana Praveen, Jahangir Alam:
Dynamic Cross Attention for Audio-Visual Person Verification. CoRR abs/2403.04661 (2024) - [i11]R. Gnana Praveen, Jahangir Alam:
Recursive Joint Cross-Modal Attention for Multimodal Fusion in Dimensional Emotion Recognition. CoRR abs/2403.13659 (2024) - [i10]R. Gnana Praveen, Jahangir Alam:
Cross-Attention is Not Always Needed: Dynamic Cross-Attention for Audio-Visual Dimensional Emotion Recognition. CoRR abs/2403.19554 (2024) - [i9]Hossein Zeinali, Kong Aik Lee, Jahangir Alam, Lukás Burget:
Text-dependent Speaker Verification (TdSV) Challenge 2024: Challenge Evaluation Plan. CoRR abs/2404.13428 (2024) - [i8]R. Gnana Praveen, Jahangir Alam:
Inconsistency-Aware Cross-Attention for Audio-Visual Fusion in Dimensional Emotion Recognition. CoRR abs/2405.12853 (2024) - 2023
- [c94]Abderrahim Fathan, Jahangir Alam:
CAMSAT: Augmentation Mix and Self-Augmented Training Clustering for Self-Supervised Speaker Recognition. ASRU 2023: 1-8 - [c93]Jahangir Alam, Woo Hyun Kang, Abderrahim Fathan:
Hybrid Neural Network with Cross- and Self-Module Attention Pooling for Text-Independent Speaker Verification. ICASSP 2023: 1-5 - [c92]Abderrahim Fathan, Jahangir Alam, Woo Hyun Kang:
Investigation Of The Quality Of Pseudo-Labels For The Self-Supervised Speaker Verification Task. ICASSP Workshops 2023: 1-5 - [c91]Jahangir Alam:
On the Use of Cross- and Self-Module Attentive Statistics Pooling Techniques for Text-Independent Speaker Verification. IJCB 2023: 1-9 - [c90]Jahangir Alam, Abderrahim Fathan:
On the Use of Cross-module Attention Statistics Pooling for Speaker Verification. IWBF 2023: 1-6 - [c89]Abderrahim Fathan, Jahangir Alam:
On the influence of the quality of pseudo-labels on the self-supervised speaker verification task: a thorough analysis. IWBF 2023: 1-6 - [c88]Gnana Praveen Rajasekhar, Jahangir Alam:
Audio-Visual Speaker Verification via Joint Cross-Attention. SPECOM (2) 2023: 18-31 - [c87]Md Shahidul Alam, Abderrahim Fathan, Jahangir Alam:
Audio DeepFake Detection Employing Multiple Parametric Exponential Linear Units. SPECOM (2) 2023: 307-321 - [c86]Abderrahim Fathan, Jahangir Alam, Xiaolin Zhu:
Multi-task Learning over Mixup Variants for the Speaker Verification Task. SPECOM (2) 2023: 446-460 - [c85]Abderrahim Fathan, Jahangir Alam:
Self-supervised Speaker Verification Employing Augmentation Mix and Self-augmented Training-Based Clustering. SPECOM (2) 2023: 550-563 - [i7]R. Gnana Praveen, Jahangir Alam:
Audio-Visual Speaker Verification via Joint Cross-Attention. CoRR abs/2309.16569 (2023) - 2022
- [j15]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
L-Mix: A Latent-Level Instance Mixup Regularization for Robust Self-Supervised Speaker Representation Learning. IEEE J. Sel. Top. Signal Process. 16(6): 1263-1272 (2022) - [j14]João Monteiro, Jahangir Alam, Tiago H. Falk:
Multi-level self-attentive TDNN: A general and efficient approach to summarize speech into discriminative utterance-level representations. Speech Commun. 140: 42-49 (2022) - [j13]Mohamed Dahmane, Jahangir Alam, Pierre-Luc St-Charles, Marc Lalonde, Kevin Heffner, Samuel Foucher:
A Multimodal Non-Intrusive Stress Monitoring From the Pleasure-Arousal Emotional Dimensions. IEEE Trans. Affect. Comput. 13(2): 1044-1056 (2022) - [c84]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
Robust Self-Supervised Speaker Representation Learning Via Instance Mix Regularization. ICASSP 2022: 6617-6621 - [c83]Abderrahim Fathan, Jahangir Alam, Woo Hyun Kang:
Mel-Spectrogram Image-Based End-to-End Audio Deepfake Detection Under Channel-Mismatched Conditions. ICME 2022: 1-6 - [c82]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
MIM-DG: Mutual information minimization-based domain generalization for speaker verification. INTERSPEECH 2022: 3674-3678 - [c81]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
Mixup regularization strategies for spoofing countermeasure system. INTERSPEECH 2022: 3734-3738 - [c80]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
End-to-end framework for spoof-aware speaker verification. INTERSPEECH 2022: 4362-4366 - [c79]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
Deep learning-based end-to-end spoken language identification system for domain-mismatched scenario. LREC 2022: 7339-7343 - [c78]Jahangir Alam, Woo Hyun Kang, Abderrahim Fathan:
Hybrid Neural Network-Based Deep Embedding Extractors for Text-Independent Speaker Verification. Odyssey 2022: 33-40 - [c77]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
Investigation on Mixup Strategies for End-to-End Voice Spoof Detection System. Odyssey 2022: 55-61 - [c76]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
Domain Generalized Speaker Embedding Learning via Mutual Information Minimization. Odyssey 2022: 178-184 - [c75]Jahangir Alam, Radek Benes, Marian Beszédes, Lukás Burget, Mohamed Dahmane, Abderrahim Fathan, Hamed Ghodrati, Ondrej Glembek, Woo Hyun Kang, Pavel Matejka, Ladislav Mosner, Oldrich Plchot, Johan Rohdin, Anna Silnova, Themos Stafylakis:
Development of ABC Systems for the 2021 Edition of NIST Speaker Recognition Evaluation. Odyssey 2022: 346-353 - [c74]Woo Hyun Kang, Jahangir Alam:
Investigation on Deep Speaker Embedding Extraction Methods for Multi-Genre Speaker Verification. Odyssey 2022: 376-383 - [c73]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
Flow-ER: A Flow-Based Embedding Regularization Strategy for Robust Speech Representation Learning. SLT 2022: 563-570 - [c72]Jahangir Alam, Woo Hyun Kang, Abderrahim Fathan:
Neural Embedding Extractors for Text-Independent Speaker Verification. SPECOM 2022: 10-23 - [c71]Abderrahim Fathan, Jahangir Alam, Woo Hyun Kang:
Multiresolution Decomposition Analysis via Wavelet Transforms for Audio Deepfake Detection. SPECOM 2022: 188-200 - [c70]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
An Analytic Study on Clustering-Based Pseudo-labels for Self-supervised Deep Speaker Verification. SPECOM 2022: 338-348 - [i6]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
Attentive activation function for improving end-to-end spoofing countermeasure systems. CoRR abs/2205.01528 (2022) - 2021
- [j12]Anderson R. Avila, Jahangir Alam, Fabiano O. Costa Prado, Douglas D. O'Shaughnessy, Tiago H. Falk:
On the use of blind channel response estimation and a residual neural network to detect physical access attacks to speaker verification systems. Comput. Speech Lang. 66: 101163 (2021) - [c69]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
Hybrid Network with Multi-Level Global-Local Statistics Pooling for Robust Text-Independent Speaker Recognition. ASRU 2021: 1116-1123 - [c68]Jahangir Alam, Abderrahim Fathan, Woo Hyun Kang:
Text-Independent Speaker Verification Employing CNN-LSTM-TDNN Hybrid Networks. SPECOM 2021: 1-13 - [c67]Jahangir Alam, Abderrahim Fathan, Woo Hyun Kang:
End-to-End Voice Spoofing Detection Employing Time Delay Neural Networks and Higher Order Statistics. SPECOM 2021: 14-25 - [c66]Abderrahim Fathan, Jahangir Alam, Woo Hyun Kang:
An Ensemble Approach for the Diagnosis of COVID-19 from Speech and Cough Sounds. SPECOM 2021: 190-201 - [i5]Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan:
Robust Speech Representation Learning via Flow-based Embedding Regularization. CoRR abs/2112.03454 (2021) - 2020
- [j11]João Monteiro, Jahangir Alam, Tiago H. Falk:
Generalized end-to-end detection of spoofing attacks to automatic speaker recognizers. Comput. Speech Lang. 63: 101096 (2020) - [j10]Anderson R. Avila, Jahangir Alam, Douglas D. O'Shaughnessy, Tiago H. Falk:
On the use of the i-vector speech representation for instrumental quality measurement. Qual. User Exp. 5(1) (2020) - [c65]João Monteiro, Jahangir Alam, Tiago H. Falk:
An Ensemble Based Approach for Generalized Detection of Spoofing Attacks to Automatic Speaker Recognizers. ICASSP 2020: 6599-6603 - [c64]João Monteiro, Isabela Albuquerque, Jahangir Alam, R. Devon Hjelm, Tiago H. Falk:
An end-to-end approach for the verification problem: learning the right distance. ICML 2020: 7022-7033 - [c63]Hossein Zeinali, Kong Aik Lee, Jahangir Alam, Lukás Burget:
SdSV Challenge 2020: Large-Scale Evaluation of Short-Duration Speaker Verification. INTERSPEECH 2020: 731-735 - [c62]João Monteiro, Jahangir Alam, Tiago H. Falk:
On The Performance of Time-Pooling Strategies for End-to-End Spoken Language Identification. LREC 2020: 3566-3572 - [c61]Jahangir Alam, Gilles Boulianne, Lukás Burget, Mohamed Dahmane, Mireia Díez Sánchez, Alicia Lozano-Diez, Ondrej Glembek, Pierre-Luc St-Charles, Marc Lalonde, Pavel Matejka, Petr Mizera, João Monteiro, Ladislav Mosner, Cedric Noiseux, Ondrej Novotný, Oldrich Plchot, Johan Rohdin, Anna Silnova, Josef Slavícek, Themos Stafylakis, Shuai Wang, Hossein Zeinali:
Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. Odyssey 2020: 289-295 - [c60]João Monteiro, Jahangir Alam, Tiago H. Falk:
A Multi-condition Training Strategy for Countermeasures Against Spoofing Attacks to Speaker Recognizers. Odyssey 2020: 296-303 - [i4]João Monteiro, Isabela Albuquerque, Jahangir Alam, R. Devon Hjelm, Tiago H. Falk:
An end-to-end approach for the verification problem: learning the right distance. CoRR abs/2002.09469 (2020)
2010 – 2019
- 2019
- [j9]João Monteiro, Jahangir Alam, Tiago H. Falk:
Residual convolutional neural network with attentive feature pooling for end-to-end language identification from short-duration speech. Comput. Speech Lang. 58: 364-376 (2019) - [c59]João Monteiro, Jahangir Alam:
Development of Voice Spoofing Detection Systems for 2019 Edition of Automatic Speaker Verification and Countermeasures Challenge. ASRU 2019: 1003-1010 - [c58]João Monteiro, Jahangir Alam, Gautam Bhattacharya, Tiago H. Falk:
End-to-End Language Identification Using a Residual Convolutional Neural Network with Attentive Temporal Pooling. EUSIPCO 2019: 1-5 - [c57]Gautam Bhattacharya, Jahangir Alam, Patrick Kenny:
Adapting End-to-end Neural Speaker Verification to New Languages and Recording Conditions with Adversarial Training. ICASSP 2019: 6041-6045 - [c56]Gautam Bhattacharya, João Monteiro, Jahangir Alam, Patrick Kenny:
Generative Adversarial Speaker Embedding Networks for Domain Robust End-to-end Speaker Verification. ICASSP 2019: 6226-6230 - [c55]Gautam Bhattacharya, Md. Jahangir Alam, Patrick Kenny:
Deep Speaker Recognition: Modular or Monolithic? INTERSPEECH 2019: 1143-1147 - [c54]Anderson R. Avila, Jahangir Alam, Douglas D. O'Shaughnessy, Tiago H. Falk:
Blind Channel Response Estimation for Replay Attack Detection. INTERSPEECH 2019: 2893-2897 - [c53]Vishwa Gupta, Lise Rebout, Gilles Boulianne, Pierre André Ménard, Jahangir Alam:
CRIM's Speech Transcription and Call Sign Detection System for the ATC Airbus Challenge Task. INTERSPEECH 2019: 3018-3022 - [c52]João Monteiro, Jahangir Alam, Tiago H. Falk:
Combining Speaker Recognition and Metric Learning for Speaker-Dependent Representation Learning. INTERSPEECH 2019: 4015-4019 - [c51]João Monteiro, Jahangir Alam, Tiago H. Falk:
End-To-End Detection Of Attacks To Automatic Speaker Recognizers With Time-Attentive Light Convolutional Neural Networks. MLSP 2019: 1-6 - [c50]Anderson R. Avila, Jahangir Alam, Douglas D. O'Shaughnessy, Tiago H. Falk:
Intrusive Quality Measurement of Noisy and Enhanced Speech based on i-Vector Similarity. QoMEX 2019: 1-5 - [c49]Jahangir Alam:
On the Use of Fisher Vector Encoding for Voice Spoofing Detection. UCAmI 2019: 37 - [i3]Hossein Zeinali, Kong Aik Lee, Jahangir Alam, Lukás Burget:
Short-duration Speaker Verification (SdSV) Challenge 2020: the Challenge Evaluation Plan. CoRR abs/1912.06311 (2019) - 2018
- [c48]Gautam Bhattacharya, Jahangir Alam, Vishwa Gupta, Patrick Kenny:
Deeply Fused Speaker Embeddings for Text-Independent Speaker Verification. INTERSPEECH 2018: 3588-3592 - [c47]Anderson R. Avila, Md. Jahangir Alam, Douglas D. O'Shaughnessy, Tiago H. Falk:
Investigating Speech Enhancement and Perceptual Quality for Speech Emotion Recognition. INTERSPEECH 2018: 3663-3667 - [c46]Md. Jahangir Alam, Gautam Bhattacharya, Patrick Kenny:
Speaker Verification in Mismatched Conditions with Frustratingly Easy Domain Adaptation. Odyssey 2018: 176-180 - [c45]Md. Jahangir Alam, Gautam Bhattacharya, Patrick Kenny:
Boosting the Performance of Spoofing Detection Systems on Replay Attacks Using q-Logarithm Domain Feature Normalization. Odyssey 2018: 393-398 - [i2]Gautam Bhattacharya, Jahangir Alam, Patrick Kenny:
Adapting End-to-End Neural Speaker Verification to New Languages and Recording Conditions with Adversarial Training. CoRR abs/1811.03055 (2018) - [i1]Gautam Bhattacharya, João Monteiro, Jahangir Alam, Patrick Kenny:
Generative Adversarial Speaker Embedding Networks for Domain Robust End-to-End Speaker Verification. CoRR abs/1811.03063 (2018) - 2017
- [c44]Md. Jahangir Alam, Patrick Kenny:
Spoofing detection employing infinite impulse response - constant Q transform-based feature representations. EUSIPCO 2017: 101-105 - [c43]Oldrich Plchot, Pavel Matejka, Anna Silnova, Ondrej Novotný, Mireia Díez Sánchez, Johan Rohdin, Ondrej Glembek, Niko Brümmer, Albert Swart, Jesús Jorrín-Prieto, Paola García, Luis Buera, Patrick Kenny, Md. Jahangir Alam, Gautam Bhattacharya:
Analysis and Description of ABC Submission to NIST SRE 2016. INTERSPEECH 2017: 1348-1352 - [c42]Gautam Bhattacharya, Jahangir Alam, Patrick Kenny:
Deep Speaker Embeddings for Short-Duration Speaker Verification. INTERSPEECH 2017: 1517-1521 - [c41]Jahangir Alam, Patrick Kenny, Gautam Bhattacharya, Marcel Kockmann:
Speaker Verification Under Adverse Conditions Using i-Vector Adaptation and Neural Networks. INTERSPEECH 2017: 3732-3736 - 2016
- [j8]Themos Stafylakis, Patrick Kenny, Md. Jahangir Alam, Marcel Kockmann:
Speaker and Channel Factors in Text-Dependent Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(1): 65-78 (2016) - [j7]Themos Stafylakis, Md. Jahangir Alam, Patrick Kenny:
Text-Dependent Speaker Recognition With Random Digit Strings. IEEE ACM Trans. Audio Speech Lang. Process. 24(7): 1194-1203 (2016) - [c40]Md. Jahangir Alam, Patrick Kenny, Vishwa Gupta:
Tandem Features for Text-Dependent Speaker Verification on the RedDots Corpus. INTERSPEECH 2016: 420-424 - [c39]Gautam Bhattacharya, Patrick Kenny, Jahangir Alam, Themos Stafylakis:
Deep Neural Network based Text-Dependent Speaker Verification : Preliminary Results. Odyssey 2016: 9-15 - [c38]Patrick Kenny, Themos Stafylakis, Jahangir Alam, Vishwa Gupta, Marcel Kockmann:
Uncertainty Modeling Without Subspace Methods For Text-Dependent Speaker Recognition. Odyssey 2016: 16-23 - [c37]Md. Jahangir Alam, Patrick Kenny, Vishwa Gupta, Themos Stafylakis:
Spoofing Detection on the ASVspoof2015 Challenge Corpus Employing Deep Neural Networks. Odyssey 2016: 270-276 - [c36]Themos Stafylakis, Patrick Kenny, Vishwa Gupta, Jahangir Alam, Marcel Kockmann:
Compensation for phonetic nuisance variability in speaker recognition using DNNs. Odyssey 2016: 340-345 - [c35]Gautam Bhattacharya, Jahangir Alam, Patrick Kenny, Vishwa Gupta:
Modelling speaker and channel variability using deep neural networks for robust speaker verification. SLT 2016: 192-198 - 2015
- [j6]Md. Jahangir Alam, Vishwa Gupta, Patrick Kenny, Pierre Dumouchel:
Speech recognition in reverberant and noisy environments employing multiple feature extractors and i-vector speaker adaptation. EURASIP J. Adv. Signal Process. 2015: 50 (2015) - [j5]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Regularized minimum variance distortionless response-based cepstral features for robust continuous speech recognition. Speech Commun. 73: 28-46 (2015) - [c34]Patrick Kenny, Themos Stafylakis, Jahangir Alam, Marcel Kockmann:
JFA modeling with left-to-right structure and a new backend for text-dependent speaker recognition. ICASSP 2015: 4689-4693 - [c33]Themos Stafylakis, Patrick Kenny, Md. Jahangir Alam, Marcel Kockmann:
JFA for speaker recognition with random digit strings. INTERSPEECH 2015: 190-194 - [c32]Md. Jahangir Alam, Patrick Kenny, Themos Stafylakis:
Combining amplitude and phase-based features for speaker verification with short duration utterances. INTERSPEECH 2015: 249-253 - [c31]Md. Jahangir Alam, Patrick Kenny, Gautam Bhattacharya, Themos Stafylakis:
Development of CRIM system for the automatic speaker verification spoofing and countermeasures challenge 2015. INTERSPEECH 2015: 2072-2076 - [c30]Patrick Kenny, Themos Stafylakis, Md. Jahangir Alam, Marcel Kockmann:
An i-vector backend for speaker verification. INTERSPEECH 2015: 2307-2311 - [c29]Kong-Aik Lee, Anthony Larcher, Guangsen Wang, Patrick Kenny, Niko Brümmer, David A. van Leeuwen, Hagai Aronowitz, Marcel Kockmann, Carlos Vaquero, Bin Ma, Haizhou Li, Themos Stafylakis, Md. Jahangir Alam, Albert Swart, Javier Perez:
The reddots data collection for speaker recognition. INTERSPEECH 2015: 2996-3000 - [c28]Patrick Cardinal, Najim Dehak, Alessandro Lameiras Koerich, Jahangir Alam, Patrice Boucher:
ETS System for AV+EC 2015 Challenge. AVEC@ACM Multimedia 2015: 17-23 - 2014
- [j4]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Robust feature extraction based on an asymmetric level-dependent auditory filterbank and a subband spectrum enhancement technique. Digit. Signal Process. 29: 147-157 (2014) - [c27]Md. Jahangir Alam, Patrick Kenny, Pierre Dumouchel, Douglas D. O'Shaughnessy:
Robust feature extractors for continuous speech recognition. EUSIPCO 2014: 944-948 - [c26]Md. Jahangir Alam, Patrick Kenny, Pierre Dumouchel, Douglas D. O'Shaughnessy:
Robust speech recognition using warped DFT-based cepstral features in clean and multistyle training. EUSIPCO 2014: 1791-1795 - [c25]Patrick Kenny, Themos Stafylakis, Pierre Ouellet, Md. Jahangir Alam:
JFA-based front ends for speaker recognition. ICASSP 2014: 1705-1709 - [c24]Patrick Kenny, Themos Stafylakis, Md. Jahangir Alam, Pierre Ouellet, Marcel Kockmann:
In-domain versus out-of-domain training for text-dependent JFA. INTERSPEECH 2014: 1332-1336 - [c23]Md. Jahangir Alam, Patrick Kenny, Pierre Dumouchel, Douglas D. O'Shaughnessy:
Noise spectrum estimation using Gaussian mixture model-based speech presence probability for robust speech recognition. INTERSPEECH 2014: 2759-2763 - [c22]Md. Jahangir Alam, Yazid Attabi, Patrick Kenny, Pierre Dumouchel, Douglas D. O'Shaughnessy:
Automatic Emotion Recognition from Cochlear Implant-Like Spectrally Reduced Speech. IWAAL 2014: 332-340 - [c21]Patrick Kenny, Themos Stafylakis, Pierre Ouellet, Md. Jahangir Alam, Pierre Dumouchel:
Supervised/Unsupervised Voice Activity Detectors for Text-dependent Speaker Recognition on the RSR2015 Corpus. Odyssey 2014: 123-130 - [c20]Patrick Kenny, Themos Stafylakis, Md. Jahangir Alam, Pierre Ouellet, Marcel Kockmann:
Joint Factor Analysis for Text-Dependent Speaker Verification. Odyssey 2014: 200-207 - [c19]Patrick Kenny, Themos Stafylakis, Pierre Ouellet, Vishwa Gupta, Md. Jahangir Alam:
Deep Neural Networks for extracting Baum-Welch statistics for Speaker Recognition. Odyssey 2014: 293-298 - 2013
- [j3]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems. Cogn. Comput. 5(4): 533-544 (2013) - [j2]Md. Jahangir Alam, Tomi Kinnunen, Patrick Kenny, Pierre Ouellet, Douglas D. O'Shaughnessy:
Multitaper MFCC and PLP features for speaker verification using i-vectors. Speech Commun. 55(2): 237-251 (2013) - [c18]Yazid Attabi, Md. Jahangir Alam, Pierre Dumouchel, Patrick Kenny, Douglas D. O'Shaughnessy:
Multiple windowed spectral features for emotion recognition. ICASSP 2013: 7527-7531 - [c17]Patrick Kenny, Themos Stafylakis, Pierre Ouellet, Md. Jahangir Alam, Pierre Dumouchel:
PLDA for speaker verification with utterances of arbitrary duration. ICASSP 2013: 7649-7653 - [c16]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Speech recognition using regularized minimum variance distortionless response spectrum estimation-based cepstral features. ICASSP 2013: 8071-8075 - [c15]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Regularized MVDR spectrum estimation-based robust feature extractors for speech recognition. INTERSPEECH 2013: 891-895 - [c14]Md. Jahangir Alam, Yazid Attabi, Pierre Dumouchel, Patrick Kenny, Douglas D. O'Shaughnessy:
Amplitude modulation features for emotion recognition from speech. INTERSPEECH 2013: 2420-2424 - [c13]Tomi Kinnunen, Md. Jahangir Alam, Pavel Matejka, Patrick Kenny, Jan Cernocký, Douglas D. O'Shaughnessy:
Frequency warping and robust speaker verification: a comparison of alternative mel-scale representations. INTERSPEECH 2013: 3122-3126 - [c12]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Smoothed Nonlinear Energy Operator-Based Amplitude Modulation Features for Robust Speech Recognition. NOLISP 2013: 168-175 - 2012
- [c11]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Robust speech recognition under noisy environments using asymmetric tapers. EUSIPCO 2012: 1638-1642 - [c10]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
Robust Feature Extraction for Speech Recognition by Enhancing Auditory Spectrum. INTERSPEECH 2012: 1360-1363 - [c9]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
On the use of asymmetric-shaped tapers for speaker verification using i-vectors. Odyssey 2012: 256-262 - 2011
- [j1]Md. Jahangir Alam, Douglas D. O'Shaughnessy:
Perceptual improvement of Wiener filtering employing a post-filter. Digit. Signal Process. 21(1): 54-65 (2011) - [c8]Md. Jahangir Alam, Tomi Kinnunen, Patrick Kenny, Pierre Ouellet, Douglas D. O'Shaughnessy:
Multi-taper MFCC features for speaker verification using I-vectors. ASRU 2011: 547-552 - [c7]Pavel Matejka, Ondrej Glembek, Fabio Castaldo, Md. Jahangir Alam, Oldrich Plchot, Patrick Kenny, Lukás Burget, Jan Cernocký:
Full-covariance UBM and heavy-tailed PLDA in i-vector speaker verification. ICASSP 2011: 4828-4831 - [c6]Md. Jahangir Alam, Patrick Kenny, Douglas D. O'Shaughnessy:
A Study of Low-variance Multi-taper Features for Distributed Speech Recognition. NOLISP 2011: 239-245 - [c5]Md. Jahangir Alam, Pierre Ouellet, Patrick Kenny, Douglas D. O'Shaughnessy:
Comparative Evaluation of Feature Normalization Techniques for Speaker Verification. NOLISP 2011: 246-253
2000 – 2009
- 2009
- [c4]Md. Jahangir Alam, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
An improved perceptual speech enhancement technique employing a psychoacoustically motivated weighting factor. ASRU 2009: 266-270 - 2008
- [c3]Md. Jahangir Alam, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Speech enhancement based on a hybrid a priori signal-to-noise ratio (SNR) estimator and a self-adaptive Lagrange multiplier. EUSIPCO 2008: 1-5 - [c2]Md. Jahangir Alam, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy, Sofia Ben Jebara:
Speech enhancement using a wiener denoising technique and musical noise reduction. INTERSPEECH 2008: 407-410 - [c1]Md. Jahangir Alam, Douglas D. O'Shaughnessy, Sid-Ahmed Selouani:
Speech enhancement based on novel two-step a priori SNR estimators. INTERSPEECH 2008: 565-568
Coauthor Index
aka: R. Gnana Praveen
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-11 17:30 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint