default search action
Laurent Girin
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j36]Samir Sadok, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda, Renaud Séguier:
A multimodal dynamical variational autoencoder for audiovisual speech representation learning. Neural Networks 172: 106120 (2024) - [c88]Maxime Jacquelin, Maëva Garnier, Laurent Girin, Rémy Vincent, Olivier Perrotin:
Exploring the Multidimensional Representation of Unidimensional Speech Acoustic Parameters Extracted by Deep Unsupervised Models. ICASSP Workshops 2024: 858-862 - [c87]Maxime Jacquelin, Maëva Garnier, Laurent Girin, Rémy Vincent, Olivier Perrotin:
Exploration de la représentation multidimensionnelle de paramètres acoustiques unidimensionnels de la parole extraits par des modèles profonds non supervisés. TALN (JEP) 2024: 82-91 - [i40]Ihab Asaad, Maxime Jacquelin, Olivier Perrotin, Laurent Girin, Thomas Hueber:
Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting. CoRR abs/2405.20101 (2024) - 2023
- [j35]Samir Sadok, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda, Renaud Séguier:
Learning and controlling the source-filter representation of speech with a variational autoencoder. Speech Commun. 148: 53-65 (2023) - [j34]Xiaoyu Lin, Laurent Girin, Xavier Alameda-Pineda:
Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation. Trans. Mach. Learn. Res. 2023 (2023) - [c86]Xiaoyu Lin, Xiaoyu Bie, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda:
Speech Modeling with a Hierarchical Transformer Dynamical VAE. ICASSP 2023: 1-5 - [c85]Xiaoyu Lin, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda:
Unsupervised speech enhancement with deep dynamical generative speech and noise models. INTERSPEECH 2023: 5102-5106 - [c84]Maxime Jacquelin, Maeva Garnier, Laurent Girin, Rémy Vincent, Olivier Perrotin:
Exploring the multidimensional representation of individual speech acoustic parameters extracted by deep unsupervised models. SSW 2023: 240-241 - [i39]Xiaoyu Lin, Xiaoyu Bie, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda:
Speech Modeling with a Hierarchical Transformer Dynamical VAE. CoRR abs/2303.09404 (2023) - [i38]Samir Sadok, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda, Renaud Séguier:
A Multimodal Dynamical Variational Autoencoder for Audiovisual Speech Representation Learning. CoRR abs/2305.03582 (2023) - [i37]Xiaoyu Lin, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda:
Unsupervised speech enhancement with deep dynamical generative speech and noise models. CoRR abs/2306.07820 (2023) - [i36]Xiaoyu Lin, Laurent Girin, Xavier Alameda-Pineda:
Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation. CoRR abs/2312.04167 (2023) - 2022
- [j33]Xiaoyu Bie, Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin:
Unsupervised Speech Enhancement Using Dynamical Variational Autoencoders. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2993-3007 (2022) - [c83]Marc-Antoine Georges, Julien Diard, Laurent Girin, Jean-Luc Schwartz, Thomas Hueber:
Repeat after Me: Self-Supervised Learning of Acoustic-to-Articulatory Mapping by Vocal Imitation. ICASSP 2022: 8252-8256 - [c82]Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber:
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model. INTERSPEECH 2022: 3383-3387 - [i35]Xiaoyu Lin, Laurent Girin, Xavier Alameda-Pineda:
Unsupervised Multiple-Object Tracking with a Dynamical Variational Autoencoder. CoRR abs/2202.09315 (2022) - [i34]Xiaoyu Bie, Wen Guo, Simon Leglaive, Laurent Girin, Francesc Moreno-Noguer, Xavier Alameda-Pineda:
HiT-DVAE: Human Motion Generation via Hierarchical Transformer Dynamical VAE. CoRR abs/2204.01565 (2022) - [i33]Marc-Antoine Georges, Julien Diard, Laurent Girin, Jean-Luc Schwartz, Thomas Hueber:
Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation. CoRR abs/2204.02269 (2022) - [i32]Samir Sadok, Simon Leglaive, Laurent Girin, Xavier Alameda-Pineda, Renaud Séguier:
Learning and controlling the source-filter representation of speech with a variational autoencoder. CoRR abs/2204.07075 (2022) - [i31]Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber:
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model. CoRR abs/2207.01718 (2022) - 2021
- [j32]Laurent Girin, Simon Leglaive, Xiaoyu Bie, Julien Diard, Thomas Hueber, Xavier Alameda-Pineda:
Dynamical Variational Autoencoders: A Comprehensive Review. Found. Trends Mach. Learn. 15(1-2): 1-175 (2021) - [j31]Yutong Ban, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud:
Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers. IEEE Trans. Pattern Anal. Mach. Intell. 43(5): 1761-1776 (2021) - [j30]Fanny Roche, Thomas Hueber, Maëva Garnier, Samuel Limier, Laurent Girin:
Make That Sound More Metallic: Towards a Perceptually Relevant Control of the Timbre of Synthesizer Sounds Using a Variational Autoencoder. Trans. Int. Soc. Music. Inf. Retr. 4(1): 52-66 (2021) - [c81]Pierre-Amaury Grumiaux, Srdan Kitic, Laurent Girin, Alexandre Guérin:
Improved feature extraction for CRNN-based multiple sound source localization. EUSIPCO 2021: 231-235 - [c80]Xiaoyu Bie, Laurent Girin, Simon Leglaive, Thomas Hueber, Xavier Alameda-Pineda:
A Benchmark of Dynamical Variational Autoencoders Applied to Speech Spectrogram Modeling. Interspeech 2021: 46-50 - [c79]Marc-Antoine Georges, Laurent Girin, Jean-Luc Schwartz, Thomas Hueber:
Learning Robust Speech Representation with an Articulatory-Regularized Variational Autoencoder. Interspeech 2021: 3345-3349 - [c78]Brooke Stephenson, Thomas Hueber, Laurent Girin, Laurent Besacier:
Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input. Interspeech 2021: 3865-3869 - [c77]Pierre-Amaury Grumiaux, Srdan Kitic, Prerak Srivastava, Laurent Girin, Alexandre Guérin:
Saladnet: Self-Attentive Multisource Localization in the Ambisonics Domain. WASPAA 2021: 336-340 - [i30]Pierre-Amaury Grumiaux, Srdan Kitic, Laurent Girin, Alexandre Guérin:
Multichannel CRNN for Speaker Counting: an Analysis of Performance. CoRR abs/2101.01977 (2021) - [i29]Brooke Stephenson, Thomas Hueber, Laurent Girin, Laurent Besacier:
Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input. CoRR abs/2102.09914 (2021) - [i28]Marc-Antoine Georges, Laurent Girin, Jean-Luc Schwartz, Thomas Hueber:
Learning robust speech representation with an articulatory-regularized variational autoencoder. CoRR abs/2104.03204 (2021) - [i27]Pierre-Amaury Grumiaux, Srdan Kitic, Laurent Girin, Alexandre Guérin:
Improved feature extraction for CRNN-based multiple sound source localization. CoRR abs/2105.01897 (2021) - [i26]Xiaoyu Bie, Laurent Girin, Simon Leglaive, Thomas Hueber, Xavier Alameda-Pineda:
A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling. CoRR abs/2106.06500 (2021) - [i25]Xiaoyu Bie, Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin:
Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders. CoRR abs/2106.12271 (2021) - [i24]Pierre-Amaury Grumiaux, Srdan Kitic, Prerak Srivastava, Laurent Girin, Alexandre Guérin:
SALADnet: Self-Attentive multisource Localization in the Ambisonics Domain. CoRR abs/2107.11066 (2021) - [i23]Pierre-Amaury Grumiaux, Srdan Kitic, Laurent Girin, Alexandre Guérin:
A Survey of Sound Source Localization with Deep Learning Methods. CoRR abs/2109.03465 (2021) - 2020
- [j29]Thomas Hueber, Eric Tatulli, Laurent Girin, Jean-Luc Schwartz:
Evaluating the Potential Gain of Auditory and Audiovisual Speech-Predictive Coding Using Deep Learning. Neural Comput. 32(3): 596-625 (2020) - [j28]Mostafa Sadeghi, Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud:
Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1788-1800 (2020) - [c76]Pierre-Amaury Grumiaux, Srdan Kitic, Laurent Girin, Alexandre Guérin:
High-Resolution Speaker Counting in Reverberant Rooms Using CRNN with Ambisonics Features. EUSIPCO 2020: 71-75 - [c75]Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud:
A Recurrent Variational Autoencoder for Speech Enhancement. ICASSP 2020: 371-375 - [c74]Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber:
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS. INTERSPEECH 2020: 215-219 - [i22]Pierre-Amaury Grumiaux, Srdjan Kitic, Laurent Girin, Alexandre Guérin:
High-Resolution Speaker Counting In Reverberant Rooms Using CRNN With Ambisonics Features. CoRR abs/2003.07839 (2020) - [i21]Laurent Girin, Simon Leglaive, Xiaoyu Bie, Julien Diard, Thomas Hueber, Xavier Alameda-Pineda:
Dynamical Variational Autoencoders: A Comprehensive Review. CoRR abs/2008.12595 (2020) - [i20]Brooke Stephenson, Laurent Besacier, Laurent Girin, Thomas Hueber:
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS. CoRR abs/2009.02035 (2020) - [i19]Xiaofei Li, Laurent Girin, Fabien Badeig, Radu Horaud:
Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function. CoRR abs/2012.03574 (2020)
2010 – 2019
- 2019
- [j27]Xiaofei Li, Laurent Girin, Radu Horaud:
Expectation-maximisation for speech source separation using convolutive transfer function. CAAI Trans. Intell. Technol. 4(1): 47-53 (2019) - [j26]Pierre Laffitte, Yun Wang, David Sodoyer, Laurent Girin:
Assessing the performances of different neural network architectures for the detection of screams and shouts in public transportation. Expert Syst. Appl. 117: 29-41 (2019) - [j25]Xiaofei Li, Yutong Ban, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud:
Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments. IEEE J. Sel. Top. Signal Process. 13(1): 88-103 (2019) - [j24]Xiaofei Li, Simon Leglaive, Laurent Girin, Radu Horaud:
Audio-Noise Power Spectral Density Estimation Using Long Short-Term Memory. IEEE Signal Process. Lett. 26(6): 918-922 (2019) - [j23]Xiaofei Li, Laurent Girin, Sharon Gannot, Radu Horaud:
Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function. IEEE ACM Trans. Audio Speech Lang. Process. 27(3): 645-659 (2019) - [j22]Xiaofei Li, Laurent Girin, Sharon Gannot, Radu Horaud:
Multichannel Online Dereverberation Based on Spectral Magnitude Inverse Filtering. IEEE ACM Trans. Audio Speech Lang. Process. 27(9): 1365-1377 (2019) - [c73]Raphael Frisch, Marvin Faix, Jacques Droulez, Laurent Girin, Emmanuel Mazer:
Bayesian time-domain multiple sound source localization for a stochastic machine. EUSIPCO 2019: 1-5 - [c72]Simon Leglaive, Laurent Girin, Radu Horaud:
Semi-supervised Multichannel Speech Enhancement with Variational Autoencoders and Non-negative Matrix Factorization. ICASSP 2019: 101-105 - [c71]Simon Leglaive, Umut Simsekli, Antoine Liutkus, Laurent Girin, Radu Horaud:
Speech Enhancement with Variational Autoencoders and Alpha-stable Distributions. ICASSP 2019: 541-545 - [c70]Xavier Alameda-Pineda, Soraya Arias, Yutong Ban, Guillaume Delorme, Laurent Girin, Radu Horaud, Xiaofei Li, Bastien Mourgue, Guillaume Sarrazin:
Audio-Visual Variational Fusion for Multi-Person Tracking with Robots. ACM Multimedia 2019: 1059-1061 - [i18]Simon Leglaive, Laurent Girin, Radu Horaud:
A variance modeling framework based on variational autoencoders for speech enhancement. CoRR abs/1902.01605 (2019) - [i17]Simon Leglaive, Umut Simsekli, Antoine Liutkus, Laurent Girin, Radu Horaud:
Speech enhancement with variational autoencoders and alpha-stable distributions. CoRR abs/1902.03926 (2019) - [i16]Xiaofei Li, Simon Leglaive, Laurent Girin, Radu Horaud:
Audio-noise Power Spectral Density Estimation Using Long Short-term Memory. CoRR abs/1904.05166 (2019) - [i15]Xiaofei Li, Laurent Girin, Radu Horaud:
Expectation-Maximization for Speech Source Separation Using Convolutive Transfer Function. CoRR abs/1904.05249 (2019) - [i14]Mostafa Sadeghi, Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud:
Audio-visual Speech Enhancement Using Conditional Variational Auto-Encoder. CoRR abs/1908.02590 (2019) - [i13]Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud:
A Recurrent Variational Autoencoder for Speech Enhancement. CoRR abs/1910.10942 (2019) - 2018
- [j21]Xiaofei Li, Sharon Gannot, Laurent Girin, Radu Horaud:
Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction Based on Convolutive Transfer Function. IEEE ACM Trans. Audio Speech Lang. Process. 26(10): 1755-1768 (2018) - [c69]Xiaofei Li, Sharon Gannot, Laurent Girin, Radu Horaud:
Multisource Mint Using Convolutive Transfer Function. ICASSP 2018: 756-760 - [c68]Yutong Ban, Xiaofei Li, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud:
Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking. ICASSP 2018: 6553-6557 - [c67]Xiaofei Li, Bastien Mourgue, Laurent Girin, Sharon Gannot, Radu Horaud:
Online Localization of Multiple Moving Speakers in Reverberant Environments. SAM 2018: 405-409 - [c66]Simon Leglaive, Laurent Girin, Radu Horaud:
A variance Modeling Framework based on variational Autoencoders for speech enhancement. MLSP 2018: 1-6 - [i12]Fanny Roche, Thomas Hueber, Samuel Limier, Laurent Girin:
Autoencoders for music sound synthesis: a comparison of linear, shallow, deep and variational models. CoRR abs/1806.04096 (2018) - [i11]Xiaofei Li, Yutong Ban, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud:
Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments. CoRR abs/1809.10936 (2018) - [i10]Yutong Ban, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud:
Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers. CoRR abs/1809.10961 (2018) - [i9]Simon Leglaive, Laurent Girin, Radu Horaud:
Semi-supervised multichannel speech enhancement with variational autoencoders and non-negative matrix factorization. CoRR abs/1811.06713 (2018) - [i8]Xiaofei Li, Yutong Ban, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud:
A cascaded multiple-speaker localization and tracking system. CoRR abs/1812.04417 (2018) - [i7]Xiaofei Li, Laurent Girin, Sharon Gannot, Radu Horaud:
Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering. CoRR abs/1812.08471 (2018) - 2017
- [j20]Diandra Fabre, Thomas Hueber, Laurent Girin, Xavier Alameda-Pineda, Pierre Badin:
Automatic animation of an articulatory tongue model from ultrasound images of the vocal tract. Speech Commun. 93: 63-75 (2017) - [j19]Laurent Girin, Thomas Hueber, Xavier Alameda-Pineda:
Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping. IEEE ACM Trans. Audio Speech Lang. Process. 25(3): 662-673 (2017) - [j18]Xiaofei Li, Laurent Girin, Radu Horaud, Sharon Gannot:
Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization With Spatial Sparsity Regularization. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1997-2012 (2017) - [c65]Laurent Girin, Roland Badeau:
On the Use of Latent Mixing Filters in Audio Source Separation. LVA/ICA 2017: 225-235 - [c64]Laurent Girin, Thomas Hueber, Xavier Alameda-Pineda:
Adaptation of a Gaussian Mixture Regressor to a New Input Distribution: Extending the C-GMR Framework. LVA/ICA 2017: 459-468 - [c63]Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud:
An EM algorithm for joint source separation and diarisation of multichannel convolutive speech mixtures. ICASSP 2017: 16-20 - [c62]Xiaofei Li, Laurent Girin, Radu Horaud:
Audio source separation based on convolutive transfer function and frequency-domain lasso optimization. ICASSP 2017: 541-545 - [c61]Yutong Ban, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud:
Exploiting the Complementarity of Audio and Visual Data in Multi-speaker Tracking. ICCV Workshops 2017: 446-454 - [c60]Raphael Frisch, Raphaël Laurent, Marvin Faix, Laurent Girin, Laurent Fesquet, Augustin Lux, Jacques Droulez, Pierre Bessière, Emmanuel Mazer:
A Bayesian Stochastic Machine for Sound Source Localization. ICRC 2017: 1-8 - [c59]Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Radu Horaud, Sharon Gannot:
Exploiting the intermittency of speech for joint separation and diarization. WASPAA 2017: 41-45 - [c58]Mathieu Fontaine, Antoine Liutkus, Laurent Girin, Roland Badeau:
Explaining the parameterized wiener filter with alpha-stable processes. WASPAA 2017: 51-55 - [c57]Xiaofei Li, Laurent Girin, Radu Horaud:
An em algorithm for audio source separation based on the convolutive transfer function. WASPAA 2017: 56-60 - [i6]Xiaofei Li, Laurent Girin, Sharon Gannot, Radu Horaud:
Multichannel Source Separation and Speech Enhancement Using the Convolutive Transfer Function. CoRR abs/1711.07911 (2017) - 2016
- [j17]Florent Bocquelet, Thomas Hueber, Laurent Girin, Christophe Savariaux, Blaise Yvert:
Real-Time Control of an Articulatory-Based Speech Synthesizer for Brain Computer Interfaces. PLoS Comput. Biol. 12(11) (2016) - [j16]Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud:
A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures. IEEE ACM Trans. Audio Speech Lang. Process. 24(8): 1408-1423 (2016) - [j15]Xiaofei Li, Laurent Girin, Radu Horaud, Sharon Gannot:
Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 2171-2186 (2016) - [c56]Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud:
An inverse-gamma source variance prior with factorized parameterization for audio source separation. ICASSP 2016: 136-140 - [c55]Xiaofei Li, Laurent Girin, Sharon Gannot, Radu Horaud:
Non-stationary noise power spectral density estimation based on regional statistics. ICASSP 2016: 181-185 - [c54]Pierre Laffitte, David Sodoyer, Charles Tatkeu, Laurent Girin:
Deep neural networks for automatic detection of screams and shouted speech in subway trains. ICASSP 2016: 6460-6464 - [c53]Xiaofei Li, Laurent Girin, Fabien Badeig, Radu Horaud:
Reverberant sound localization with a robot head based on direct-path relative transfer function. IROS 2016: 2819-2826 - [c52]Xiaofei Li, Radu Horaud, Laurent Girin, Sharon Gannot:
Voice activity detection based on statistical likelihood ratio with adaptive thresholding. IWAENC 2016: 1-5 - [i5]Xiaofei Li, Laurent Girin, Sharon Gannot, Radu Horaud:
Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization with Spatial Sparsity Regularization. CoRR abs/1611.01172 (2016) - 2015
- [j14]Antoine Deleforge, Radu Horaud, Yoav Y. Schechner, Laurent Girin:
Co-Localization of Audio Sources in Images Using Binaural Features and Locally-Linear Regression. IEEE ACM Trans. Audio Speech Lang. Process. 23(4): 718-731 (2015) - [j13]Thomas Hueber, Laurent Girin, Xavier Alameda-Pineda, Gérard Bailly:
Speaker-Adaptive Acoustic-Articulatory Inversion Using Cascaded Gaussian Mixture Regression. IEEE ACM Trans. Audio Speech Lang. Process. 23(12): 2246-2259 (2015) - [c51]Xiaofei Li, Radu Horaud, Laurent Girin, Sharon Gannot:
Local relative transfer function for sound source localization. EUSIPCO 2015: 399-403 - [c50]Xiaofei Li, Laurent Girin, Radu Horaud, Sharon Gannot:
Estimation of relative transfer function in the presence of stationary noise based on segmental power spectral density matrix subtraction. ICASSP 2015: 320-324 - [c49]Florent Bocquelet, Thomas Hueber, Laurent Girin, Christophe Savariaux, Blaise Yvert:
Real-time control of a DNN-based articulatory synthesizer for silent speech conversion: a pilot study. INTERSPEECH 2015: 2405-2409 - [c48]Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud:
A variational EM algorithm for the separation of moving sound sources. WASPAA 2015: 1-5 - [i4]Xiaofei Li, Laurent Girin, Radu Horaud, Sharon Gannot:
Binaural Sound Source Localization based on Direct-Path Relative Transfer Function. CoRR abs/1509.03205 (2015) - [i3]Dionyssos Kounades-Bastian, Laurent Girin, Xavier Alameda-Pineda, Sharon Gannot, Radu Horaud:
A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures. CoRR abs/1510.04595 (2015) - 2014
- [c47]Serap Kirbiz, Alexey Ozerov, Antoine Liutkus, Laurent Girin:
Perceptual coding-based Informed Source Separation. EUSIPCO 2014: 959-963 - [c46]Antoine Deleforge, Vincent Drouard, Laurent Girin, Radu Horaud:
Mapping sounds onto images using binaural spectrograms. EUSIPCO 2014: 2470-2474 - [c45]Maxime Janvier, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud:
Sound representation and classification benchmark for domestic robots. ICRA 2014: 6285-6292 - [c44]Florent Bocquelet, Thomas Hueber, Laurent Girin, Pierre Badin, Blaise Yvert:
Robust articulatory speech synthesis using deep neural networks for BCI applications. INTERSPEECH 2014: 2288-2292 - [i2]Maxime Janvier, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud:
Sound Representation and Classification Benchmark for Domestic Robots. CoRR abs/1402.3689 (2014) - [i1]Antoine Deleforge, Radu Horaud, Yoav Y. Schechner, Laurent Girin:
Co-Localization of Audio Sources Using Binaural Features and Locally-Linear Regression. CoRR abs/1408.2700 (2014) - 2013
- [j12]Shuhua Zhang, Laurent Girin:
Fast and Accurate Direct MDCT to DFT Conversion With Arbitrary Window Functions. IEEE Trans. Speech Audio Process. 21(3): 567-578 (2013) - [c43]Shuhua Zhang, Laurent Girin, Antoine Liutkus:
Informed Source Separation from compressed mixtures using spatial wiener filter and quantization noise estimation. ICASSP 2013: 61-65 - 2012
- [j11]Antoine Liutkus, Jonathan Pinel, Roland Badeau, Laurent Girin, Gaël Richard:
Informed source separation through spectrogram coding and data embedding. Signal Process. 92(8): 1937-1949 (2012) - [c42]Antoine Liutkus, Stanislaw Gorlow, Nicolas Sturmel, Shuhua Zhang, Laurent Girin, Roland Badeau, Laurent Daudet, Sylvain Marchand, Gaël Richard:
Informed audio source separation: A comparative study. EUSIPCO 2012: 2397-2401 - [c41]Maxime Janvier, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud:
Sound-event recognition with a companion humanoid. Humanoids 2012: 104-111 - [c40]Frédéric Berthommier, Laurent Girin, Louis-Jean Boë:
A Simple Hybrid Acoustic / Morphologically-Constrained Technique for the Synthesis of Stop Consonants in Various Vocalic Contexts. INTERSPEECH 2012: 2542-2545 - [c39]Timothée Gerber, Martin Dutasta, Laurent Girin, Cédric Févotte:
Professionally-produced Music Separation Guided by Covers. ISMIR 2012: 85-90 - 2011
- [j10]Mathieu Parvaix, Laurent Girin:
Informed Source Separation of Linear Instantaneous Under-Determined Audio Mixtures by Source Index Embedding. IEEE Trans. Speech Audio Process. 19(6): 1721-1733 (2011) - [c38]Faten Ben Ali, Laurent Girin, Sonia Djaziri Larbi:
A Long-Term Harmonic Plus Noise Model for Speech Signals. INTERSPEECH 2011: 53-56 - [c37]Shuhua Zhang, Laurent Girin:
An Informed Source Separation System for Speech Signals. INTERSPEECH 2011: 573-576 - [c36]Laurent Girin, Jonathan Pinel:
Informed Audio Source Separation from Compressed Linear Stereo Mixtures. Semantic Audio 2011 - [c35]Jonathan Pinel, Laurent Girin:
"Sparsification" of Audio Signals Using the MDCT/IntMDCT and a Psychoacoustic Model - Application to Informed Audio Source Separation. Semantic Audio 2011 - 2010
- [j9]Laurent Girin:
Adaptive Long-Term Coding of LSF Parameters Trajectories for Large-Delay/Very- to Ultra-Low Bit-Rate Speech Coding. EURASIP J. Audio Speech Music. Process. 2010 (2010) - [j8]Mathieu Parvaix, Laurent Girin, Jean-Marc Brossier:
A Watermarking-Based Method for Informed Source Separation of Audio Signals With a Single Sensor. IEEE Trans. Speech Audio Process. 18(6): 1464-1475 (2010) - [c34]Sylvain Marchand, Boris Mansencal, Laurent Girin:
Interactive Music with Active Audio CDs. CMMR 2010: 31-50 - [c33]Mathieu Parvaix, Laurent Girin:
Informed source separation of underdetermined instantaneous stereo mixtures using source index embedding. ICASSP 2010: 245-248
2000 – 2009
- 2009
- [c32]Mathieu Parvaix, Laurent Girin, Jean-Marc Brossier:
A watermarking-based method for single-channel audio source separation. ICASSP 2009: 101-104 - 2008
- [c31]Mohammad Firouzmand, Laurent Girin:
Long-term flexible 2D cepstral modeling of speech spectral amplitudes. ICASSP 2008: 3937-3940 - [c30]Kris Hermus, Laurent Girin, Hugo Van hamme, Sufian Irhimeh:
Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies. ICASSP 2008: 4473-4476 - 2007
- [j7]Bertrand Rivet, Laurent Girin, Christian Jutten:
Visual voice activity detection as a help for speech source separation from convolutive mixtures. Speech Commun. 49(7-8): 667-677 (2007) - [j6]Bertrand Rivet, Laurent Girin, Christian Jutten:
Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures. IEEE Trans. Speech Audio Process. 15(1): 96-108 (2007) - [j5]Bertrand Rivet, Laurent Girin, Christian Jutten:
Log-Rayleigh Distribution: A Simple and Efficient Statistical Representation of Log-Spectral Coefficients. IEEE Trans. Speech Audio Process. 15(3): 796-802 (2007) - [j4]Laurent Girin, Mohammad Firouzmand, Sylvain Marchand:
Perceptual Long-Term Variable-Rate Sinusoidal Modeling of Speech. IEEE Trans. Speech Audio Process. 15(3): 851-861 (2007) - [c29]Bertrand Rivet, Laurent Girin, Christine Servière, Dinh-Tuan Pham, Christian Jutten:
Audiovisual speech source separation: a regularization method based on visual voice activity detection. AVSP 2007: 7 - [c28]Bertrand Rivet, Andrew J. Aubrey, Laurent Girin, Yulia Hicks, Christian Jutten, Jonathon A. Chambers:
Development and comparison of two approaches for visual speech analysis with application to voice activity detection. AVSP 2007: 14 - [c27]Andrew J. Aubrey, Bertrand Rivet, Yulia Hicks, Laurent Girin, Jonathon A. Chambers, Christian Jutten:
Two novel visual voice activity detectors based on appearance models and retinal filtering. EUSIPCO 2007: 2409-2413 - [c26]Laurent Girin:
Long-Term Quantization of Speech LSF Parameters. ICASSP (4) 2007: 845-848 - [c25]Bertrand Rivet, Laurent Girin, Christine Servière, Dinh-Tuan Pham, Christian Jutten:
Using a Visual Voice Activity Detector to Regularize the Permutations in Blind Separation of Convolutive Speech Mixtures. DSP 2007: 223-226 - 2006
- [c24]Laurent Girin:
Theoretical and experimental bases of a new method for accurate separation of harmonic and noise components of speech signals. EUSIPCO 2006: 1-5 - [c23]David Sodoyer, Bertrand Rivet, Laurent Girin, Jean-Luc Schwartz, Christian Jutten:
An Analysis of Visual Speech Information Applied to Voice Activity Detection. ICASSP (1) 2006: 601-604 - 2005
- [c22]Mohammad Firouzmand, Laurent Girin:
Perceptually Weighted Long Term Modeling of Sinusoidal Speech Amplitude Trajectories. ICASSP (1) 2005: 369-372 - [c21]Bertrand Rivet, Laurent Girin, Christian Jutten:
Solving the indeterminations of blind source separation of convolutive speech mixtures. ICASSP (5) 2005: 533-536 - [c20]Mohammad Firouzmand, Laurent Girin, Sylvain Marchand:
Comparing several models for perceptual long-term modeling of amplitude and phase trajectories of sinusoidal speech. INTERSPEECH 2005: 357-360 - 2004
- [j3]David Sodoyer, Laurent Girin, Christian Jutten, Jean-Luc Schwartz:
Developing an audio-visual speech source separation algorithm. Speech Commun. 44(1-4): 113-125 (2004) - [j2]Laurent Girin:
Joint matrix quantization of face parameters and LPC coefficients for low bit rate audiovisual speech coding. IEEE Trans. Speech Audio Process. 12(3): 265-276 (2004) - [c19]Laurent Girin, Sylvain Marchand:
Watermarking of speech signals using the sinusoidal model and frequency modulation of the partials. ICASSP (1) 2004: 633-636 - [c18]Denis Beautemps, Thomas Burger, Laurent Girin:
Characterizing and classifying cued speech vowels from labial parameters. INTERSPEECH 2004: 1861-1864 - [c17]Laurent Girin, Mohammad Firouzmand, Sylvain Marchand:
Long term modeling of phase trajectories within the speech sinusoidal model framework. INTERSPEECH 2004: 2469-2472 - [c16]Bertrand Rivet, Laurent Girin, Christian Jutten, Jean-Luc Schwartz:
Using audiovisual speech processing to improve the robustness of the separation of convolutive speech mixtures. MMSP 2004: 47-50 - 2003
- [c15]Laurent Girin:
Pure audio McGurk effect. AVSP 2003: 139-144 - [c14]David Sodoyer, Laurent Girin, Christian Jutten, Jean-Luc Schwartz:
Further experiments on audio-visual speech source separation. AVSP 2003: 145-150 - [c13]David Sodoyer, Laurent Girin, Christian Jutten, Jean-Luc Schwartz:
Extracting an AV speech source from a mixture of signals. INTERSPEECH 2003: 1393-1396 - [c12]David Sodoyer, Laurent Girin, Christian Jutten, Jean-Luc Schwartz:
Speech extraction based on ICA and audio-visual coherence. ISSPA (2) 2003: 65-68 - 2002
- [j1]David Sodoyer, Jean-Luc Schwartz, Laurent Girin, Jacob Klinkisch, Christian Jutten:
Separation of Audio-Visual Speech Sources: A New Approach Exploiting the Audio-Visual Coherence of Speech Stimuli. EURASIP J. Adv. Signal Process. 2002(11): 1165-1173 (2002) - [c11]David Sodoyer, Laurent Girin, Christian Jutten, Jean-Luc Schwartz:
Audio-visual speech sources separation: a new approach exploiting the audio-visual coherence of speech stimuli. INTERSPEECH 2002: 1953-1956 - 2001
- [c10]Laurent Girin, A. Allard, Jean-Luc Schwartz:
Speech signals separation: a new approach exploiting the coherence of audio and visual speech. MMSP 2001: 631-636
1990 – 1999
- 1998
- [c9]Elodie Foucher, Laurent Girin, Gang Feng:
Audiovisual Speech Coder : Using Vector Quantization To Exploit The Audio/Video Correlation. AVSP 1998: 67-72 - [c8]Elodie Foucher, Gang Feng, Laurent Girin:
A preliminary study of an audio-visual speech coder: Using video parameters to reduce an LPC vocoder bit rate. EUSIPCO 1998: 1-4 - [c7]Laurent Girin, Gang Feng, Jean-Luc Schwartz:
Fusion of auditory and visual information for noisy speech enhancement: a preliminary study of vowel transitions. ICASSP 1998: 1005-1008 - [c6]Laurent Girin, Laurent Varin, Gang Feng, Jean-Luc Schwartz:
A signal processing system for having the sound "pop-out" in noise thanks to the image of the speaker's lips: new advances using multi-layer perceptrons. ICSLP 1998 - [c5]Laurent Girin, Laurent Varin, Gang Feng, Jean-Luc Schwartz:
Audiovisual speech enhancement: new advances using multi-layer perceptrons. MMSP 1998: 77-82 - [c4]Laurent Girin, Elodie Foucher, Gang Feng:
An audio-visual distance for audio-visual speech vector quantization. MMSP 1998: 523-528 - 1997
- [c3]Laurent Girin, Jean-Luc Schwartz, Gang Feng:
Can the visual input make the audio signal "pop out" in noise ? a first study of the enhancement of noisy VCV acoustic sequences by audio-visual fusion. AVSP 1997: 37-40 - [c2]Laurent Girin, Gang Feng, Jean-Luc Schwartz:
Noisy speech enhancement by fusion of auditory and visual information: a study of vowel transitions. EUROSPEECH 1997: 2555-2558 - 1995
- [c1]Laurent Girin, Gang Feng, Jean-Luc Schwartz:
Noisy speech enhancement with filters estimated from the speaker's lips. EUROSPEECH 1995: 1559-1562
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-26 23:42 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint