Neil Zeghidour
2020 – today
- 2024
- [c26] Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli: MusicRL: Aligning Music Generation to Human Preferences. ICML 2024
- [i34] Geoffrey Cideron, Sertan Girgin, Mauro Verzetti, Damien Vincent, Matej Kastelic, Zalán Borsos, Brian McWilliams, Victor Ungureanu, Olivier Bachem, Olivier Pietquin, Matthieu Geist, Léonard Hussenot, Neil Zeghidour, Andrea Agostinelli: MusicRL: Aligning Music Generation to Human Preferences. CoRR abs/2402.04229 (2024)
- [i33] Matthieu Futeral, Andrea Agostinelli, Marco Tagliasacchi, Neil Zeghidour, Eugene Kharitonov: MAD Speech: Measures of Acoustic Diversity of Speech. CoRR abs/2404.10419 (2024)
- 2023
- [j5] Eugene Kharitonov, Damien Vincent, Zalán Borsos, Raphaël Marinier, Sertan Girgin, Olivier Pietquin, Matt Sharifi, Marco Tagliasacchi, Neil Zeghidour: Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision. Trans. Assoc. Comput. Linguistics 11: 1703-1718 (2023)
- [j4] Zalán Borsos, Raphaël Marinier, Damien Vincent, Eugene Kharitonov, Olivier Pietquin, Matthew Sharifi, Dominik Roblek, Olivier Teboul, David Grangier, Marco Tagliasacchi, Neil Zeghidour: AudioLM: A Language Modeling Approach to Audio Generation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2523-2533 (2023)
- [c25] Teerapat Jenrungrot, Michael Chinen, W. Bastiaan Kleijn, Jan Skoglund, Zalán Borsos, Neil Zeghidour, Marco Tagliasacchi: LMCodec: A Low Bitrate Speech Codec with Causal Transformer Models. ICASSP 2023: 1-5
- [c24] Ahmed Omran, Neil Zeghidour, Zalán Borsos, Félix de Chaumont Quitry, Malcolm Slaney, Marco Tagliasacchi: Disentangling Speech from Surroundings with Neural Embeddings. ICASSP 2023: 1-5
- [c23] Subhashini Venugopalan, Jimmy Tobin, Samuel J. Yang, Katie Seaver, Richard J. N. Cave, Pan-Pan Jiang, Neil Zeghidour, Rus Heywood, Jordan R. Green, Michael P. Brenner: Speech Intelligibility Classifiers from 550k Disordered Speech Samples. ICASSP 2023: 1-5
- [c22] Othmane-Latif Ouabi, Neil Zeghidour, Nico F. Declercq, Matthieu Geist, Cédric Pradalier: Pose-graph SLAM Using Multi-order Ultrasonic Echoes and Beamforming for Long-range Inspection Robots. ICRA 2023: 10623-10629
- [c21] Hakan Erdogan, Scott Wisdom, Xuankai Chang, Zalán Borsos, Marco Tagliasacchi, Neil Zeghidour, John R. Hershey: TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition. INTERSPEECH 2023: 3462-3466
- [i32] Andrea Agostinelli, Timo I. Denk, Zalán Borsos, Jesse H. Engel, Mauro Verzetti, Antoine Caillon, Qingqing Huang, Aren Jansen, Adam Roberts, Marco Tagliasacchi, Matthew Sharifi, Neil Zeghidour, Christian Havnø Frank: MusicLM: Generating Music From Text. CoRR abs/2301.11325 (2023)
- [i31] Chris Donahue, Antoine Caillon, Adam Roberts, Ethan Manilow, Philippe Esling, Andrea Agostinelli, Mauro Verzetti, Ian Simon, Olivier Pietquin, Neil Zeghidour, Jesse H. Engel: SingSong: Generating musical accompaniments from singing. CoRR abs/2301.12662 (2023)
- [i30] Eugene Kharitonov, Damien Vincent, Zalán Borsos, Raphaël Marinier, Sertan Girgin, Olivier Pietquin, Matthew Sharifi, Marco Tagliasacchi, Neil Zeghidour: Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision. CoRR abs/2302.03540 (2023)
- [i29] David W. Romero, Neil Zeghidour: DNArch: Learning Convolutional Neural Architectures by Backpropagation. CoRR abs/2302.05400 (2023)
- [i28] Subhashini Venugopalan, Jimmy Tobin, Samuel J. Yang, Katie Seaver, Richard J. N. Cave, Pan-Pan Jiang, Neil Zeghidour, Rus Heywood, Jordan R. Green, Michael P. Brenner: Speech Intelligibility Classifiers from 550k Disordered Speech Samples. CoRR abs/2303.07533 (2023)
- [i27] Teerapat Jenrungrot, Michael Chinen, W. Bastiaan Kleijn, Jan Skoglund, Zalán Borsos, Neil Zeghidour, Marco Tagliasacchi: LMCodec: A Low Bitrate Speech Codec With Causal Transformer Models. CoRR abs/2303.12984 (2023)
- [i26] Zalán Borsos, Matthew Sharifi, Damien Vincent, Eugene Kharitonov, Neil Zeghidour, Marco Tagliasacchi: SoundStorm: Efficient Parallel Audio Generation. CoRR abs/2305.09636 (2023)
- [i25] Paul K. Rubenstein, Chulayuth Asawaroengchai, Duc Dung Nguyen, Ankur Bapna, Zalán Borsos, Félix de Chaumont Quitry, Peter Chen, Dalia El Badawy, Wei Han, Eugene Kharitonov, Hannah Muckenhirn, Dirk Padfield, James Qin, Danny Rozenberg, Tara N. Sainath, Johan Schalkwyk, Matthew Sharifi, Michelle Tadmor Ramanovich, Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirovic, Damien Vincent, Jiahui Yu, Yongqiang Wang, Vicky Zayats, Neil Zeghidour, Yu Zhang, Zhishuai Zhang, Lukas Zilka, Christian Havnø Frank: AudioPaLM: A Large Language Model That Can Speak and Listen. CoRR abs/2306.12925 (2023)
- [i24] Hakan Erdogan, Scott Wisdom, Xuankai Chang, Zalán Borsos, Marco Tagliasacchi, Neil Zeghidour, John R. Hershey: TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition. CoRR abs/2308.10415 (2023)
- 2022
- [j3] Neil Zeghidour, Alejandro Luebs, Ahmed Omran, Jan Skoglund, Marco Tagliasacchi: SoundStream: An End-to-End Neural Audio Codec. IEEE ACM Trans. Audio Speech Lang. Process. 30: 495-507 (2022)
- [c20] Othmane-Latif Ouabi, Jiawei Yi, Neil Zeghidour, Nico F. Declercq, Matthieu Geist, Cédric Pradalier: Polygonal Shapes Reconstruction from Acoustic Echoes Using a Mobile Sensor and Beamforming. EUSIPCO 2022: 1507-1511
- [c19] Rachid Riad, Olivier Teboul, David Grangier, Neil Zeghidour: Learning Strides in Convolutional Neural Networks. ICLR 2022
- [c18] Curtis Hawthorne, Andrew Jaegle, Catalina Cangea, Sebastian Borgeaud, Charlie Nash, Mateusz Malinowski, Sander Dieleman, Oriol Vinyals, Matthew M. Botvinick, Ian Simon, Hannah Sheahan, Neil Zeghidour, Jean-Baptiste Alayrac, João Carreira, Jesse H. Engel: General-purpose, long-context autoregressive modeling with Perceiver AR. ICML 2022: 8535-8558
- [c17] Othmane-Latif Ouabi, Ayoub Ridani, Pascal Pomarede, Neil Zeghidour, Nico F. Declercq, Matthieu Geist, Cédric Pradalier: Combined Grid and Feature-based Mapping of Metal Structures with Ultrasonic Guided Waves. ICRA 2022: 5056-5062
- [c16] Sarthak Yadav, Neil Zeghidour: Learning neural audio features without supervision. INTERSPEECH 2022: 396-400
- [c15] Curtis Hawthorne, Ian Simon, Adam Roberts, Neil Zeghidour, Josh Gardner, Ethan Manilow, Jesse H. Engel: Multi-instrument Music Synthesis with Spectrogram Diffusion. ISMIR 2022: 598-607
- [i23] Rachid Riad, Olivier Teboul, David Grangier, Neil Zeghidour: Learning strides in convolutional neural networks. CoRR abs/2202.01653 (2022)
- [i22] Curtis Hawthorne, Andrew Jaegle, Catalina Cangea, Sebastian Borgeaud, Charlie Nash, Mateusz Malinowski, Sander Dieleman, Oriol Vinyals, Matthew M. Botvinick, Ian Simon, Hannah Sheahan, Neil Zeghidour, Jean-Baptiste Alayrac, João Carreira, Jesse H. Engel: General-purpose, long-context autoregressive modeling with Perceiver AR. CoRR abs/2202.07765 (2022)
- [i21] Sarthak Yadav, Neil Zeghidour: Learning neural audio features without supervision. CoRR abs/2203.15519 (2022)
- [i20] Ahmed Omran, Neil Zeghidour, Zalán Borsos, Félix de Chaumont Quitry, Malcolm Slaney, Marco Tagliasacchi: Disentangling speech from surroundings in a neural audio codec. CoRR abs/2203.15578 (2022)
- [i19] Curtis Hawthorne, Ian Simon, Adam Roberts, Neil Zeghidour, Josh Gardner, Ethan Manilow, Jesse H. Engel: Multi-instrument Music Synthesis with Spectrogram Diffusion. CoRR abs/2206.05408 (2022)
- [i18] Zalán Borsos, Raphaël Marinier, Damien Vincent, Eugene Kharitonov, Olivier Pietquin, Matthew Sharifi, Olivier Teboul, David Grangier, Marco Tagliasacchi, Neil Zeghidour: AudioLM: a Language Modeling Approach to Audio Generation. CoRR abs/2209.03143 (2022)
- 2021
- [j2] Andrew N. Carr, Quentin Berthet, Mathieu Blondel, Olivier Teboul, Neil Zeghidour: Self-Supervised Learning of Audio Representations From Permutations With Differentiable Ranking. IEEE Signal Process. Lett. 28: 708-712 (2021)
- [j1] Neil Zeghidour, David Grangier: Wavesplit: End-to-End Speech Separation by Speaker Clustering. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2840-2849 (2021)
- [c14] Neil Zeghidour, Olivier Teboul, David Grangier: Dive: End-to-End Speech Diarization Via Iterative Speaker Embedding. ASRU 2021: 702-709
- [c13] Aaqib Saeed, David Grangier, Olivier Pietquin, Neil Zeghidour: Learning From Heterogeneous Eeg Signals with Differentiable Channel Reordering. ICASSP 2021: 1255-1259
- [c12] Aaqib Saeed, David Grangier, Neil Zeghidour: Contrastive Learning of General-Purpose Audio Representations. ICASSP 2021: 3875-3879
- [c11] Neil Zeghidour, Olivier Teboul, Félix de Chaumont Quitry, Marco Tagliasacchi: LEAF: A Learnable Frontend for Audio Classification. ICLR 2021
- [i17] Neil Zeghidour, Olivier Teboul, Félix de Chaumont Quitry, Marco Tagliasacchi: LEAF: A Learnable Frontend for Audio Classification. CoRR abs/2101.08596 (2021)
- [i16] Andrew N. Carr, Quentin Berthet, Mathieu Blondel, Olivier Teboul, Neil Zeghidour: Self-Supervised Learning of Audio Representations from Permutations with Differentiable Ranking. CoRR abs/2103.09879 (2021)
- [i15] Neil Zeghidour, Olivier Teboul, David Grangier: DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding. CoRR abs/2105.13802 (2021)
- [i14] Neil Zeghidour, Alejandro Luebs, Ahmed Omran, Jan Skoglund, Marco Tagliasacchi: SoundStream: An End-to-End Neural Audio Codec. CoRR abs/2107.03312 (2021)
- 2020
- [i13] Neil Zeghidour, David Grangier: Wavesplit: End-to-End Speech Separation by Speaker Clustering. CoRR abs/2002.08933 (2020)
- [i12] Aaqib Saeed, David Grangier, Neil Zeghidour: Contrastive Learning of General-Purpose Audio Representations. CoRR abs/2010.10915 (2020)
- [i11] Aaqib Saeed, David Grangier, Olivier Pietquin, Neil Zeghidour: Learning from Heterogeneous EEG Signals with Differentiable Channel Reordering. CoRR abs/2010.13694 (2020)
2010 – 2019
- 2019
- [b1] Neil Zeghidour: Learning representations of speech from the raw waveform. (Apprentissage de représentations de la parole à partir du signal brut). PSL Research University, Paris, France, 2019
- [c10] Yossi Adi, Neil Zeghidour, Ronan Collobert, Nicolas Usunier, Vitaliy Liptchinsky, Gabriel Synnaeve: To Reverse the Gradient or Not: an Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition. ICASSP 2019: 3742-3746
- [c9] Juliette Millet, Neil Zeghidour: Learning to Detect Dysarthria from Raw Speech. ICASSP 2019: 5831-5835
- [i10] Gabriel Dulac-Arnold, Neil Zeghidour, Marco Cuturi, Lucas Beyer, Jean-Philippe Vert: Deep multi-class learning from label proportions. CoRR abs/1905.12909 (2019)
- 2018
- [c8] Neil Zeghidour, Nicolas Usunier, Iasonas Kokkinos, Thomas Schatz, Gabriel Synnaeve, Emmanuel Dupoux: Learning Filterbanks from Raw Speech for Phone Recognition. ICASSP 2018: 5509-5513
- [c7] Neil Zeghidour, Nicolas Usunier, Gabriel Synnaeve, Ronan Collobert, Emmanuel Dupoux: End-to-End Speech Recognition from the Raw Waveform. INTERSPEECH 2018: 781-785
- [c6] Rachid Riad, Corentin Dancette, Julien Karadayi, Neil Zeghidour, Thomas Schatz, Emmanuel Dupoux: Sampling Strategies in Siamese Networks for Unsupervised Speech Representation Learning. INTERSPEECH 2018: 2658-2662
- [c5] Alexandre Défossez, Neil Zeghidour, Nicolas Usunier, Léon Bottou, Francis R. Bach: SING: Symbol-to-Instrument Neural Generator. NeurIPS 2018: 9055-9065
- [i9] Rachid Riad, Corentin Dancette, Julien Karadayi, Neil Zeghidour, Thomas Schatz, Emmanuel Dupoux: Sampling strategies in Siamese Networks for unsupervised speech representation learning. CoRR abs/1804.11297 (2018)
- [i8] Neil Zeghidour, Nicolas Usunier, Gabriel Synnaeve, Ronan Collobert, Emmanuel Dupoux: End-to-End Speech Recognition From the Raw Waveform. CoRR abs/1806.07098 (2018)
- [i7] Alexandre Défossez, Neil Zeghidour, Nicolas Usunier, Léon Bottou, Francis R. Bach: SING: Symbol-to-Instrument Neural Generator. CoRR abs/1810.09785 (2018)
- [i6] Juliette Millet, Neil Zeghidour: Learning to detect dysarthria from raw speech. CoRR abs/1811.11101 (2018)
- [i5] Yossi Adi, Neil Zeghidour, Ronan Collobert, Nicolas Usunier, Vitaliy Liptchinsky, Gabriel Synnaeve: To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition. CoRR abs/1812.03483 (2018)
- [i4] Neil Zeghidour, Qiantong Xu, Vitaliy Liptchinsky, Nicolas Usunier, Gabriel Synnaeve, Ronan Collobert: Fully Convolutional Speech Recognition. CoRR abs/1812.06864 (2018)
- 2017
- [c4] Rahma Chaabouni, Ewan Dunbar, Neil Zeghidour, Emmanuel Dupoux: Learning Weakly Supervised Multimodal Phoneme Embeddings. INTERSPEECH 2017: 2218-2222
- [c3] Guillaume Lample, Neil Zeghidour, Nicolas Usunier, Antoine Bordes, Ludovic Denoyer, Marc'Aurelio Ranzato: Fader Networks: Manipulating Images by Sliding Attributes. NIPS 2017: 5967-5976
- [i3] Rahma Chaabouni, Ewan Dunbar, Neil Zeghidour, Emmanuel Dupoux: Learning weakly supervised multimodal phoneme embeddings. CoRR abs/1704.06913 (2017)
- [i2] Guillaume Lample, Neil Zeghidour, Nicolas Usunier, Antoine Bordes, Ludovic Denoyer, Marc'Aurelio Ranzato: Fader Networks: Manipulating Images by Sliding Attributes. CoRR abs/1706.00409 (2017)
- [i1] Neil Zeghidour, Nicolas Usunier, Iasonas Kokkinos, Thomas Schatz, Gabriel Synnaeve, Emmanuel Dupoux: Learning Filterbanks from Raw Speech for Phone Recognition. CoRR abs/1711.01161 (2017)
- 2016
- [c2] Neil Zeghidour, Gabriel Synnaeve, Maarten Versteegh, Emmanuel Dupoux: A deep scattering spectrum - Deep Siamese network pipeline for unsupervised acoustic modeling. ICASSP 2016: 4965-4969
- [c1] Neil Zeghidour, Gabriel Synnaeve, Nicolas Usunier, Emmanuel Dupoux: Joint Learning of Speaker and Phonetic Similarities with Siamese Networks. INTERSPEECH 2016: 1295-1299
last updated on 2024-09-04 00:29 CEST by the dblp team
all metadata released as open data under CC0 1.0 license