default search action

combined dblp search
author search
venue search
publication search

ask others

Yossi Adi

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j8]
- view
  - electronic edition @ jmlr.org (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/jmlr/PratapTSTBKENVF24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/PratapTSTBKENVF24
Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli:
Scaling Speech Technology to 1, 000+ Languages. J. Mach. Learn. Res. 25: 97:1-97:52 (2024)
[c57]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/YarivGBWSA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/YarivGBWSA24
Guy Yariv, Itai Gat, Sagie Benaim, Lior Wolf, Idan Schwartz, Yossi Adi:
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation. AAAI 2024: 6639-6647
[c56]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LorberbomGASH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LorberbomGASH24
Guy Lorberbom, Itai Gat, Yossi Adi, Alexander G. Schwing, Tamir Hazan:
Layer Collaboration in the Forward-Forward Algorithm. AAAI 2024: 14141-14148
[c55]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/emnlp/OrenHNA024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/OrenHNA024
Matanel Oren, Michael Hassid, Yarden Nir, Yossi Adi, Roy Schwartz:
Transformers are Multi-State RNNs. EMNLP 2024: 18724-18741
[c54]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/ZivGLRKCDSA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ZivGLRKCDSA24
Alon Ziv, Itai Gat, Gaël Le Lan, Tal Remez, Felix Kreuk, Jade Copet, Alexandre Défossez, Gabriel Synnaeve, Yossi Adi:
Masked Audio Generation using a Single Non-Autoregressive Transformer. ICLR 2024
[c53]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/LemercierRCAD24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LemercierRCAD24
Jean-Marie Lemercier, Simon Rouard, Jade Copet, Yossi Adi, Alexandre Défossez:
An Independence-promoting Loss for Music Generation with Language Models. ICML 2024
[i78]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-04577
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-04577
Alon Ziv, Itai Gat, Gaël Le Lan, Tal Remez, Felix Kreuk, Alexandre Défossez, Jade Copet, Gabriel Synnaeve, Yossi Adi:
Masked Audio Generation using a Single Non-Autoregressive Transformer. CoRR abs/2401.04577 (2024)
[i77]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-06104
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-06104
Matanel Oren, Michael Hassid, Yossi Adi, Roy Schwartz:
Transformers are Multi-State RNNs. CoRR abs/2401.06104 (2024)
[i76]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-00725
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-00725
Michael Hassid, Tal Remez, Jonas Gehring, Roy Schwartz, Yossi Adi:
The Larger the Better? Improved LLM Code-Generation via Budget Reallocation. CoRR abs/2404.00725 (2024)
[i75]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-02315
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02315
Jean-Marie Lemercier, Simon Rouard, Jade Copet, Yossi Adi, Alexandre Défossez:
An Independence-promoting Loss for Music Generation with Language Models. CoRR abs/2406.02315 (2024)
[i74]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07725
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07725
Xuankai Chang, Jiatong Shi, Jinchuan Tian, Yuning Wu, Yuxun Tang, Yihan Wu, Shinji Watanabe, Yossi Adi, Xie Chen, Qin Jin:
The Interspeech 2024 Challenge on Speech Processing Using Discrete Units. CoRR abs/2406.07725 (2024)
[i73]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10970
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10970
Or Tal, Alon Ziv, Itai Gat, Felix Kreuk, Yossi Adi:
Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation. CoRR abs/2406.10970 (2024)
[i72]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-11037
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-11037
Shoval Messica, Yossi Adi:
NAST: Noise Aware Speech Tokenization for Speech Language Models. CoRR abs/2406.11037 (2024)
[i71]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-13621
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-13621
Guy Yariv, Idan Schwartz, Yossi Adi, Sagie Benaim:
Improving Visual Commonsense in Language Models via Multiple Image Generation. CoRR abs/2406.13621 (2024)
[i70]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-07566
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-07566
Arnon Turetzky, Or Tal, Yael Segal-Feldman, Yehoshua Dissen, Ella Zeldes, Amit Roth, Eyal Cohen, Yosi Shrem, Bronya Roni Chernyak, Olga Seleznova, Joseph Keshet, Yossi Adi:
HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing. CoRR abs/2407.07566 (2024)
[i69]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-12206
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-12206
Amit Roth, Arnon Turetzky, Yossi Adi:
A Language Modeling Approach to Diacritic-Free Hebrew TTS. CoRR abs/2407.12206 (2024)
[i68]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-12563
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-12563
Simon Rouard, Yossi Adi, Jade Copet, Axel Roebel, Alexandre Défossez:
Audio Conditioning for Music Generation via Discrete Bottleneck Features. CoRR abs/2407.12563 (2024)
[i67]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-15595
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-15595
Itai Gat, Tal Remez, Neta Shaul, Felix Kreuk, Ricky T. Q. Chen, Gabriel Synnaeve, Yossi Adi, Yaron Lipman:
Discrete Flow Matching. CoRR abs/2407.15595 (2024)
[i66]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-17434
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-17434
Shiran Aziz, Yossi Adi, Shmuel Peleg:
Audio Enhancement from Multiple Crowdsourced Recordings: A Simple and Effective Baseline. CoRR abs/2408.17434 (2024)
[i65]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-02915
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-02915
Robin San Roman, Pierre Fernandez, Antoine Deleforge, Yossi Adi, Romain Serizel:
Latent Watermarking of Audio Generative Models. CoRR abs/2409.02915 (2024)
[i64]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-03701
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-03701
Arnon Turetzky, Yossi Adi:
LAST: Language Model Aware Speech Tokenization. CoRR abs/2409.03701 (2024)
[i63]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-07437
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-07437
Gallil Maimon, Amit Roth, Yossi Adi:
A Suite for Acoustic Language Model Evaluation. CoRR abs/2409.07437 (2024)
2023
[j7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/tacl/NguyenKCAHETASM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tacl/NguyenKCAHETASM23
Tu Anh Nguyen, Eugene Kharitonov, Jade Copet, Yossi Adi, Wei-Ning Hsu, Ali Elkahky, Paden Tomasello, Robin Algayres, Benoît Sagot, Abdelrahman Mohamed, Emmanuel Dupoux:
Generative Spoken Dialogue Language Modeling. Trans. Assoc. Comput. Linguistics 11: 250-266 (2023)
[j6]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/tmlr/DefossezCSA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/DefossezCSA23
Alexandre Défossez, Jade Copet, Gabriel Synnaeve, Yossi Adi:
High Fidelity Neural Audio Compression. Trans. Mach. Learn. Res. 2023 (2023)
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/HsuRSDA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/HsuRSDA23
Wei-Ning Hsu, Tal Remez, Bowen Shi, Jacob Donley, Yossi Adi:
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Regeneration. CVPR 2023: 18796-18806
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/AlgayresANCSSD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/AlgayresANCSSD23
Robin Algayres, Yossi Adi, Tu Anh Nguyen, Jade Copet, Gabriel Synnaeve, Benoît Sagot, Emmanuel Dupoux:
Generative Spoken Language Model based on continuous word-sized audio tokens. EMNLP 2023: 3008-3028
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/MaimonA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/MaimonA23
Gallil Maimon, Yossi Adi:
Speaking Style Conversion in the Waveform Domain Using Discrete Self-Supervised Units. EMNLP (Findings) 2023: 8048-8061
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ElkahkyHTNAACDM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ElkahkyHTNAACDM23
Ali Elkahky, Wei-Ning Hsu, Paden Tomasello, Tu Anh Nguyen, Robin Algayres, Yossi Adi, Jade Copet, Emmanuel Dupoux, Abdelrahman Mohamed:
Do Coarser Units Benefit Cluster Prediction-Based Speech Pre-Training? ICASSP 2023: 1-5
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangPKWGSALC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangPKWGSALC23
Wen-Chin Huang, Benjamin N. Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen:
A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation. ICASSP 2023: 1-5
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MandelTA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MandelTA23
Moshe Mandel, Or Tal, Yossi Adi:
AERO: Audio Super Resolution in the Spectral Domain. ICASSP 2023: 1-5
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShefferA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShefferA23
Roy Sheffer, Yossi Adi:
I Hear Your True Colors: Image Guided Audio Generation. ICASSP 2023: 1-5
[c45]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SichermanA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SichermanA23
Amitay Sicherman, Yossi Adi:
Analysing Discrete Self Supervised Speech Representation For Spoken Language Modeling. ICASSP 2023: 1-5
[c44]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/KreukSPSDCPTA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/KreukSPSDCPTA23
Felix Kreuk, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Alexandre Défossez, Jade Copet, Devi Parikh, Yaniv Taigman, Yossi Adi:
AudioGen: Textually Guided Audio Generation. ICLR 2023
[c43]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NguyenHDSGFRCSH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NguyenHDSGFRCSH23
Tu Anh Nguyen, Wei-Ning Hsu, Antony D'Avirro, Bowen Shi, Itai Gat, Maryam Fazel-Zarandi, Tal Remez, Jade Copet, Gabriel Synnaeve, Michael Hassid, Felix Kreuk, Yossi Adi, Emmanuel Dupoux:
Expresso: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis. INTERSPEECH 2023: 4823-4827
[c42]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YarivGWAS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YarivGWAS23
Guy Yariv, Itai Gat, Lior Wolf, Yossi Adi, Idan Schwartz:
Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation. INTERSPEECH 2023: 5446-5450
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/iwslt/GatKN0CSDA23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/GatKN0CSDA23
Itai Gat, Felix Kreuk, Tu Anh Nguyen, Ann Lee, Jade Copet, Gabriel Synnaeve, Emmanuel Dupoux, Yossi Adi:
Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling. IWSLT@ACL 2023: 465-477
[c40]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/CopetKGRKSAD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/CopetKGRKSAD23
Jade Copet, Felix Kreuk, Itai Gat, Tal Remez, David Kant, Gabriel Synnaeve, Yossi Adi, Alexandre Défossez:
Simple and Controllable Music Generation. NeurIPS 2023
[c39]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/HassidRNGCKCDSD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/HassidRNGCKCDSD23
Michael Hassid, Tal Remez, Tu Anh Nguyen, Itai Gat, Alexis Conneau, Felix Kreuk, Jade Copet, Alexandre Défossez, Gabriel Synnaeve, Emmanuel Dupoux, Roy Schwartz, Yossi Adi:
Textually Pretrained Speech Language Models. NeurIPS 2023
[c38]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/LeVSKSMWMAMH23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LeVSKSMWMAMH23
Matthew Le, Apoorv Vyas, Bowen Shi, Brian Karrer, Leda Sari, Rashel Moritz, Mary Williamson, Vimal Manohar, Yossi Adi, Jay Mahadeokar, Wei-Ning Hsu:
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale. NeurIPS 2023
[c37]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/RomanADSSD23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/RomanADSSD23
Robin San Roman, Yossi Adi, Antoine Deleforge, Romain Serizel, Gabriel Synnaeve, Alexandre Défossez:
From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion. NeurIPS 2023
[i62]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-00591
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-00591
Amitay Sicherman, Yossi Adi:
Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling. CoRR abs/2301.00591 (2023)
[i61]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-10606
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-10606
Wen-Chin Huang, Benjamin N. Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen:
A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation. CoRR abs/2301.10606 (2023)
[i60]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-12393
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-12393
Guy Lorberbom, Itai Gat, Yossi Adi, Alexander G. Schwing, Tamir Hazan:
Layer Collaboration in the Forward-Forward Algorithm. CoRR abs/2305.12393 (2023)
[i59]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13009
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13009
Michael Hassid, Tal Remez, Tu Anh Nguyen, Itai Gat, Alexis Conneau, Felix Kreuk, Jade Copet, Alexandre Défossez, Gabriel Synnaeve, Emmanuel Dupoux, Roy Schwartz, Yossi Adi:
Textually Pretrained Speech Language Models. CoRR abs/2305.13009 (2023)
[i58]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13050
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13050
Guy Yariv, Itai Gat, Lior Wolf, Yossi Adi, Idan Schwartz:
AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation. CoRR abs/2305.13050 (2023)
[i57]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13516
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13516
Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli:
Scaling Speech Technology to 1, 000+ Languages. CoRR abs/2305.13516 (2023)
[i56]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-05284
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-05284
Jade Copet, Felix Kreuk, Itai Gat, Tal Remez, David Kant, Gabriel Synnaeve, Yossi Adi, Alexandre Défossez:
Simple and Controllable Music Generation. CoRR abs/2306.05284 (2023)
[i55]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-15687
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-15687
Matthew Le, Apoorv Vyas, Bowen Shi, Brian Karrer, Leda Sari, Rashel Moritz, Mary Williamson, Vimal Manohar, Yossi Adi, Jay Mahadeokar, Wei-Ning Hsu:
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale. CoRR abs/2306.15687 (2023)
[i54]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-02560
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-02560
Robin San-Roman, Yossi Adi, Antoine Deleforge, Romain Serizel, Gabriel Synnaeve, Alexandre Défossez:
From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion. CoRR abs/2308.02560 (2023)
[i53]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-05725
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-05725
Tu Anh Nguyen, Wei-Ning Hsu, Antony D'Avirro, Bowen Shi, Itai Gat, Maryam Fazel-Zarandi, Tal Remez, Jade Copet, Gabriel Synnaeve, Michael Hassid, Felix Kreuk, Yossi Adi, Emmanuel Dupoux:
EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis. CoRR abs/2308.05725 (2023)
[i52]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-12950
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-12950
Baptiste Rozière, Jonas Gehring, Fabian Gloeckle, Sten Sootla, Itai Gat, Xiaoqing Ellen Tan, Yossi Adi, Jingyu Liu, Tal Remez, Jérémy Rapin, Artyom Kozhevnikov, Ivan Evtimov, Joanna Bitton, Manish Bhatt, Cristian Canton-Ferrer, Aaron Grattafiori, Wenhan Xiong, Alexandre Défossez, Jade Copet, Faisal Azhar, Hugo Touvron, Louis Martin, Nicolas Usunier, Thomas Scialom, Gabriel Synnaeve:
Code Llama: Open Foundation Models for Code. CoRR abs/2308.12950 (2023)
[i51]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-16429
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-16429
Guy Yariv, Itai Gat, Sagie Benaim, Lior Wolf, Idan Schwartz, Yossi Adi:
Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation. CoRR abs/2309.16429 (2023)
[i50]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-17020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-17020
Po-Chun Hsu, Ali Elkahky, Wei-Ning Hsu, Yossi Adi, Tu Anh Nguyen, Jade Copet, Emmanuel Dupoux, Hung-yi Lee, Abdelrahman Mohamed:
Low-Resource Self-Supervised Learning with SSL-Enhanced TTS. CoRR abs/2309.17020 (2023)
[i49]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-05224
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-05224
Robin Algayres, Yossi Adi, Tu Anh Nguyen, Jade Copet, Gabriel Synnaeve, Benoît Sagot, Emmanuel Dupoux:
Generative Spoken Language Model based on continuous word-sized audio tokens. CoRR abs/2310.05224 (2023)
2022
[j5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/jstsp/TzinisAIXSK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/TzinisAIXSK22
Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Paris Smaragdis, Anurag Kumar:
RemixIT: Continual Self-Training of Speech Enhancement Models via Bootstrapped Remixing. IEEE J. Sel. Top. Signal Process. 16(6): 1329-1341 (2022)
[j4]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/tmlr/DefossezAS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/DefossezAS22
Alexandre Défossez, Yossi Adi, Gabriel Synnaeve:
Differentiable Model Compression via Pseudo Quantization Noise. Trans. Mach. Learn. Res. 2022 (2022)
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/LeeCWGPMPAHTPH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/LeeCWGPMPAHTPH22
Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Sravya Popuri, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Pino, Wei-Ning Hsu:
Direct Speech-to-Speech Translation With Discrete Units. ACL (1) 2022: 3327-3339
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/KharitonovLPACL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/KharitonovLPACL22
Eugene Kharitonov, Ann Lee, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu:
Text-Free Prosody-Aware Generative Spoken Language Modeling. ACL (1) 2022: 8666-8681
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/KreukPCKNRHMDA22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/KreukPCKNRHMDA22
Felix Kreuk, Adam Polyak, Jade Copet, Eugene Kharitonov, Tu Anh Nguyen, Morgane Rivière, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
Textless Speech Emotion Conversion using Discrete & Decomposed Representations. EMNLP 2022: 11200-11214
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TzinisAIXK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TzinisAIXK22
Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Anurag Kumar:
Continual Self-Training With Bootstrapped Remixing For Speech Enhancement. ICASSP 2022: 6947-6951
[c32]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/BerlinerRARH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/BerlinerRARH22
Alon Berliner, Guy Rotman, Yossi Adi, Roi Reichart, Tamir Hazan:
Learning Discrete Structured Variational Auto-Encoder using Natural Evolution Strategies. ICLR 2022
[c31]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TalMKA22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TalMKA22
Or Tal, Moshe Mandel, Felix Kreuk, Yossi Adi:
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement. INTERSPEECH 2022: 1193-1197
[c30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SeysselLADW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SeysselLADW22
Maureen de Seyssel, Marvin Lavechin, Yossi Adi, Emmanuel Dupoux, Guillaume Wisniewski:
Probing phoneme, language and speaker information in unsupervised speech representations. INTERSPEECH 2022: 1402-1406
[c29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BassanAR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BassanAR22
Shahaf Bassan, Yossi Adi, Jeffrey S. Rosenschein:
Unsupervised Symbolic Music Segmentation using Ensemble Temporal Prediction Errors. INTERSPEECH 2022: 2423-2427
[c28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuretzkyMAP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuretzkyMAP22
Arnon Turetzky, Tzvi Michelson, Yossi Adi, Shmuel Peleg:
Deep Audio Waveform Prior. INTERSPEECH 2022: 2938-2942
[c27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PopuriCWPAGHL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PopuriCWPAGHL22
Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee:
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation. INTERSPEECH 2022: 5195-5199
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/naacl/LeeGDSCWPAPGH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/LeeGDSCWPAPGH22
Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Yossi Adi, Juan Miguel Pino, Jiatao Gu, Wei-Ning Hsu:
Textless Speech-to-Speech Translation on Real Data. NAACL-HLT 2022: 860-872
[c25]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/GatASH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/GatASH22
Itai Gat, Yossi Adi, Alexander G. Schwing, Tamir Hazan:
On the Importance of Gradient Norm in PAC-Bayesian Bounds. NeurIPS 2022
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/TomaselloSLHLSECHAANDZM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/TomaselloSLHLSECHAANDZM22
Paden Tomasello, Akshat Shrivastava, Daniel Lazar, Po-Chun Hsu, Duc Le, Adithya Sagar, Ali Elkahky, Jade Copet, Wei-Ning Hsu, Yossi Adi, Robin Algayres, Tu Anh Nguyen, Emmanuel Dupoux, Luke Zettlemoyer, Abdelrahman Mohamed:
Stop: A Dataset for Spoken Task Oriented Semantic Parsing. SLT 2022: 991-998
[i48]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-07359
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-07359
Eugene Kharitonov, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Paden Tomasello, Ann Lee, Ali Elkahky, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
textless-lib: a Library for Textless Spoken Language Processing. CoRR abs/2202.07359 (2022)
[i47]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-08862
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-08862
Efthymios Tzinis, Yossi Adi, Vamsi Krishna Ithapu, Buye Xu, Paris Smaragdis, Anurag Kumar:
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing. CoRR abs/2202.08862 (2022)
[i46]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-16193
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-16193
Maureen de Seyssel, Marvin Lavechin, Yossi Adi, Emmanuel Dupoux, Guillaume Wisniewski:
Probing phoneme, language and speaker information in unsupervised speech representations. CoRR abs/2203.16193 (2022)
[i45]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-16502
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-16502
Tu Anh Nguyen, Eugene Kharitonov, Jade Copet, Yossi Adi, Wei-Ning Hsu, Ali Elkahky, Paden Tomasello, Robin Algayres, Benoît Sagot, Abdelrahman Mohamed, Emmanuel Dupoux:
Generative Spoken Dialogue Language Modeling. CoRR abs/2203.16502 (2022)
[i44]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-02967
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-02967
Sravya Popuri, Peng-Jen Chen, Changhan Wang, Juan Pino, Yossi Adi, Jiatao Gu, Wei-Ning Hsu, Ann Lee:
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation. CoRR abs/2204.02967 (2022)
[i43]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-01324
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-01324
Alon Berliner, Guy Rotman, Yossi Adi, Roi Reichart, Tamir Hazan:
Learning Discrete Structured Variational Auto-Encoder using Natural Evolution Strategies. CoRR abs/2205.01324 (2022)
[i42]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-11000
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-11000
Or Tal, Moshe Mandel, Felix Kreuk, Yossi Adi:
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement. CoRR abs/2206.11000 (2022)
[i41]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-00760
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-00760
Shahaf Bassan, Yossi Adi, Jeffrey S. Rosenschein:
Unsupervised Symbolic Music Segmentation using Ensemble Temporal Prediction Errors. CoRR abs/2207.00760 (2022)
[i40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-10441
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-10441
Arnon Turetzky, Tzvi Michelson, Yossi Adi, Shmuel Peleg:
Deep Audio Waveform Prior. CoRR abs/2207.10441 (2022)
[i39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-15352
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-15352
Felix Kreuk, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Alexandre Défossez, Jade Copet, Devi Parikh, Yaniv Taigman, Yossi Adi:
AudioGen: Textually Guided Audio Generation. CoRR abs/2209.15352 (2022)
[i38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-15483
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-15483
Itai Gat, Felix Kreuk, Ann Lee, Jade Copet, Gabriel Synnaeve, Emmanuel Dupoux, Yossi Adi:
On The Robustness of Self-Supervised Representations for Spoken Language Modeling. CoRR abs/2209.15483 (2022)
[i37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-06143
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-06143
Itai Gat, Yossi Adi, Alexander G. Schwing, Tamir Hazan:
On the Importance of Gradient Norm in PAC-Bayesian Bounds. CoRR abs/2210.06143 (2022)
[i36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-13438
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-13438
Alexandre Défossez, Jade Copet, Gabriel Synnaeve, Yossi Adi:
High Fidelity Neural Audio Compression. CoRR abs/2210.13438 (2022)
[i35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-01223
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-01223
Felix Kreuk, Yaniv Taigman, Adam Polyak, Jade Copet, Gabriel Synnaeve, Alexandre Défossez, Yossi Adi:
Audio Language Modeling using Perceptually-Guided Discrete Representations. CoRR abs/2211.01223 (2022)
[i34]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-03089
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-03089
Roy Sheffer, Yossi Adi:
I Hear Your True Colors: Image Guided Audio Generation. CoRR abs/2211.03089 (2022)
[i33]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-12232
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-12232
Moshe Mandel, Or Tal, Yossi Adi:
AERO: Audio Super Resolution in the Spectral Domain. CoRR abs/2211.12232 (2022)
[i32]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-09730
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-09730
Gallil Maimon, Yossi Adi:
Speaking Style Conversion With Discrete Self-Supervised Units. CoRR abs/2212.09730 (2022)
[i31]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-11377
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-11377
Wei-Ning Hsu, Tal Remez, Bowen Shi, Jacob Donley, Yossi Adi:
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement. CoRR abs/2212.11377 (2022)
2021
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/TanXKNA21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/TanXKNA21
Ke Tan, Buye Xu, Anurag Kumar, Eliya Nachmani, Yossi Adi:
SAGRNN: Self-Attentive Gated RNN For Binaural Speaker Separation With Interaural Cue Preservation. IEEE Signal Process. Lett. 28: 26-30 (2021)
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/aies/SegalAPBGK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aies/SegalAPBGK21
Shahar Segal, Yossi Adi, Benny Pinkas, Carsten Baum, Chaya Ganesh, Joseph Keshet:
Fairness in the Eyes of the Data: Certifying Machine-Learning Models. AIES 2021: 926-935
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/WangHAPLCGP21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/WangHAPLCGP21
Changhan Wang, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Ann Lee, Peng-Jen Chen, Jiatao Gu, Juan Pino:
fairseq S\^2: A Scalable and Integrable Speech Synthesis Toolkit. EMNLP (Demos) 2021: 143-152
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChazanWNA21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChazanWNA21
Shlomo E. Chazan, Lior Wolf, Eliya Nachmani, Yossi Adi:
Single Channel Voice Separation for Unknown Number of Speakers Under Reverberant and Noisy Settings. ICASSP 2021: 3730-3734
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PolyakWAKT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PolyakWAKT21
Adam Polyak, Lior Wolf, Yossi Adi, Ori Kabeli, Yaniv Taigman:
High Fidelity Speech Regeneration with Application to Speech Enhancement. ICASSP 2021: 7143-7147
[c19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PolyakACKLHMD21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PolyakACKLHMD21
Adam Polyak, Yossi Adi, Jade Copet, Eugene Kharitonov, Kushal Lakhotia, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux:
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations. Interspeech 2021: 3615-3619
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-00429
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-00429
Adam Polyak, Lior Wolf, Yossi Adi, Ori Kabeli, Yaniv Taigman:
High Fidelity Speech Regeneration with Application to Speech Enhancement. CoRR abs/2102.00429 (2021)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-01192
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-01192
Kushal Lakhotia, Evgeny Kharitonov, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Benjamin Bolte, Tu Anh Nguyen, Jade Copet, Alexei Baevski, Adelrahman Mohamed, Emmanuel Dupoux:
Generative Spoken Language Modeling from Raw Audio. CoRR abs/2102.01192 (2021)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-00355
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-00355
Adam Polyak, Yossi Adi, Jade Copet, Eugene Kharitonov, Kushal Lakhotia, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux:
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations. CoRR abs/2104.00355 (2021)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-09987
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-09987
Alexandre Défossez, Yossi Adi, Gabriel Synnaeve:
Differentiable Model Compression via Pseudo Quantization Noise. CoRR abs/2104.09987 (2021)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-13493
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-13493
Ori Kabeli, Yossi Adi, Zhenyu Tang, Buye Xu, Anurag Kumar:
Online Self-Attentive Gated RNNs for Real-Time Speaker Separation. CoRR abs/2106.13493 (2021)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-05604
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-05604
Ann Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Miguel Pino, Wei-Ning Hsu:
Direct speech-to-speech translation with discrete units. CoRR abs/2107.05604 (2021)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2109-03264
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-03264
Eugene Kharitonov, Ann Lee, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu:
Text-Free Prosody-Aware Generative Spoken Language Modeling. CoRR abs/2109.03264 (2021)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2109-06912
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-06912
Changhan Wang, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Ann Lee, Peng-Jen Chen, Jiatao Gu, Juan Miguel Pino:
fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit. CoRR abs/2109.06912 (2021)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-10103
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-10103
Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Anurag Kumar:
Continual self-training with bootstrapped remixing for speech enhancement. CoRR abs/2110.10103 (2021)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-07402
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-07402
Felix Kreuk, Adam Polyak, Jade Copet, Eugene Kharitonov, Tu Anh Nguyen, Morgane Rivière, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
Textless Speech Emotion Conversion using Decomposed and Discrete Representations. CoRR abs/2111.07402 (2021)
2020
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KreukSKA20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KreukSKA20
Felix Kreuk, Yaniv Sheena, Joseph Keshet, Yossi Adi:
Phoneme Boundary Detection Using Learnable Segmental Features. ICASSP 2020: 8089-8093
[c17]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/NachmaniAW20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/NachmaniAW20
Eliya Nachmani, Yossi Adi, Lior Wolf:
Voice Separation with an Unknown Number of Multiple Speakers. ICML 2020: 7164-7175
[c16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PolyakWAT20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PolyakWAT20
Adam Polyak, Lior Wolf, Yossi Adi, Yaniv Taigman:
Unsupervised Cross-Domain Singing Voice Conversion. INTERSPEECH 2020: 801-805
[c15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DefossezSA20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DefossezSA20
Alexandre Défossez, Gabriel Synnaeve, Yossi Adi:
Real Time Speech Enhancement in the Waveform Domain. INTERSPEECH 2020: 3291-3295
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KreukKA20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KreukKA20
Felix Kreuk, Joseph Keshet, Yossi Adi:
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation. INTERSPEECH 2020: 3700-3704
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KreukARSK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KreukARSK20
Felix Kreuk, Yossi Adi, Bhiksha Raj, Rita Singh, Joseph Keshet:
Hide and Speak: Towards Deep Neural Networks for Speech Steganography. INTERSPEECH 2020: 4656-4660
[c12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/lpar/GoldbergerKAK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/lpar/GoldbergerKAK20
Ben Goldberger, Guy Katz, Yossi Adi, Joseph Keshet:
Minimal Modifications of Deep Neural Networks using Verification. LPAR 2020: 260-278
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-04992
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-04992
Felix Kreuk, Yaniv Sheena, Joseph Keshet, Yossi Adi:
Phoneme Boundary Detection using Learnable Segmental Features. CoRR abs/2002.04992 (2020)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-09866
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-09866
Yossi Adi, Yaniv Nemcovsky, Alexander G. Schwing, Tamir Hazan:
On the generalization of bayesian deep nets for multi-class classification. CoRR abs/2002.09866 (2020)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2003-01531
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-01531
Eliya Nachmani, Yossi Adi, Lior Wolf:
Voice Separation with an Unknown Number of Multiple Speakers. CoRR abs/2003.01531 (2020)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2006-12847
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-12847
Alexandre Défossez, Gabriel Synnaeve, Yossi Adi:
Real Time Speech Enhancement in the Waveform Domain. CoRR abs/2006.12847 (2020)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2007-13465
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-13465
Felix Kreuk, Joseph Keshet, Yossi Adi:
Self-Supervised Contrastive Learning for Unsupervised Phoneme Segmentation. CoRR abs/2007.13465 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2008-02830
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-02830
Adam Polyak, Lior Wolf, Yossi Adi, Yaniv Taigman:
Unsupervised Cross-Domain Singing Voice Conversion. CoRR abs/2008.02830 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2009-01381
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-01381
Ke Tan, Buye Xu, Anurag Kumar, Eliya Nachmani, Yossi Adi:
SAGRNN: Self-Attentive Gated RNN for Binaural Speaker Separation with Interaural Cue Preservation. CoRR abs/2009.01381 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2009-01534
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-01534
Shahar Segal, Yossi Adi, Benny Pinkas, Carsten Baum, Chaya Ganesh, Joseph Keshet:
Fairness in the Eyes of the Data: Certifying Machine-Learning Models. CoRR abs/2009.01534 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-02329
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-02329
Shlomo E. Chazan, Lior Wolf, Eliya Nachmani, Yossi Adi:
Single channel voice separation for unknown number of speakers under reverberant and noisy settings. CoRR abs/2011.02329 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AdiZCULS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AdiZCULS19
Yossi Adi, Neil Zeghidour, Ronan Collobert, Nicolas Usunier, Vitaliy Liptchinsky, Gabriel Synnaeve:
To Reverse the Gradient or Not: an Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition. ICASSP 2019: 3742-3746
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1902-03083
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-03083
Felix Kreuk, Yossi Adi, Bhiksha Raj, Rita Singh, Joseph Keshet:
Hide and Speak: Deep Neural Networks for Speech Steganography. CoRR abs/1902.03083 (2019)
2018
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KreukACK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KreukACK18
Felix Kreuk, Yossi Adi, Moustapha Cissé, Joseph Keshet:
Fooling End-To-End Speaker Verification With Adversarial Examples. ICASSP 2018: 1962-1966
[c9]
- view
- export record
  dblp key:
  - conf/nips/ShalevAK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ShalevAK18
Gabi Shalev, Yossi Adi, Joseph Keshet:
Out-of-Distribution Detection using Multiple Semantic Label Representations. NeurIPS 2018: 7386-7396
[c8]
- view
  - electronic edition @ usenix.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/uss/AdiBCPK18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uss/AdiBCPK18
Yossi Adi, Carsten Baum, Moustapha Cissé, Benny Pinkas, Joseph Keshet:
Turning Your Weakness Into a Strength: Watermarking Deep Neural Networks by Backdooring. USENIX Security Symposium 2018: 1615-1631
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1801-03339
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1801-03339
Felix Kreuk, Yossi Adi, Moustapha Cissé, Joseph Keshet:
Fooling End-to-end Speaker Verification by Adversarial Examples. CoRR abs/1801.03339 (2018)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1802-04633
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1802-04633
Yossi Adi, Carsten Baum, Moustapha Cissé, Benny Pinkas, Joseph Keshet:
Turning Your Weakness Into a Strength: Watermarking Deep Neural Networks by Backdooring. CoRR abs/1802.04633 (2018)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1808-06664
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-06664
Gabi Shalev, Yossi Adi, Joseph Keshet:
Out-of-Distribution Detection using Multiple Semantic Label Representations. CoRR abs/1808.06664 (2018)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1812-03483
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-03483
Yossi Adi, Neil Zeghidour, Ronan Collobert, Nicolas Usunier, Vitaliy Liptchinsky, Gabriel Synnaeve:
To Reverse the Gradient or Not: An Empirical Comparison of Adversarial and Multi-task Learning in Speech Recognition. CoRR abs/1812.03483 (2018)
2017
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/ibmrd/AdiKBLG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ibmrd/AdiKBLG17
Yossi Adi, Einat Kermany, Yonatan Belinkov, Ofer Lavi, Yoav Goldberg:
Analysis of sentence embedding models using prediction tasks in natural language processing. IBM J. Res. Dev. 61(4-5): 3:1-3:9 (2017)
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AdiKCG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AdiKCG17
Yossi Adi, Joseph Keshet, Emily Cibelli, Matthew Goldrick:
Sequence segmentation using joint RNN and structured prediction models. ICASSP 2017: 2422-2426
[c6]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/AdiKBLG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/AdiKBLG17
Yossi Adi, Einat Kermany, Yonatan Belinkov, Ofer Lavi, Yoav Goldberg:
Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks. ICLR (Poster) 2017
[c5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SheenaHAK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SheenaHAK17
Yaniv Sheena, Mísa Hejná, Yossi Adi, Joseph Keshet:
Automatic Measurement of Pre-Aspiration. INTERSPEECH 2017: 1049-1053
[c4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NaamanAK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NaamanAK17
Einat Naaman, Yossi Adi, Joseph Keshet:
Learning Similarity Functions for Pronunciation Variations. INTERSPEECH 2017: 2561-2565
[c3]
- view
- export record
  dblp key:
  - conf/nips/CisseANK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/CisseANK17
Moustapha Cissé, Yossi Adi, Natalia Neverova, Joseph Keshet:
Houdini: Fooling Deep Structured Visual and Speech Recognition Models with Adversarial Examples. NIPS 2017: 6977-6987
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/NaamanAK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/NaamanAK17
Einat Naaman, Yossi Adi, Joseph Keshet:
Learning Similarity Function for Pronunciation Variations. CoRR abs/1703.09817 (2017)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/SheenaHAK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SheenaHAK17
Yaniv Sheena, Mísa Hejná, Yossi Adi, Joseph Keshet:
Automatic Measurement of Pre-aspiration. CoRR abs/1704.01653 (2017)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/CisseANK17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/CisseANK17
Moustapha Cissé, Yossi Adi, Natalia Neverova, Joseph Keshet:
Houdini: Fooling Deep Structured Prediction Models. CoRR abs/1707.05373 (2017)
2016
[j1]
- view
  - electronic edition @ jmlr.org (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/jmlr/AdiK16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/AdiK16
Yossi Adi, Joseph Keshet:
StructED: Risk Minimization in Structured Prediction. J. Mach. Learn. Res. 17: 64:1-64:5 (2016)
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AdiKDG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AdiKDG16
Yossi Adi, Joseph Keshet, Olga Dmitrieva, Matthew Goldrick:
Automatic Measurement of Voice Onset Time and Prevoicing Using Recurrent Neural Networks. INTERSPEECH 2016: 3152-3155
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/AdiKBLG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AdiKBLG16
Yossi Adi, Einat Kermany, Yonatan Belinkov, Ofer Lavi, Yoav Goldberg:
Fine-grained Analysis of Sentence Embeddings Using Auxiliary Prediction Tasks. CoRR abs/1608.04207 (2016)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/AdiKCG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AdiKCG16
Yossi Adi, Joseph Keshet, Emily Cibelli, Matthew Goldrick:
Sequence Segmentation Using Joint RNN and Structured Prediction Models. CoRR abs/1610.07918 (2016)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/AdiKCGCG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/AdiKCGCG16
Yossi Adi, Joseph Keshet, Emily Cibelli, Erin Gustafson, Cynthia G. Clopper, Matthew Goldrick:
Automatic measurement of vowel duration via structured prediction. CoRR abs/1610.08166 (2016)
2015
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/AdiKG15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/AdiKG15
Yossi Adi, Joseph Keshet, Matthew Goldrick:
Vowel duration measurement using deep neural networks. MLSP 2015: 1-6

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.