Scott Wisdom
2020 – today
- 2025
- [j2] Simon Leglaive, Matthieu Fraticelli, Hend Elghazaly, Léonie Borne, Mostafa Sadeghi, Scott Wisdom, Manuel Pariente, John R. Hershey, Daniel Pressnitzer, Jon P. Barker:
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge. Comput. Speech Lang. 89: 101685 (2025)
- 2024
- [c36] Cong Han, Kevin W. Wilson, Scott Wisdom, John R. Hershey:
Unsupervised Multi-Channel Separation And Adaptation. ICASSP 2024: 721-725
- [i32] Simon Leglaive, Matthieu Fraticelli, Hend Elghazaly, Léonie Borne, Mostafa Sadeghi, Scott Wisdom, Manuel Pariente, John R. Hershey, Daniel Pressnitzer, Jon P. Barker:
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge. CoRR abs/2402.01413 (2024)
- [i31] Artem Dementyev, Chandan K. A. Reddy, Scott Wisdom, Navin Chatlani, John R. Hershey, Richard F. Lyon:
Towards sub-millisecond latency real-time speech enhancement models on hearables. CoRR abs/2409.18239 (2024)
- 2023
- [c35] Pradyumna Reddy, Scott Wisdom, Klaus Greff, John R. Hershey, Thomas Kipf:
Audioslots: A Slot-Centric Generative Model For Audio Separation. ICASSP Workshops 2023: 1-5
- [c34] Hakan Erdogan, Scott Wisdom, Xuankai Chang, Zalán Borsos, Marco Tagliasacchi, Neil Zeghidour, John R. Hershey:
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition. INTERSPEECH 2023: 3462-3466
- [i30] Pradyumna Reddy, Scott Wisdom, Klaus Greff, John R. Hershey, Thomas Kipf:
AudioSlots: A slot-centric generative model for audio separation. CoRR abs/2305.05591 (2023)
- [i29] Cong Han, Kevin W. Wilson, Scott Wisdom, John R. Hershey:
Unsupervised Multi-channel Separation and Adaptation. CoRR abs/2305.11151 (2023)
- [i28] Simon Leglaive, Léonie Borne, Efthymios Tzinis, Mostafa Sadeghi, Matthieu Fraticelli, Scott Wisdom, Manuel Pariente, Daniel Pressnitzer, John R. Hershey:
The CHiME-7 UDASE task: Unsupervised domain adaptation for conversational speech enhancement. CoRR abs/2307.03533 (2023)
- [i27] Hakan Erdogan, Scott Wisdom, Xuankai Chang, Zalán Borsos, Marco Tagliasacchi, Neil Zeghidour, John R. Hershey:
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition. CoRR abs/2308.10415 (2023)
- 2022
- [c33] Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey:
AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation. ECCV (37) 2022: 368-385
- [c32] Tom Denton, Scott Wisdom, John R. Hershey:
Improving Bird Classification with Unsupervised Sound Separation. ICASSP 2022: 636-640
- [c31] Aswin Sivaraman, Scott Wisdom, Hakan Erdogan, John R. Hershey:
Adapting Speech Separation to Real-World Meetings using Mixture Invariant Training. ICASSP 2022: 686-690
- [c30] Hannah Muckenhirn, Aleksandr Safin, Hakan Erdogan, Felix de Chaumont Quitry, Marco Tagliasacchi, Scott Wisdom, John R. Hershey:
CycleGAN-based Unpaired Speech Dereverberation. INTERSPEECH 2022: 196-200
- [c29] Katharine Patterson, Kevin W. Wilson, Scott Wisdom, John R. Hershey:
Distance-Based Sound Separation. INTERSPEECH 2022: 901-905
- [c28] Samuel J. Yang, Scott Wisdom, Chet Gnegy, Richard F. Lyon, Sagar Savla:
Listening with Googlears: Low-Latency Neural Multiframe Beamforming and Equalization for Hearing Aids. INTERSPEECH 2022: 3939-3943
- [c27] Kevin Kilgour, Beat Gfeller, Qingqing Huang, Aren Jansen, Scott Wisdom, Marco Tagliasacchi:
Text-Driven Separation of Arbitrary Sounds. INTERSPEECH 2022: 5403-5407
- [i26] Hannah Muckenhirn, Aleksandr Safin, Hakan Erdogan, Felix de Chaumont Quitry, Marco Tagliasacchi, Scott Wisdom, John R. Hershey:
CycleGAN-Based Unpaired Speech Dereverberation. CoRR abs/2203.15652 (2022)
- [i25] Kevin Kilgour, Beat Gfeller, Qingqing Huang, Aren Jansen, Scott Wisdom, Marco Tagliasacchi:
Text-Driven Separation of Arbitrary Sounds. CoRR abs/2204.05738 (2022)
- [i24] Katharine Patterson, Kevin W. Wilson, Scott Wisdom, John R. Hershey:
Distance-Based Sound Separation. CoRR abs/2207.00562 (2022)
- [i23] Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey:
AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation. CoRR abs/2207.10141 (2022)
- 2021
- [c26] Scott Wisdom, Hakan Erdogan, Daniel P. W. Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John R. Hershey:
What's all the Fuss about Free Universal Sound Separation Data? ICASSP 2021: 186-190
- [c25] Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John R. Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Sound Event Detection and Separation: A Benchmark on Desed Synthetic Soundscapes. ICASSP 2021: 840-844
- [c24] Soumi Maiti, Hakan Erdogan, Kevin W. Wilson, Scott Wisdom, Shinji Watanabe, John R. Hershey:
End-To-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings. ICASSP 2021: 7183-7187
- [c23] Efthymios Tzinis, Scott Wisdom, Aren Jansen, Shawn Hershey, Tal Remez, Dan Ellis, John R. Hershey:
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds. ICLR 2021
- [c22] Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Maokui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis. SLT 2021: 897-904
- [c21] Zhong-Qiu Wang, Hakan Erdogan, Scott Wisdom, Kevin W. Wilson, Desh Raj, Shinji Watanabe, Zhuo Chen, John R. Hershey:
Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement. SLT 2021: 905-911
- [c20] Scott Wisdom, Aren Jansen, Ron J. Weiss, Hakan Erdogan, John R. Hershey:
Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation. WASPAA 2021: 51-55
- [c19] Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani:
DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement. WASPAA 2021: 161-165
- [c18] Eduardo Fonseca, Aren Jansen, Daniel P. W. Ellis, Scott Wisdom, Marco Tagliasacchi, John R. Hershey, Manoj Plakal, Shawn Hershey, R. Channing Moore, Xavier Serra:
Self-Supervised Learning from Automatically Separated Sound Scenes. WASPAA 2021: 251-255
- [i22] Soumi Maiti, Hakan Erdogan, Kevin W. Wilson, Scott Wisdom, Shinji Watanabe, John R. Hershey:
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings. CoRR abs/2105.02096 (2021)
- [i21] Eduardo Fonseca, Aren Jansen, Daniel P. W. Ellis, Scott Wisdom, Marco Tagliasacchi, John R. Hershey, Manoj Plakal, Shawn Hershey, R. Channing Moore, Xavier Serra:
Self-Supervised Learning from Automatically Separated Sound Scenes. CoRR abs/2105.02132 (2021)
- [i20] Scott Wisdom, Aren Jansen, Ron J. Weiss, Hakan Erdogan, John R. Hershey:
Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation. CoRR abs/2106.00847 (2021)
- [i19] Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey:
Improving On-Screen Sound Separation for Open Domain Videos with Audio-Visual Self-attention. CoRR abs/2106.09669 (2021)
- [i18] Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani:
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement. CoRR abs/2106.15813 (2021)
- [i17] Aswin Sivaraman, Scott Wisdom, Hakan Erdogan, John R. Hershey:
Adapting Speech Separation to Real-World Meetings Using Mixture Invariant Training. CoRR abs/2110.10739 (2021)
- 2020
- [c17] Nicolas Turpault, Scott Wisdom, Hakan Erdogan, John R. Hershey, Romain Serizel, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Improving Sound Event Detection in Domestic Environments using Sound Separation. DCASE 2020: 205-209
- [c16] Efthymios Tzinis, Scott Wisdom, John R. Hershey, Aren Jansen, Daniel P. W. Ellis:
Improving Universal Sound Separation Using Sound Classification. ICASSP 2020: 96-100
- [c15] Samuel Sonning, Christian Schüldt, Hakan Erdogan, Scott Wisdom:
Performance Study of a Convolutional Time-Domain Audio Separation Network for Real-Time Speech Denoising. ICASSP 2020: 831-835
- [c14] Scott Wisdom, Efthymios Tzinis, Hakan Erdogan, Ron J. Weiss, Kevin W. Wilson, John R. Hershey:
Unsupervised Sound Separation Using Mixture Invariant Training. NeurIPS 2020
- [i16] Scott Wisdom, Efthymios Tzinis, Hakan Erdogan, Ron J. Weiss, Kevin W. Wilson, John R. Hershey:
Unsupervised Sound Separation Using Mixtures of Mixtures. CoRR abs/2006.12701 (2020)
- [i15] Nicolas Turpault, Scott Wisdom, Hakan Erdogan, John R. Hershey, Romain Serizel, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Improving Sound Event Detection In Domestic Environments Using Sound Separation. CoRR abs/2007.03932 (2020)
- [i14] Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John R. Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Sound Event Detection and Separation: a Benchmark on Desed Synthetic Soundscapes. CoRR abs/2011.00801 (2020)
- [i13] Scott Wisdom, Hakan Erdogan, Daniel P. W. Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John R. Hershey:
What's All the FUSS About Free Universal Sound Separation Data? CoRR abs/2011.00803 (2020)
- [i12] Efthymios Tzinis, Scott Wisdom, Aren Jansen, Shawn Hershey, Tal Remez, Daniel P. W. Ellis, John R. Hershey:
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds. CoRR abs/2011.01143 (2020)
- [i11] Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Mao-Kui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis. CoRR abs/2011.02014 (2020)
2010 – 2019
- 2019
- [c13] Jonathan Le Roux, Scott Wisdom, Hakan Erdogan, John R. Hershey:
SDR - Half-baked or Well Done? ICASSP 2019: 626-630
- [c12] Scott Wisdom, John R. Hershey, Kevin W. Wilson, Jeremy Thorpe, Michael Chinen, Brian Patton, Rif A. Saurous:
Differentiable Consistency Constraints for Improved Deep Speech Enhancement. ICASSP 2019: 900-904
- [c11] Ilya Kavalerov, Scott Wisdom, Hakan Erdogan, Brian Patton, Kevin W. Wilson, Jonathan Le Roux, John R. Hershey:
Universal Sound Separation. WASPAA 2019: 175-179
- [i10] Mohamed Ezzeldin A. Elshaer, Scott Wisdom, Taniya Mishra:
Transfer Learning From Sound Representations For Anger Detection in Speech. CoRR abs/1902.02120 (2019)
- [i9] Ilya Kavalerov, Scott Wisdom, Hakan Erdogan, Brian Patton, Kevin W. Wilson, Jonathan Le Roux, John R. Hershey:
Universal Sound Separation. CoRR abs/1905.03330 (2019)
- [i8] Efthymios Tzinis, Scott Wisdom, John R. Hershey, Aren Jansen, Daniel P. W. Ellis:
Improving Universal Sound Separation Using Sound Classification. CoRR abs/1911.07951 (2019)
- [i7] Zhong-Qiu Wang, Scott Wisdom, Kevin W. Wilson, John R. Hershey:
Alternating Between Spectral and Spatial Estimation for Speech Separation and Enhancement. CoRR abs/1911.07953 (2019)
- 2018
- [i6] Jonathan Le Roux, Scott Wisdom, Hakan Erdogan, John R. Hershey:
SDR - half-baked or well done? CoRR abs/1811.02508 (2018)
- [i5] Scott Wisdom, John R. Hershey, Kevin W. Wilson, Jeremy Thorpe, Michael Chinen, Brian Patton, Rif A. Saurous:
Differentiable Consistency Constraints for Improved Deep Speech Enhancement. CoRR abs/1811.08521 (2018)
- 2017
- [c10] Scott Wisdom, Thomas Powers, James W. Pitton, Les E. Atlas:
Building recurrent networks by unfolding iterative thresholding for sequential sparse recovery. ICASSP 2017: 4346-4350
- [c9] Scott Wisdom, Thomas Powers, James W. Pitton, Les Atlas:
Deep recurrent NMF for speech separation by unfolding iterative thresholding. WASPAA 2017: 254-258
- [p1] John R. Hershey, Jonathan Le Roux, Shinji Watanabe, Scott Wisdom, Zhuo Chen, Yusuf Ziya Isik:
Novel Deep Architectures in Speech Processing. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 135-164
- [i4] Scott Wisdom, Thomas Powers, James W. Pitton, Les Atlas:
Deep Recurrent NMF for Speech Separation by Unfolding Iterative Thresholding. CoRR abs/1709.07124 (2017)
- 2016
- [c8] Scott Wisdom, Les E. Atlas, James W. Pitton, Greg Okopal:
Benefits of noncircular statistics for nonstationary signals. ACSSC 2016: 554-558
- [c7] Scott Wisdom, John R. Hershey, Jonathan Le Roux, Shinji Watanabe:
Deep unfolding for multichannel source separation. ICASSP 2016: 121-125
- [c6] Scott Wisdom, Les Atlas, James W. Pitton:
On spectral noncircularity of natural signals. SAM 2016: 1-5
- [c5] Scott Wisdom, Thomas Powers, John R. Hershey, Jonathan Le Roux, Les E. Atlas:
Full-Capacity Unitary Recurrent Neural Networks. NIPS 2016: 4880-4888
- [i3] Scott Wisdom, Thomas Powers, John R. Hershey, Jonathan Le Roux, Les E. Atlas:
Full-Capacity Unitary Recurrent Neural Networks. CoRR abs/1611.00035 (2016)
- [i2] Scott Wisdom, Thomas Powers, James W. Pitton, Les E. Atlas:
Interpretable Recurrent Neural Networks Using Sequential Sparse Recovery. CoRR abs/1611.07252 (2016)
- 2015
- [j1] Greg Okopal, Scott Wisdom, Les Atlas:
Speech Analysis With the Strong Uncorrelating Transform. IEEE ACM Trans. Audio Speech Lang. Process. 23(11): 1858-1868 (2015)
- [c4] Scott Wisdom, Greg Okopal, Les E. Atlas, James W. Pitton:
Voice activity detection using subband noncircularity. ICASSP 2015: 4505-4509
- [i1] Scott Wisdom, Thomas Powers, Les Atlas, James W. Pitton:
Enhancement and Recognition of Reverberant and Noisy Speech by Extending Its Coherence. CoRR abs/1509.00533 (2015)
- 2014
- [c3] Greg Okopal, Scott Wisdom, Les Atlas:
Estimating the noncircularity of latent components within complex-valued subband mixtures with applications to speech processing. ACSSC 2014: 1405-1409
- [c2] Scott Wisdom, James W. Pitton, Les Atlas:
Extending coherence for optimal detection of nonstationary harmonic signals. ACSSC 2014: 1784-1788
- [c1] Scott Wisdom, Les Atlas, James Pitton:
Extending coherence time for analysis of modulated random processes. ICASSP 2014: 340-344
last updated on 2024-11-14 21:00 CET by the dblp team
all metadata released as open data under CC0 1.0 license