Scott Wisdom

2020 – today

2025
[j2]
persistent URL: https://dblp.org/rec/journals/csl/LeglaiveFEBSWPHPB25
Simon Leglaive, Matthieu Fraticelli, Hend Elghazaly, Léonie Borne, Mostafa Sadeghi, Scott Wisdom, Manuel Pariente, John R. Hershey, Daniel Pressnitzer, Jon P. Barker:
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge. Comput. Speech Lang. 89: 101685 (2025)
2024
[c36]
persistent URL: https://dblp.org/rec/conf/icassp/HanWWH24
Cong Han, Kevin W. Wilson, Scott Wisdom, John R. Hershey:
Unsupervised Multi-Channel Separation And Adaptation. ICASSP 2024: 721-725
[i32]
persistent URL: https://dblp.org/rec/journals/corr/abs-2402-01413
Simon Leglaive, Matthieu Fraticelli, Hend Elghazaly, Léonie Borne, Mostafa Sadeghi, Scott Wisdom, Manuel Pariente, John R. Hershey, Daniel Pressnitzer, Jon P. Barker:
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge. CoRR abs/2402.01413 (2024)
[i31]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-18239
Artem Dementyev, Chandan K. A. Reddy, Scott Wisdom, Navin Chatlani, John R. Hershey, Richard F. Lyon:
Towards sub-millisecond latency real-time speech enhancement models on hearables. CoRR abs/2409.18239 (2024)
2023
[c35]
persistent URL: https://dblp.org/rec/conf/icassp/ReddyWGHK23
Pradyumna Reddy, Scott Wisdom, Klaus Greff, John R. Hershey, Thomas Kipf:
Audioslots: A Slot-Centric Generative Model For Audio Separation. ICASSP Workshops 2023: 1-5
[c34]
persistent URL: https://dblp.org/rec/conf/interspeech/ErdoganWCBTZH23
Hakan Erdogan, Scott Wisdom, Xuankai Chang, Zalán Borsos, Marco Tagliasacchi, Neil Zeghidour, John R. Hershey:
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition. INTERSPEECH 2023: 3462-3466
[i30]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-05591
Pradyumna Reddy, Scott Wisdom, Klaus Greff, John R. Hershey, Thomas Kipf:
AudioSlots: A slot-centric generative model for audio separation. CoRR abs/2305.05591 (2023)
[i29]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-11151
Cong Han, Kevin W. Wilson, Scott Wisdom, John R. Hershey:
Unsupervised Multi-channel Separation and Adaptation. CoRR abs/2305.11151 (2023)
[i28]
persistent URL: https://dblp.org/rec/journals/corr/abs-2307-03533
Simon Leglaive, Léonie Borne, Efthymios Tzinis, Mostafa Sadeghi, Matthieu Fraticelli, Scott Wisdom, Manuel Pariente, Daniel Pressnitzer, John R. Hershey:
The CHiME-7 UDASE task: Unsupervised domain adaptation for conversational speech enhancement. CoRR abs/2307.03533 (2023)
[i27]
persistent URL: https://dblp.org/rec/journals/corr/abs-2308-10415
Hakan Erdogan, Scott Wisdom, Xuankai Chang, Zalán Borsos, Marco Tagliasacchi, Neil Zeghidour, John R. Hershey:
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition. CoRR abs/2308.10415 (2023)
2022
[c33]
persistent URL: https://dblp.org/rec/conf/eccv/TzinisWRH22
Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey:
AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation. ECCV (37) 2022: 368-385
[c32]
persistent URL: https://dblp.org/rec/conf/icassp/DentonWH22
Tom Denton, Scott Wisdom, John R. Hershey:
Improving Bird Classification with Unsupervised Sound Separation. ICASSP 2022: 636-640
[c31]
persistent URL: https://dblp.org/rec/conf/icassp/SivaramanWEH22
Aswin Sivaraman, Scott Wisdom, Hakan Erdogan, John R. Hershey:
Adapting Speech Separation to Real-World Meetings using Mixture Invariant Training. ICASSP 2022: 686-690
[c30]
persistent URL: https://dblp.org/rec/conf/interspeech/MuckenhirnSEQTW22
Hannah Muckenhirn, Aleksandr Safin, Hakan Erdogan, Felix de Chaumont Quitry, Marco Tagliasacchi, Scott Wisdom, John R. Hershey:
CycleGAN-based Unpaired Speech Dereverberation. INTERSPEECH 2022: 196-200
[c29]
persistent URL: https://dblp.org/rec/conf/interspeech/PattersonWWH22
Katharine Patterson, Kevin W. Wilson, Scott Wisdom, John R. Hershey:
Distance-Based Sound Separation. INTERSPEECH 2022: 901-905
[c28]
persistent URL: https://dblp.org/rec/conf/interspeech/YangWGLS22
Samuel J. Yang, Scott Wisdom, Chet Gnegy, Richard F. Lyon, Sagar Savla:
Listening with Googlears: Low-Latency Neural Multiframe Beamforming and Equalization for Hearing Aids. INTERSPEECH 2022: 3939-3943
[c27]
persistent URL: https://dblp.org/rec/conf/interspeech/KilgourGHJWT22
Kevin Kilgour, Beat Gfeller, Qingqing Huang, Aren Jansen, Scott Wisdom, Marco Tagliasacchi:
Text-Driven Separation of Arbitrary Sounds. INTERSPEECH 2022: 5403-5407
[i26]
persistent URL: https://dblp.org/rec/journals/corr/abs-2203-15652
Hannah Muckenhirn, Aleksandr Safin, Hakan Erdogan, Felix de Chaumont Quitry, Marco Tagliasacchi, Scott Wisdom, John R. Hershey:
CycleGAN-Based Unpaired Speech Dereverberation. CoRR abs/2203.15652 (2022)
[i25]
persistent URL: https://dblp.org/rec/journals/corr/abs-2204-05738
Kevin Kilgour, Beat Gfeller, Qingqing Huang, Aren Jansen, Scott Wisdom, Marco Tagliasacchi:
Text-Driven Separation of Arbitrary Sounds. CoRR abs/2204.05738 (2022)
[i24]
persistent URL: https://dblp.org/rec/journals/corr/abs-2207-00562
Katharine Patterson, Kevin W. Wilson, Scott Wisdom, John R. Hershey:
Distance-Based Sound Separation. CoRR abs/2207.00562 (2022)
[i23]
persistent URL: https://dblp.org/rec/journals/corr/abs-2207-10141
Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey:
AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation. CoRR abs/2207.10141 (2022)
2021
[c26]
persistent URL: https://dblp.org/rec/conf/icassp/WisdomEESTFSSH21
Scott Wisdom, Hakan Erdogan, Daniel P. W. Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John R. Hershey:
What's all the Fuss about Free Universal Sound Separation Data? ICASSP 2021: 186-190
[c25]
persistent URL: https://dblp.org/rec/conf/icassp/TurpaultSWEHFSS21
Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John R. Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Sound Event Detection and Separation: A Benchmark on Desed Synthetic Soundscapes. ICASSP 2021: 840-844
[c24]
persistent URL: https://dblp.org/rec/conf/icassp/MaitiEWW0H21
Soumi Maiti, Hakan Erdogan, Kevin W. Wilson, Scott Wisdom, Shinji Watanabe, John R. Hershey:
End-To-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings. ICASSP 2021: 7183-7187
[c23]
persistent URL: https://dblp.org/rec/conf/iclr/TzinisWJHREH21
Efthymios Tzinis, Scott Wisdom, Aren Jansen, Shawn Hershey, Tal Remez, Dan Ellis, John R. Hershey:
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds. ICLR 2021
[c22]
persistent URL: https://dblp.org/rec/conf/slt/RajDCEHH0DYLKLW21
Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Maokui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis. SLT 2021: 897-904
[c21]
persistent URL: https://dblp.org/rec/conf/slt/WangEWWR0CH21
Zhong-Qiu Wang, Hakan Erdogan, Scott Wisdom, Kevin W. Wilson, Desh Raj, Shinji Watanabe, Zhuo Chen, John R. Hershey:
Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement. SLT 2021: 905-911
[c20]
persistent URL: https://dblp.org/rec/conf/waspaa/WisdomJWEH21
Scott Wisdom, Aren Jansen, Ron J. Weiss, Hakan Erdogan, John R. Hershey:
Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation. WASPAA 2021: 51-55
[c19]
persistent URL: https://dblp.org/rec/conf/waspaa/KoizumiKWEHJB21
Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani:
DF-Conformer: Integrated Architecture of Conv-Tasnet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement. WASPAA 2021: 161-165
[c18]
persistent URL: https://dblp.org/rec/conf/waspaa/FonsecaJEWTHPHM21
Eduardo Fonseca, Aren Jansen, Daniel P. W. Ellis, Scott Wisdom, Marco Tagliasacchi, John R. Hershey, Manoj Plakal, Shawn Hershey, R. Channing Moore, Xavier Serra:
Self-Supervised Learning from Automatically Separated Sound Scenes. WASPAA 2021: 251-255
[i22]
persistent URL: https://dblp.org/rec/journals/corr/abs-2105-02096
Soumi Maiti, Hakan Erdogan, Kevin W. Wilson, Scott Wisdom, Shinji Watanabe, John R. Hershey:
End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings. CoRR abs/2105.02096 (2021)
[i21]
persistent URL: https://dblp.org/rec/journals/corr/abs-2105-02132
Eduardo Fonseca, Aren Jansen, Daniel P. W. Ellis, Scott Wisdom, Marco Tagliasacchi, John R. Hershey, Manoj Plakal, Shawn Hershey, R. Channing Moore, Xavier Serra:
Self-Supervised Learning from Automatically Separated Sound Scenes. CoRR abs/2105.02132 (2021)
[i20]
persistent URL: https://dblp.org/rec/journals/corr/abs-2106-00847
Scott Wisdom, Aren Jansen, Ron J. Weiss, Hakan Erdogan, John R. Hershey:
Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation. CoRR abs/2106.00847 (2021)
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-2106-09669
Efthymios Tzinis, Scott Wisdom, Tal Remez, John R. Hershey:
Improving On-Screen Sound Separation for Open Domain Videos with Audio-Visual Self-attention. CoRR abs/2106.09669 (2021)
[i18]
persistent URL: https://dblp.org/rec/journals/corr/abs-2106-15813
Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani:
DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement. CoRR abs/2106.15813 (2021)
[i17]
persistent URL: https://dblp.org/rec/journals/corr/abs-2110-10739
Aswin Sivaraman, Scott Wisdom, Hakan Erdogan, John R. Hershey:
Adapting Speech Separation to Real-World Meetings Using Mixture Invariant Training. CoRR abs/2110.10739 (2021)
2020
[c17]
persistent URL: https://dblp.org/rec/conf/dcase/TurpaultWEHSFSS20
Nicolas Turpault, Scott Wisdom, Hakan Erdogan, John R. Hershey, Romain Serizel, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Improving Sound Event Detection in Domestic Environments using Sound Separation. DCASE 2020: 205-209
[c16]
persistent URL: https://dblp.org/rec/conf/icassp/TzinisWHJE20
Efthymios Tzinis, Scott Wisdom, John R. Hershey, Aren Jansen, Daniel P. W. Ellis:
Improving Universal Sound Separation Using Sound Classification. ICASSP 2020: 96-100
[c15]
persistent URL: https://dblp.org/rec/conf/icassp/SonningSEW20
Samuel Sonning, Christian Schüldt, Hakan Erdogan, Scott Wisdom:
Performance Study of a Convolutional Time-Domain Audio Separation Network for Real-Time Speech Denoising. ICASSP 2020: 831-835
[c14]
persistent URL: https://dblp.org/rec/conf/nips/WisdomTEWWH20
Scott Wisdom, Efthymios Tzinis, Hakan Erdogan, Ron J. Weiss, Kevin W. Wilson, John R. Hershey:
Unsupervised Sound Separation Using Mixture Invariant Training. NeurIPS 2020
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2006-12701
Scott Wisdom, Efthymios Tzinis, Hakan Erdogan, Ron J. Weiss, Kevin W. Wilson, John R. Hershey:
Unsupervised Sound Separation Using Mixtures of Mixtures. CoRR abs/2006.12701 (2020)
[i15]
persistent URL: https://dblp.org/rec/journals/corr/abs-2007-03932
Nicolas Turpault, Scott Wisdom, Hakan Erdogan, John R. Hershey, Romain Serizel, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Improving Sound Event Detection In Domestic Environments Using Sound Separation. CoRR abs/2007.03932 (2020)
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2011-00801
Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John R. Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon:
Sound Event Detection and Separation: a Benchmark on Desed Synthetic Soundscapes. CoRR abs/2011.00801 (2020)
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2011-00803
Scott Wisdom, Hakan Erdogan, Daniel P. W. Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John R. Hershey:
What's All the FUSS About Free Universal Sound Separation Data? CoRR abs/2011.00803 (2020)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2011-01143
Efthymios Tzinis, Scott Wisdom, Aren Jansen, Shawn Hershey, Tal Remez, Daniel P. W. Ellis, John R. Hershey:
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds. CoRR abs/2011.01143 (2020)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2011-02014
Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Mao-Kui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis. CoRR abs/2011.02014 (2020)

2010 – 2019

2019
[c13]
persistent URL: https://dblp.org/rec/conf/icassp/RouxWEH19
Jonathan Le Roux, Scott Wisdom, Hakan Erdogan, John R. Hershey:
SDR - Half-baked or Well Done? ICASSP 2019: 626-630
[c12]
persistent URL: https://dblp.org/rec/conf/icassp/WisdomHWTCPS19
Scott Wisdom, John R. Hershey, Kevin W. Wilson, Jeremy Thorpe, Michael Chinen, Brian Patton, Rif A. Saurous:
Differentiable Consistency Constraints for Improved Deep Speech Enhancement. ICASSP 2019: 900-904
[c11]
persistent URL: https://dblp.org/rec/conf/waspaa/KavalerovWEPWRH19
Ilya Kavalerov, Scott Wisdom, Hakan Erdogan, Brian Patton, Kevin W. Wilson, Jonathan Le Roux, John R. Hershey:
Universal Sound Separation. WASPAA 2019: 175-179
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-1902-02120
Mohamed Ezzeldin A. Elshaer, Scott Wisdom, Taniya Mishra:
Transfer Learning From Sound Representations For Anger Detection in Speech. CoRR abs/1902.02120 (2019)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-1905-03330
Ilya Kavalerov, Scott Wisdom, Hakan Erdogan, Brian Patton, Kevin W. Wilson, Jonathan Le Roux, John R. Hershey:
Universal Sound Separation. CoRR abs/1905.03330 (2019)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-1911-07951
Efthymios Tzinis, Scott Wisdom, John R. Hershey, Aren Jansen, Daniel P. W. Ellis:
Improving Universal Sound Separation Using Sound Classification. CoRR abs/1911.07951 (2019)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-1911-07953
Zhong-Qiu Wang, Scott Wisdom, Kevin W. Wilson, John R. Hershey:
Alternating Between Spectral and Spatial Estimation for Speech Separation and Enhancement. CoRR abs/1911.07953 (2019)
2018
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-1811-02508
Jonathan Le Roux, Scott Wisdom, Hakan Erdogan, John R. Hershey:
SDR - half-baked or well done? CoRR abs/1811.02508 (2018)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-1811-08521
Scott Wisdom, John R. Hershey, Kevin W. Wilson, Jeremy Thorpe, Michael Chinen, Brian Patton, Rif A. Saurous:
Differentiable Consistency Constraints for Improved Deep Speech Enhancement. CoRR abs/1811.08521 (2018)
2017
[c10]
persistent URL: https://dblp.org/rec/conf/icassp/WisdomPPA17
Scott Wisdom, Thomas Powers, James W. Pitton, Les E. Atlas:
Building recurrent networks by unfolding iterative thresholding for sequential sparse recovery. ICASSP 2017: 4346-4350
[c9]
persistent URL: https://dblp.org/rec/conf/waspaa/WisdomPPA17
Scott Wisdom, Thomas Powers, James W. Pitton, Les Atlas:
Deep recurrent NMF for speech separation by unfolding iterative thresholding. WASPAA 2017: 254-258
[p1]
persistent URL: https://dblp.org/rec/books/sp/17/HersheyRWWCI17
John R. Hershey, Jonathan Le Roux, Shinji Watanabe, Scott Wisdom, Zhuo Chen, Yusuf Ziya Isik:
Novel Deep Architectures in Speech Processing. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 135-164
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-1709-07124
Scott Wisdom, Thomas Powers, James W. Pitton, Les Atlas:
Deep Recurrent NMF for Speech Separation by Unfolding Iterative Thresholding. CoRR abs/1709.07124 (2017)
2016
[c8]
persistent URL: https://dblp.org/rec/conf/acssc/WisdomAPO16
Scott Wisdom, Les E. Atlas, James W. Pitton, Greg Okopal:
Benefits of noncircular statistics for nonstationary signals. ACSSC 2016: 554-558
[c7]
persistent URL: https://dblp.org/rec/conf/icassp/WisdomHRW16
Scott Wisdom, John R. Hershey, Jonathan Le Roux, Shinji Watanabe:
Deep unfolding for multichannel source separation. ICASSP 2016: 121-125
[c6]
persistent URL: https://dblp.org/rec/conf/ieeesam/WisdomAP16
Scott Wisdom, Les Atlas, James W. Pitton:
On spectral noncircularity of natural signals. SAM 2016: 1-5
[c5]
persistent URL: https://dblp.org/rec/conf/nips/WisdomPHRA16
Scott Wisdom, Thomas Powers, John R. Hershey, Jonathan Le Roux, Les E. Atlas:
Full-Capacity Unitary Recurrent Neural Networks. NIPS 2016: 4880-4888
[i3]
persistent URL: https://dblp.org/rec/journals/corr/WisdomPHRA16
Scott Wisdom, Thomas Powers, John R. Hershey, Jonathan Le Roux, Les E. Atlas:
Full-Capacity Unitary Recurrent Neural Networks. CoRR abs/1611.00035 (2016)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/WisdomPPA16
Scott Wisdom, Thomas Powers, James W. Pitton, Les E. Atlas:
Interpretable Recurrent Neural Networks Using Sequential Sparse Recovery. CoRR abs/1611.07252 (2016)
2015
[j1]
persistent URL: https://dblp.org/rec/journals/taslp/OkopalWA15
Greg Okopal, Scott Wisdom, Les Atlas:
Speech Analysis With the Strong Uncorrelating Transform. IEEE ACM Trans. Audio Speech Lang. Process. 23(11): 1858-1868 (2015)
[c4]
persistent URL: https://dblp.org/rec/conf/icassp/WisdomOAP15
Scott Wisdom, Greg Okopal, Les E. Atlas, James W. Pitton:
Voice activity detection using subband noncircularity. ICASSP 2015: 4505-4509
[i1]
persistent URL: https://dblp.org/rec/journals/corr/WisdomPAP15
Scott Wisdom, Thomas Powers, Les Atlas, James W. Pitton:
Enhancement and Recognition of Reverberant and Noisy Speech by Extending Its Coherence. CoRR abs/1509.00533 (2015)
2014
[c3]
persistent URL: https://dblp.org/rec/conf/acssc/OkopalWA14
Greg Okopal, Scott Wisdom, Les Atlas:
Estimating the noncircularity of latent components within complex-valued subband mixtures with applications to speech processing. ACSSC 2014: 1405-1409
[c2]
persistent URL: https://dblp.org/rec/conf/acssc/WisdomPA14
Scott Wisdom, James W. Pitton, Les Atlas:
Extending coherence for optimal detection of nonstationary harmonic signals. ACSSC 2014: 1784-1788
[c1]
persistent URL: https://dblp.org/rec/conf/icassp/WisdomAP14
Scott Wisdom, Les Atlas, James W. Pitton:
Extending coherence time for analysis of modulated random processes. ICASSP 2014: 340-344