WASPAA 2021: New Paltz, NY, USA
- IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2021, New Paltz, NY, USA, October 17-20, 2021. IEEE 2021, ISBN 978-1-6654-4870-3
- Ryan M. Corey, Andrew C. Singer:
Adaptive Binaural Filtering for a Multiple-Talker Listening System Using Remote and On-Ear Microphones. 1-5
- Shahan Nercessian:
End-to-End Zero-Shot Voice Conversion Using a DDSP Vocoder. 1-5
- Shoichi Koyama, Tomoya Nishida, Keisuke Kimura, Takumi Abe, Natsuki Ueno, Jesper Brunnström:
MESHRIR: A Dataset of Room Impulse Responses on Meshed Grid Points for Evaluating Sound Field Analysis and Synthesis Methods. 1-5
- Pranay Manocha, Anurag Kumar, Buye Xu, Anjali Menon, Israel D. Gebru, Vamsi K. Ithapu, Paul Calamia:
DPLM: A Deep Perceptual Spatial-Audio Localization Metric. 6-10
- Zhixing Liu, Yannan Wang, Gaoxiong Yi, Tao Yu, Fei Chen:
Assessing Segmental Impact for Objective Speech Quality Evaluation. 11-15
- Ahmed Alghamdi, Wai-Yip Chan, Daniel Fogerty, Jesper Jensen:
Improved Intelligibility Prediction in the Modulation Domain. 16-20
- Ryo Tanabe, Harsh Purohit, Kota Dohi, Takashi Endo, Yuki Nikaido, Toshiki Nakamura, Yohei Kawaguchi:
MIMII DUE: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection with Domain Shifts Due to Changes in Operational and Environmental Conditions. 21-25
- Benjamin Elizalde, Radu Revutchi, Samarjit Das, Bhiksha Raj, Ian R. Lane, Laurie M. Heller:
Identifying Actions for Sound Event Classification. 26-30
- Krishna Subramani, Paris Smaragdis:
Point Cloud Audio Processing. 31-35
- Yu Wang, Nicholas J. Bryan, Justin Salamon, Mark Cartwright, Juan Pablo Bello:
Who Calls The Shots? Rethinking Few-Shot Learning for Audio. 36-40
- Zhepei Wang, Jonah Casebeer, Adam Clemmitt, Efthymios Tzinis, Paris Smaragdis:
Sound Event Detection with Adaptive Frequency Selection. 41-45
- Efthymios Tzinis, Jonah Casebeer, Zhepei Wang, Paris Smaragdis:
Separate But Together: Unsupervised Federated Learning for Speech Enhancement from Non-IID Data. 46-50
- Scott Wisdom, Aren Jansen, Ron J. Weiss, Hakan Erdogan, John R. Hershey:
Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation. 51-55
- Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux:
Convolutive Prediction for Reverberant Speech Separation. 56-60
- Aurora Cramer, Mark Cartwright, Fatemeh Pishdadian, Juan Pablo Bello:
Weakly Supervised Source-Specific Sound Level Estimation in Noisy Soundscapes. 61-65
- Ahmed Mustafa, Jan Büthe, Srikanth Korse, Kishan Gupta, Guillaume Fuchs, Nicola Pia:
A Streamwise GAN Vocoder for Wideband Speech Coding at Very Low Bit Rate. 66-70
- Santiago Pascual, Joan Serrà, Jordi Pons:
Adversarial Auto-Encoding for Packet Loss Concealment. 71-75
- Daniel T. Jones, Dushyant Sharma, Stanislav Yu. Kruchinin, Patrick A. Naylor:
Spatial Coding for Microphone Arrays Using IPNLMS-Based RTF Estimation. 76-80
- Hsuan-Yang Wang, Philip Nelson, Christine Evers:
Excitation-Inhibition Cell Activity Patterns for Binaural Source Localisation. 81-85
- Hongmei Hu, Stephan Dieter Ewert:
Speech Intelligibility of Mandarin- and German-Speaking Listeners in Challenging Conditions. 86-90
- Matteo Torcoli, Jouni Paulus, Thorsten Kastner, Christian Uhle:
Controlling the Remixing of Separated Dialogue with a Non-Intrusive Quality Estimate. 91-95
- Benjamin Stahl, Alois Sontacchi:
SIDIQ: Computational Quality Assessment of Enhanced Speech Based on Auditory Figure-Ground Segregation, Similarity, and Disturbance. 96-100
- Amir Ivry, Israel Cohen, Baruch Berdugo:
Objective Metrics to Evaluate Residual-Echo Suppression During Double-Talk. 101-105
- Raffaele Malvermi, Fabio Antonacci, Augusto Sarti, Roberto Corradi:
Prediction of Missing Frequency Response Functions Through Deep Image Prior. 106-110
- Giorgia Cantisani, Alexey Ozerov, Slim Essid, Gaël Richard:
User-Guided One-Shot Deep Model Adaptation for Music Source Separation. 111-115
- Javier Nistal, Cyran Aouameur, Stefan Lattner, Gaël Richard:
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding. 116-120
- Christof Weiß, Geoffroy Peeters:
Learning Multi-Pitch Estimation from Weakly Aligned Score-Audio Pairs Using a Multi-Label CTC Loss. 121-125
- Guillaume Carbajal, Julius Richter, Timo Gerkmann:
Disentanglement Learning for Variational Autoencoders Applied to Audio-Visual Speech Enhancement. 126-130
- Andreas Brendel, Walter Kellermann:
Fasteriva: Update Rules for Independent Vector Analysis Based on Negentropy and the Majorize-Minimize Principle. 131-135
- Sebastian Braun, Ivan Tashev:
Low Complexity Online Convolutional Beamforming. 136-140
- Osman Asif Malik, Venkatalakshmi Vyjayanthi Narumanchi, Stephen Becker, Todd W. Murray:
Superresolution Photoacoustic Tomography Using Random Speckle Illumination and Second Order Moments. 141-145
- Wangyou Zhang, Jing Shi, Chenda Li, Shinji Watanabe, Yanmin Qian:
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions. 146-150
- Amy Bastine, Thushara D. Abhayapala, Jihui Zhang:
Analysis of Frequency-Dependent Behavior of Room Reflections Using Spherical Microphone Measurements & Von Mises-Fisher Clustering. 156-160
- Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani:
DF-Conformer: Integrated Architecture of Conv-TasNet and Conformer Using Linear Complexity Self-Attention for Speech Enhancement. 161-165
- Jiaqi Su, Zeyu Jin, Adam Finkelstein:
HiFi-GAN-2: Studio-Quality Speech Enhancement via Generative Adversarial Networks Conditioned on Acoustic Features. 166-170
- Aswin Sivaraman, Minje Kim:
Zero-Shot Personalized Speech Enhancement Through Speaker-Informed Model Selection. 171-175
- Sunwoo Kim, Minje Kim:
Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot Learning with Knowledge Distillation. 176-180
- Enis Berk Çoban, Ali Raza Syed, Dara Pir, Michael I. Mandel:
Towards Large Scale Ecoacoustic Monitoring with Small Amounts of Labeled Data. 181-185
- Gordon Wichern, Ankush Chakrabarty, Zhong-Qiu Wang, Jonathan Le Roux:
Anomalous Sound Detection Using Attentive Neural Processes. 186-190
- Debottam Dutta, Purvi Agrawal, Sriram Ganapathy:
A Multi-Head Relevance Weighting Framework for Learning Raw Waveform Audio Representations. 191-195
- Donmoon Lee, Kyogu Lee:
Cross-Domain Semi-Supervised Audio Event Classification Using Contrastive Regularization. 196-200
- Vincent W. Neo, Christine Evers, Patrick A. Naylor:
Polynomial Matrix Eigenvalue Decomposition-Based Source Separation Using Informed Spherical Microphone Arrays. 201-205
- Thomas Dietzen, Enzo De Sena, Toon van Waterschoot:
Low-Complexity Steered Response Power Mapping Based on Nyquist-Shannon Sampling. 206-210
- Sharath Adavanne, Archontis Politis, Tuomas Virtanen:
Differentiable Tracking-Based Training of Deep Learning Sound Source Localizers. 211-215
- Daniele Salvati, Carlo Drioli, Gian Luca Foresti:
Spherical Harmonic Diagonal Unloading Beamforming with Ego-Noise Reduction for DOA Estimation from Autonomous Systems. 216-220
- Christian J. Steinmetz, Vamsi Krishna Ithapu, Paul Calamia:
Filtered Noise Shaping for Time Domain Room Impulse Response Estimation from Reverberant Speech. 221-225
- Prerak Srivastava, Antoine Deleforge, Emmanuel Vincent:
Blind Room Parameter Estimation Using Multiple Multichannel Speech Recordings. 226-230
- Jens Ahrens, Hannes Helmholz, David Lou Alon, Sebastià V. Amengual Garí:
Spherical Harmonic Decomposition of a Sound Field Based on Microphones Around the Circumference of a Human Head. 231-235
- Maximilian Kentgens, Peter Jax:
Ambient-Aware Sound Field Translation Using Optimal Spatial Filtering. 236-240
- Ege Erdem, Orhun Olgun, Hüseyin Hacihabiboglu:
Internal Time Delay Calibration of Rigid Spherical Microphone Arrays for Multi-Perspective 6DoF Audio Recordings. 241-245
- Irene Martín-Morató, Manu Harju, Annamaria Mesaros:
Crowdsourcing Strong Labels for Sound Event Detection. 246-250
- Eduardo Fonseca, Aren Jansen, Daniel P. W. Ellis, Scott Wisdom, Marco Tagliasacchi, John R. Hershey, Manoj Plakal, Shawn Hershey, R. Channing Moore, Xavier Serra:
Self-Supervised Learning from Automatically Separated Sound Scenes. 251-255
- Jun Deng, Chunhui Gao, Qian Feng, Xinzhou Xu, Zhaopeng Chen:
Adaptive Generalized Cross-Entropy Loss for Sound Event Classification with Noisy Labels. 256-260
- Ryosuke Horiuchi, Shoichi Koyama, Juliano G. C. Ribeiro, Natsuki Ueno, Hiroshi Saruwatari:
Kernel Learning for Sound Field Estimation with L1 and L2 Regularizations. 261-265
- Jingwei Xi, Wen Zhang, Thushara D. Abhayapala:
Magnitude Modelling of Individualized HRTFs Using DNN Based Spherical Harmonic Analysis. 266-270
- Yi Ren, Yoichi Haneda:
2D Local Exterior Sound Field Reproduction Using an Addition Theorem Based on Circular Harmonic Expansion. 271-275
- Takuma Okamoto:
2D Multizone Sound Field Synthesis with Interior-Exterior Ambisonics. 276-280
- Keisuke Kimura, Shoichi Koyama, Natsuki Ueno, Hiroshi Saruwatari:
Mean-Square-Error-Based Secondary Source Placement in Sound Field Synthesis with Prior Information on Desired Field. 281-285
- Hanwen Bi, Fei Ma, Thushara D. Abhayapala, Prasanga N. Samarasinghe:
Spherical Array Based Drone Noise Measurements and Modelling for Drone Noise Reduction via Propeller Phase Control. 286-290
- Jonah Casebeer, Nicholas J. Bryan, Paris Smaragdis:
Auto-DSP: Learning to Optimize Acoustic Echo Cancellers. 291-295
- Naoki Murata, Yuhta Takida, Tetsu Magariyachi:
Fast Convergent Method for Active Noise Control Over Spatial Region with Causal Constraint. 296-300
- Huiyuan Sun, Jihui Zhang, Thushara D. Abhayapala, Prasanga N. Samarasinghe:
Active Noise Control Over 3D Space with Remote Microphone Technique in the Wave Domain. 301-305
- Tamara Smyth, Devansh Zurale:
On the Role of Lip Reflection/Transmission in the Relationship Between LPC and Waveguide Vocal Tract Models. 311-315
- Darius Petermann, Seungkwon Beack, Minje Kim:
HARP-Net: Hyper-Autoencoded Reconstruction Propagation for Scalable Neural Audio Coding. 316-320
- François G. Germain:
Periodic Analysis of Nonlinear Virtual Analog Models. 321-325
- Aidan O. T. Hogg, Vincent W. Neo, Stephan Weiss, Christine Evers, Patrick A. Naylor:
A Polynomial Eigenvalue Decomposition MUSIC Approach for Broadband Sound Source Localization. 326-330
- Daniel Aleksander Krause, Archontis Politis, Annamaria Mesaros:
Joint Direction and Proximity Classification of Overlapping Sound Events from Binaural Audio. 331-335
- Pierre-Amaury Grumiaux, Srdan Kitic, Prerak Srivastava, Laurent Girin, Alexandre Guérin:
SALADnet: Self-Attentive Multisource Localization in the Ambisonics Domain. 336-340
- Christoph Kirsch, Stephan Dieter Ewert:
Low-Order Filter Approximation of Diffraction for Virtual Acoustics. 341-345
- Thomas Deppisch, Jens Ahrens, Sebastià V. Amengual Garí, Paul Calamia:
Spatial Subtraction of Reflections from Room Impulse Responses Measured with a Spherical Microphone Array. 346-350
- Achille Aknin, Roland Badeau:
Stochastic Reverberation Model with a Frequency Dependent Attenuation. 351-355
- Paula Sánchez López, Paul Callens, Milos Cernak:
A Universal Deep Room Acoustics Estimator. 356-360
- Christoph Hold, Sebastian J. Schlecht, Archontis Politis, Ville Pulkki:
Spatial Filter Bank in the Spherical Harmonic Domain: Reconstruction and Application. 361-365
- Stefano Damiano, Federico Borra, Alberto Bernardini, Fabio Antonacci, Augusto Sarti:
Soundfield Reconstruction in Reverberant Rooms Based on Compressive Sensing and Image-Source Models of Early Reflections. 366-370
- Leo McCormack, Archontis Politis, Ville Pulkki:
Rendering of Source Spread for Arbitrary Playback Setups Based on Spatial Covariance Matching. 371-375