default search action
SAPA@INTERSPEECH 2004: Jeju Island, Korea
- ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, ICC, Jeju, Korea, October 3, 2004. ISCA 2004
- Futoshi Asano, Hideki Asoh:
Sound source localization and separation based on the EM algorithm. 37 - Matti Ryynänen, Anssi Klapuri:
Modelling of note events for singing transcription. 40 - Stefan Winter, Hiroshi Sawada, Shoko Araki, Shoji Makino:
Hierarchical clustering applied to overcomplete BSS for convolutive mixtures. 48 - Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno:
Drum sound identification for polyphonic music using template adaptation and matching methods. 51 - Yasunari Obuchi:
Multiple-microphone robust speech recognition using decoder-based channel selection. 52 - Tomohiro Nakatani, Keisuke Kinoshita, Masato Miyoshi, Parham Zolfaghari:
Harmonicity based blind dereverberation with time warping. 53 - Tuomas Virtanen:
Separation of sound sources by convolutive sparse coding. 55 - Guoning Hu, DeLiang Wang:
Auditory segmentation based on event detection. 62 - Plamen J. Prodanov, Andrzej Drygajlo:
Bayesian networks for error handling through multimodality fusion in spoken dialogues with mobile robots. 70 - Werner Hemmert, Marcus Holmberg, David Gelbart:
Auditory-based automatic speech recognition. 74 - Hugo Bastos de Paula, Hani C. Yehia, Mauricio Alves Loureiro:
Representation and classification of the timbre space of a single musical instrument. 86 - Guillaume Lathoud, Iain McCowan:
A sector-based approach for localization of multiple speakers with microphone arrays. 93 - Daniel P. W. Ellis, Keansub Lee:
Features for segmenting and classifying long-duration recordings of "personal" audio. 106 - Chunghsin Yeh, Axel Röbel:
Physical principles driven joint evaluation of multiple f0 hypotheses. 109 - Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano:
MAP estimation of speech spectral component under GGD a priori. 115 - Shigeki Sagayama, Keigo Takahashi, Hirokazu Kameoka, Takuya Nishimoto:
Specmurt anasylis: a piano-roll-visualization of polyphonic music signal by deconvolution of log-frequency spectrum. 128 - Marios Athineos, Hynek Hermansky, Daniel P. W. Ellis:
PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns. 129 - Hynek Hermansky:
Stochastic techniques in deriving perceptual knowledge. 136 - Manuel Reyes-Gomez, Nebojsa Jojic, Daniel P. W. Ellis:
Towards single-channel unsupervised source separation of speech mixtures: the layered harmonics/formants separation-tracking model. 137 - John R. Hershey, Trausti T. Kristjansson, Zhengyou Zhang:
Model-based fusion of bone and air sensors for speech enhancement and robust speech recognition. 139 - Aarthi M. Reddy, Bhiksha Raj:
Soft mask estimation for single channel speaker separation. 158 - Paris Smaragdis:
Discovering auditory objects through non-negativity constraints. 161
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.