TASLP-II: Vol 21, No 7

Volume 21, Issue 7, July 2013

Publisher: IEEE Press

ISSN: 1558-7916

research-article

Spectral and Spatial Multichannel Analysis/Synthesis of Interior Aircraft Sounds

Pages 1317–1329. https://doi.org/10.1109/TASL.2013.2248712

A method for spectral and spatial multichannel analysis/synthesis of interior aircraft sounds is presented. We propose two extensions of the classical sinusoids+noise model, adapted to multichannel stationary sounds. First, a spectral estimator is ...

research-article

Model-Based Unsupervised Spoken Term Detection with Spoken Queries

Pages 1330–1342. https://doi.org/10.1109/TASL.2013.2248714

We present a set of model-based approaches for unsupervised spoken term detection (STD) with spoken queries that requires neither speech recognition nor annotated data. This work shows the possibilities in migrating from DTW-based to model-based ...

research-article

On the Time-Domain Widely Linear LCMV Filter for Noise Reduction With a Stereo System

Pages 1343–1354. https://doi.org/10.1109/TASL.2013.2248719

This paper deals with the problem of noise reduction in stereo sound systems where the objective is not only to reduce noise, but also to preserve the spatial information of both the desired speech and noise sources so that the listener can still ...

research-article

CLOSE—A Data-Driven Approach to Speech Separation

Pages 1355–1368. https://doi.org/10.1109/TASL.2013.2250959

This paper studies single-channel speech separation, assuming unknown, arbitrary temporal dynamics for the speech signals to be separated. A data-driven approach is described, which matches each mixed speech segment against a composite training segment ...

research-article

Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function

Pages 1369–1380. https://doi.org/10.1109/TASL.2013.2250960

A dereverberation technique has been developed that optimally combines multichannel inverse filtering (MIF), beamforming (BF), and non-linear reverberation suppression (NRS). It is robust against acoustic transfer function (ATF) fluctuations and creates ...

research-article

Towards Scaling Up Classification-Based Speech Separation

Pages 1381–1390. https://doi.org/10.1109/TASL.2013.2250961

Formulating speech separation as a binary classification problem has been shown to be effective. While good separation performance is achieved in matched test conditions using kernel support vector machines (SVMs), separation in unmatched conditions ...

research-article

Sparse Reverberant Audio Source Separation via Reweighted Analysis

Pages 1391–1402. https://doi.org/10.1109/TASL.2013.2250962

We propose a novel algorithm for source signals estimation from an underdetermined convolutive mixture assuming known mixing filters. Most of the state-of-the-art methods are dealing with anechoic or short reverberant mixture, assuming a synthesis ...

research-article

MDCT Sinusoidal Analysis for Audio Signals Analysis and Processing

Pages 1403–1414. https://doi.org/10.1109/TASL.2013.2250963

The Modified Discrete Cosine Transform (MDCT) is widely used in audio signals compression, but mostly limited to representing audio signals. This is because the MDCT is a real transform: Phase information is missing and spectral power varies frame to ...

research-article

A Symmetric Kernel Partial Least Squares Framework for Speaker Recognition

Pages 1415–1423. https://doi.org/10.1109/TASL.2013.2253096

I-vectors are concise representations of speaker characteristics. Recent progress in i-vectors related research has utilized their ability to capture speaker and channel variability to develop efficient automatic speaker verification (ASV) systems. ...

research-article

Ranking Through Clustering: An Integrated Approach to Multi-Document Summarization

Pages 1424–1433. https://doi.org/10.1109/TASL.2013.2253098

Multi-document summarization aims to create a condensed summary while retaining the main characteristics of the original set of documents. Under such background, sentence ranking has hitherto been the issue of most concern. Since documents often cover a ...

research-article

Model-Based Inversion of Dynamic Range Compression

Pages 1434–1444. https://doi.org/10.1109/TASL.2013.2253099

In this work it is shown how a dynamic nonlinear time-variant operator, such as a dynamic range compressor, can be inverted using an explicit signal model. By knowing the model parameters that were used for compression one is able to recover the ...

research-article

Stochastic-Deterministic MMSE STFT Speech Enhancement With General A Priori Information

Pages 1445–1457. https://doi.org/10.1109/TASL.2013.2253100

A wide range of Bayesian short-time spectral amplitude (STSA) speech enhancement algorithms exist, varying in both the statistical model used for speech and the cost functions considered. Current algorithms of this class consistently assume that the ...

research-article

On Acoustic Emotion Recognition: Compensating for Covariate Shift

Pages 1458–1468. https://doi.org/10.1109/TASL.2013.2255278

Pattern recognition tasks often face the situation that training data are not fully representative of test data. This problem is well-recognized in speech recognition, where methods like cepstral mean normalization (CMN), vocal tract length ...

research-article

Towards Abstractive Speech Summarization: Exploring Unsupervised and Supervised Approaches for Spoken Utterance Compression

Pages 1469–1480. https://doi.org/10.1109/TASL.2013.2255279

Most previous studies on speech summarization focus on the extractive approaches. Yet directly concatenating the extracted speech utterances may not form a good summary due to the presence of disfluencies and redundancy in the unplanned spontaneous ...

research-article

A Perceptual Study on Velvet Noise and Its Variants at Different Pulse Densities

Pages 1481–1488. https://doi.org/10.1109/TASL.2013.2255281

This paper investigates sparse noise sequences, including the previously proposed velvet noise and its novel variants defined here. All sequences consist of sample values minus one, zero, and plus one only, and the location and the sign of each impulse ...

research-article

Parametric Audio Coding With Exponentially Damped Sinusoids

Pages 1489–1501. https://doi.org/10.1109/TASL.2013.2255284

Sinusoidal modeling is one of the most popular techniques for low bitrate audio coding. Usually, the sinusoidal parameters (amplitude, pulsation and phase of each sinusoidal component) are kept constant within a time segment. An alternative model, the ...

research-article

Functional Link Adaptive Filters for Nonlinear Acoustic Echo Cancellation

Pages 1502–1512. https://doi.org/10.1109/TASL.2013.2255276

This paper introduces a new class of nonlinear adaptive filters, whose structure is based on Hammerstein model. Such filters derive from the functional link adaptive filter (FLAF) model, defined by a nonlinear input expansion, which enhances the ...

research-article

Performance of the SDW-MWF With Randomly Located Microphones in a Reverberant Enclosure

Pages 1513–1523. https://doi.org/10.1109/TASL.2013.2255280

Beamforming with wireless acoustic sensor networks (WASNs) has recently drawn the attention of the research community. As the number of microphones grows it is difficult, and in some applications impossible, to determine their layout beforehand. A ...

research-article

Modeling of Complex Geometries and Boundary Conditions in Finite Difference/Finite Volume Time Domain Room Acoustics Simulation

Stefan Bilbao

Pages 1524–1533. https://doi.org/10.1109/TASL.2013.2256897

Due to recent increases in computing power, room acoustics simulation in 3D using time stepping schemes is becoming a viable alternative to standard methods based on ray tracing and the image source method. Finite Difference Time Domain (FDTD) methods, ...

IEEE Transactions on Audio, Speech, and Language Processing
