research-article
Spectral and Spatial Multichannel Analysis/Synthesis of Interior Aircraft Sounds

A method for spectral and spatial multichannel analysis/synthesis of interior aircraft sounds is presented. We propose two extensions of the classical sinusoids+noise model, adapted to multichannel stationary sounds. First, a spectral estimator is ...

research-article
Model-Based Unsupervised Spoken Term Detection with Spoken Queries

We present a set of model-based approaches for unsupervised spoken term detection (STD) with spoken queries that require neither speech recognition nor annotated data. This work shows the possibilities in migrating from DTW-based to model-based ...

research-article
On the Time-Domain Widely Linear LCMV Filter for Noise Reduction With a Stereo System

This paper deals with the problem of noise reduction in stereo sound systems where the objective is not only to reduce noise, but also to preserve the spatial information of both the desired speech and noise sources so that the listener can still ...

research-article
CLOSE—A Data-Driven Approach to Speech Separation

This paper studies single-channel speech separation, assuming unknown, arbitrary temporal dynamics for the speech signals to be separated. A data-driven approach is described, which matches each mixed speech segment against a composite training segment ...

research-article
Optimized Speech Dereverberation From Probabilistic Perspective for Time Varying Acoustic Transfer Function

A dereverberation technique has been developed that optimally combines multichannel inverse filtering (MIF), beamforming (BF), and non-linear reverberation suppression (NRS). It is robust against acoustic transfer function (ATF) fluctuations and creates ...

research-article
Towards Scaling Up Classification-Based Speech Separation

Formulating speech separation as a binary classification problem has been shown to be effective. While good separation performance is achieved in matched test conditions using kernel support vector machines (SVMs), separation in unmatched conditions ...

research-article
Sparse Reverberant Audio Source Separation via Reweighted Analysis

We propose a novel algorithm for source signal estimation from an underdetermined convolutive mixture, assuming known mixing filters. Most state-of-the-art methods deal with anechoic or short reverberant mixtures, assuming a synthesis ...

research-article
MDCT Sinusoidal Analysis for Audio Signals Analysis and Processing

The Modified Discrete Cosine Transform (MDCT) is widely used in audio signal compression, but mostly limited to representing audio signals. This is because the MDCT is a real transform: phase information is missing and spectral power varies frame to ...

research-article
A Symmetric Kernel Partial Least Squares Framework for Speaker Recognition

I-vectors are concise representations of speaker characteristics. Recent progress in i-vectors related research has utilized their ability to capture speaker and channel variability to develop efficient automatic speaker verification (ASV) systems. ...

research-article
Ranking Through Clustering: An Integrated Approach to Multi-Document Summarization

Multi-document summarization aims to create a condensed summary while retaining the main characteristics of the original set of documents. Against this background, sentence ranking has so far been the issue of greatest concern. Since documents often cover a ...

research-article
Model-Based Inversion of Dynamic Range Compression

In this work it is shown how a dynamic nonlinear time-variant operator, such as a dynamic range compressor, can be inverted using an explicit signal model. By knowing the model parameters that were used for compression one is able to recover the ...

research-article
Stochastic-Deterministic MMSE STFT Speech Enhancement With General A Priori Information

A wide range of Bayesian short-time spectral amplitude (STSA) speech enhancement algorithms exist, varying in both the statistical model used for speech and the cost functions considered. Current algorithms of this class consistently assume that the ...

research-article
On Acoustic Emotion Recognition: Compensating for Covariate Shift

Pattern recognition tasks often face the situation that training data are not fully representative of test data. This problem is well-recognized in speech recognition, where methods like cepstral mean normalization (CMN), vocal tract length ...

research-article
Towards Abstractive Speech Summarization: Exploring Unsupervised and Supervised Approaches for Spoken Utterance Compression

Most previous studies on speech summarization focus on the extractive approaches. Yet directly concatenating the extracted speech utterances may not form a good summary due to the presence of disfluencies and redundancy in the unplanned spontaneous ...

research-article
A Perceptual Study on Velvet Noise and Its Variants at Different Pulse Densities

This paper investigates sparse noise sequences, including the previously proposed velvet noise and its novel variants defined here. All sequences consist of sample values minus one, zero, and plus one only, and the location and the sign of each impulse ...

research-article
Parametric Audio Coding With Exponentially Damped Sinusoids

Sinusoidal modeling is one of the most popular techniques for low bitrate audio coding. Usually, the sinusoidal parameters (amplitude, pulsation and phase of each sinusoidal component) are kept constant within a time segment. An alternative model, the ...

research-article
Functional Link Adaptive Filters for Nonlinear Acoustic Echo Cancellation

This paper introduces a new class of nonlinear adaptive filters whose structure is based on the Hammerstein model. Such filters derive from the functional link adaptive filter (FLAF) model, defined by a nonlinear input expansion, which enhances the ...

research-article
Performance of the SDW-MWF With Randomly Located Microphones in a Reverberant Enclosure

Beamforming with wireless acoustic sensor networks (WASNs) has recently drawn the attention of the research community. As the number of microphones grows it is difficult, and in some applications impossible, to determine their layout beforehand. A ...

research-article
Modeling of Complex Geometries and Boundary Conditions in Finite Difference/Finite Volume Time Domain Room Acoustics Simulation

Due to recent increases in computing power, room acoustics simulation in 3D using time stepping schemes is becoming a viable alternative to standard methods based on ray tracing and the image source method. Finite Difference Time Domain (FDTD) methods, ...
