Open Access

Towards Spike-Based Speech Processing: A Biologically Plausible Approach to Simple Acoustic Classification

and

John Harris

Harris, John

| Jun 16, 2008

International Journal of Applied Mathematics and Computer Science

Volume 18 (2008): Issue 2 (June 2008)

About this article

Cite

Page range: 129 - 137

DOI: https://doi.org/10.2478/v10006-008-0012-0

Keywords
Spike coding, synchrony coding, phase locking, speech perception, psychoacoustics, speech recognition

This content is open access.

Shortcomings of automatic speech recognition (ASR) applications are becoming more evident as they are more widely used in real life. The inherent non-stationarity associated with the timing of speech signals as well as the dynamical changes in the environment make the ensuing analysis and recognition extremely difficult. Researchers often turn to biology seeking clues to make better engineered systems, and ASR is no exception with the usage of feature sets such as Mel frequency cepstral coefficients, which employ filter banks similar to cochlear filter banks in frequency distribution and bandwidth. In this paper, we delve deeper into the mechanics of the human auditory system to take this biological inspiration to the next level. The main goal of this research is to investigate the computation potential of spike trains produced at the early stages of the auditory system for a simple acoustic classification task. First, various spike coding schemes from temporal to rate coding are explored, together with various spike-based encoders with various simplicity levels such as rank order coding and liquid state machine. Based on these findings, a biologically plausible system architecture is proposed for the recognition of phonetically simple acoustic signals which makes exclusive use of spikes for computation. The performance tests show superior performance on a noisy vowel data set when compared with a conventional ASR system.

eISSN:
Language:: English

Publication timeframe:: 4 times per year
Journal Subjects:: Mathematics, Applied Mathematics

Journal RSS Feed

Towards Spike-Based Speech Processing: A Biologically Plausible Approach to Simple Acoustic Classification

Published Online: Jun 16, 2008

Page range: 129 - 137

KeywordsSpike coding, synchrony coding, phase locking, speech perception, psychoacoustics, speech recognition

This content is open access.

Keywords
Spike coding, synchrony coding, phase locking, speech perception, psychoacoustics, speech recognition