Non-linear Prediction of Speech Signal Using Artificial Neural Nets

K. Ashouri, M. Amini & M. H. Savoji

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2510)

Included in the following conference series:

Eurasian Conference on Information and Communication Technology

Abstract

Speech technology is a key part of Information Technology because it is an important aspect of Human Computer Interaction. Prediction of the speech signal has applications in speech technology, especially in coding, where linear prediction is conventionally used. However, non-linear phenomena exist in speech production, and taking this non-linearity into account should lead to lower signal dynamics during coding, with a consequent reduction in bit-rate and required bandwidth. This paper studies the non-linear prediction of speech segments as long as a whole vowel, using neural nets. It is shown that, in this case, non-linear speech prediction does not lead to an appreciable further reduction in the residual signal.
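
The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the signal is a synthetic vowel-like waveform rather than recorded speech, and the non-linear predictor is a single tanh hidden layer with random, fixed weights whose output weights are solved by least squares, instead of a trained neural net. The names `lagged_matrix` and `residual_energy`, the predictor order, and the hidden-layer size are all assumptions made for illustration.

```python
import numpy as np


def lagged_matrix(x, order):
    """Return (past, target): each row of `past` holds the `order`
    samples that precede the matching entry of `target`."""
    rows = [x[n - order:n][::-1] for n in range(order, len(x))]
    return np.array(rows), x[order:]


def residual_energy(features, targets):
    """Least-squares fit of targets from features; return the residual energy."""
    weights, *_ = np.linalg.lstsq(features, targets, rcond=None)
    residual = targets - features @ weights
    return float(np.sum(residual ** 2))


rng = np.random.default_rng(0)

# Synthetic stand-in for a vowel (assumption): three harmonics of 120 Hz,
# a little noise, and a mild tanh distortion to introduce non-linearity.
fs = 8000
t = np.arange(2000) / fs
x = sum(a * np.sin(2 * np.pi * 120 * k * t)
        for k, a in [(1, 1.0), (2, 0.6), (3, 0.3)])
x = np.tanh(1.5 * (x + 0.01 * rng.standard_normal(len(t))))

order = 10  # hypothetical predictor order
past, target = lagged_matrix(x, order)

# Linear prediction: each sample is a weighted sum of the previous samples.
linear_energy = residual_energy(past, target)

# Non-linear prediction (illustrative only): the same past samples plus
# 20 random tanh hidden units, with output weights fitted by least squares.
hidden = np.tanh(past @ rng.standard_normal((order, 20)))
nonlinear_energy = residual_energy(np.hstack([past, hidden]), target)

signal_energy = float(np.sum(target ** 2))
gain_linear_db = 10 * np.log10(signal_energy / linear_energy)
gain_nonlinear_db = 10 * np.log10(signal_energy / nonlinear_energy)
print(f"linear prediction gain:     {gain_linear_db:.2f} dB")
print(f"non-linear prediction gain: {gain_nonlinear_db:.2f} dB")
```

Because the non-linear feature set contains the linear one, its residual energy on the fitted segment cannot exceed that of the linear predictor. The question of interest is how large the difference is, and the abstract reports that for vowel-length speech segments it is not appreciable.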

Author information

Authors and Affiliations

Electrical and Computer Engineering Faculty, Shahid Beheshti University, Evin Square, Tehran, 1983963113, Iran
K. Ashouri, M. Amini & M. H. Savoji

Editor information

Editors and Affiliations

Faculty of Mathematics and Computer Science Computer Science Department, Shahid Bahonar University, 22 Bahman Bulvard, Kerman, Iran
Hassan Shafazand
Fraunhofer IPSI, Dolivostr. 15, 64293, Darmstadt, Germany
Hassan Shafazand
Institute of Software Technology, Vienna University of Technology, Favoritenstr. 9/188, 1040, Vienna, Austria
A. Min Tjoa

About this paper

Cite this paper

Ashouri, K., Amini, M., Savoji, M.H. (2002). Non-linear Prediction of Speech Signal Using Artificial Neural Nets. In: Shafazand, H., Tjoa, A.M. (eds) EurAsia-ICT 2002: Information and Communication Technology. EurAsia-ICT 2002. Lecture Notes in Computer Science, vol 2510. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36087-5_25

DOI: https://doi.org/10.1007/3-540-36087-5_25
Published: 10 October 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00028-0
Online ISBN: 978-3-540-36087-2
eBook Packages: Springer Book Archive
