Nothing Special   »   [go: up one dir, main page]

Skip to main content

Prediction Model of Multi-channel Audio Quality Based on Multiple Linear Regression

  • Conference paper
  • First Online:
Advances in Multimedia Information Processing -- PCM 2015 (PCM 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9314))

Included in the following conference series:

  • 1835 Accesses

Abstract

Perceived audio quality is an important metric to measure the perception degradation of multi-channel audio signals especially for coding and rendering systems. Conventional objective quality measurement such as PEAQ (Perceptual Evaluation of Audio Quality) is limited to describe both the basic audio quality and the spatial impression. A novel prediction model is proposed to predict the subjective quality of 5.1-channels audio systems. Two attributes are included in the evaluation including basic quality and surround effects. Multiple Linear Regression (MLR) combined with Principal Component Analysis (PCA) is used to establish the prediction model from the objective parameters to subjective audio quality. Data set for model training and testing is obtained from formal listening tests under different coding conditions. Preliminary experiment results with 5.1-channels audio show that the proposed model can predict multi-channel audio quality more accurately than the conventional PEAQ method considering both the basic audio quality and the surround effects.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. ISO/IEC 13818-3: Information technology-Generic coding of moving pictures and associated audio information – Part 3: Audio (1998)

    Google Scholar 

  2. ISO/IEC 13818-7: Information technology - Generic coding of moving pictures and associated audio information-Part 7: Advanced Audio Coding (AAC) (2006)

    Google Scholar 

  3. ISO/IEC JTC1/SC29/WG11: Information technology - report on the verification tests of MPEG-D MPEG surround (2007)

    Google Scholar 

  4. ITU-R BS.775-2: Multichannel stereophonic sound system with and without accompanying picture (2006)

    Google Scholar 

  5. Cheng, Y., Ruimin, H., Liuyue, S., et al.: A 3D audio coding technique based on extracting the distance parameter. In: IEEE International Conference on Multimedia and Expo, pp. 1–6. IEEE Press, California (2014)

    Google Scholar 

  6. Bin, C., Christian, R., Ian, S.B., Xiguang, Z.: A general compression approach to multi-channel three-dimensional audio. IEEE Trans. Audio Speech Lang. Process. 21(8), 1676–1688 (2013)

    Article  Google Scholar 

  7. ITU-R BS.1116-1: Methods for the subjective assessment of small impairments in audio systems including multichannel sound systems, Geneva, Switzerland (1997)

    Google Scholar 

  8. ITU-R BS.1285: Pre-selection methods for the subjective assessment of small impairments in audio systems, Geneva, Switzerland (1997)

    Google Scholar 

  9. ITU-R BS.1534: Method for the subjective assessment of intermediate quality level of coding systems, Geneva, Switzerland (2001)

    Google Scholar 

  10. ITU-R BS.1387-1: Method for objective measurements of perceived audio quality, Geneva, Switzerland (2001)

    Google Scholar 

  11. Inyong, C., Shinn-Cunningham, B.G., Sang, B.C., Sung, K.-M.: Objective measurement of perceived auditory quality in multichannel audio compression coding systems. J. Audio Eng. Soc. 56, 3–17 (2008)

    Google Scholar 

  12. Schafer, M., Bahram, M., Vary, P.: An extension of the PEAQ measure by a binaural hearing model. In: International Conference on Acoustics, Speech and Signal Processing, pp. 8164–8168. IEEE Press, Vancouver (2013)

    Google Scholar 

  13. Smimite, A., Beghdadi, A., Chen, K., Jafjaf, O.: A new approach for spatial audio quality assessment. In: International Conference on Telecommunications and Multimedia, pp. 46–51. IEEE Press, Greece (2014)

    Google Scholar 

  14. Jeroen, B., Par, S.V.D., Armin, K., Erik, S., Jeroen, B., Erik, S.: Parametric coding of stereo audio. EURASIP J. Adv. Signal Process. 9, 1305–1322 (2005)

    MATH  Google Scholar 

  15. Faller, C., Baumgarte, F.: Binaural cue coding: a novel and efficient representation of spatial audio. In: International Conference on Acoustics, Speech and Signal Processing, pp. 1841–1844. IEEE Press, Florida (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jing Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Wang, J., Zhao, Y., Li, W., Wang, F., Fei, Z., Xie, X. (2015). Prediction Model of Multi-channel Audio Quality Based on Multiple Linear Regression. In: Ho, YS., Sang, J., Ro, Y., Kim, J., Wu, F. (eds) Advances in Multimedia Information Processing -- PCM 2015. PCM 2015. Lecture Notes in Computer Science(), vol 9314. Springer, Cham. https://doi.org/10.1007/978-3-319-24075-6_66

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-24075-6_66

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24074-9

  • Online ISBN: 978-3-319-24075-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics