Three-dimensional audio parametric encoding based on perceptual characteristics of spatial cue
Pages 2727 - 2736
Abstract
Three-dimensional sound effects require a considerable number of sound channels, causing audio-visual spatial orientation sense distortion under circumstances in which the code rate is restricted by the limitation of transmission channel bandwidth and storage capacity. As a result, existing 3D audio systems are incompatible with real-time broadcasting and home theatre applications, severely limiting the application and development of 3D audio systems. By investigating the mechanism of orientation parameters perceptual redundancy, this paper studied 3D spatial orientation cue perceptual characteristics, established an orientation cue perceptual model, developed a heterogeneous quantification table accordingly, and controlled the differences between each quantified value below the quantitative value perception threshold. Using this method, only the information perceptible to the human ear was quantified and perceptual distortions were minimized. The experimental results revealed that, compared to the SLQP method, the quantified bit of the proposed method was reduced by 8.66% in low resolution and 65.23% in high resolution. In addition, the accuracy of this method was higher than that of the SLQP method, enabling better alignment with human perceptual characteristics.
References
[1]
Mills AW 1958 On the minimum audible angle The Journal of the Acoustical Society of America 30 237 -246
[2]
Cheng B 2001 Spatial squeezing techniques for low bit-rate multichannel audio coding University of Wollongong Ph.D. Dissertation
[3]
Cheng B, Ritz C, and Burnett I 2008 A spatial squeezing approach to ambisonic audio compression IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 369 -372 Las Vegas
[4]
Cheng B, Ritz C, and Burnett I 2008 Psychoacoustic-based quantisation of spatial audio cues Electronics Letters 44 1098 -1099
[5]
Faller C and Baumgarte F 2003 Binaural cue coding-Part II: Schemes and applications IEEE Transactions on Speech and Audio Processing 11 520 -531
[6]
Hellerud E, Solvang A, and Svensson P 2009 Spatial redundancy in higher order ambisonics and its use for low-delay lossless compression IEEE International Conference on Acoustics, Speech and Signal Processing(ICASSP) 269 -272 Taipei
[7]
Pinto F and Vetterli M 2010 Space-time-frequency processing of acoustic wave fields: Theory, algorithms, and applications IEEE Transactions on Signal Processing 58 4608 -4620
[8]
ISO/IEC JTC1/SC29/WG11 (MPEG), Call for proposals on spatial audio coding, Doc. N6455, Munich, Germany, 2004
[9]
Johnston JD and Ferreira AJ 1992 Sum-difference stereo transform coding IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 569 -572 California
[10]
Herre J, Brandenburg K, and Lederer D 1994 Audio Engineering Society Convention 96 of Audio Engineering Society Intensity stereo coding 3799 Amsterdam
[11]
Brandenburg K 1999 MP3 and AAC explained 17th International Conference on High-Quality Audio Coding of Audio Engineering Society 17 -009 Florence
[12]
Marinus MB, de PJ, and Werner B 2000 108th AES Convention On the applicability of distributed mode loudspeaker panels for wave field synthesis based sound reproduction 5165 Paris
[13]
Goodwin MM and Jot J 2007 Primary-ambient signal decomposition and vector-based localization for spatial audio coding and enhancement IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 1 -9 Honolulu
[14]
Sakaida S, Iguchi K, Nakajima N, Nishidam Y, Ichigaya A, Nakasu E, Kurozumi M, and Gohshi S 2007 The super Hi-vision codec IEEE International Conference on Image Processing 21 -24 San Antonio
Index terms have been assigned to the content through auto-classification.
Recommendations
Comments
Please enable JavaScript to view thecomments powered by Disqus.Information & Contributors
Information
Published In
IOS Press and the authors. All rights reserved.
This is an open access article distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC 4.0) License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Publisher
IOS Press
Netherlands
Publication History
Published: 21 November 2015
Author Tags
Qualifiers
- Research-article
Contributors
Other Metrics
Bibliometrics & Citations
Bibliometrics
Article Metrics
- 0Total Citations
- 0Total Downloads
- Downloads (Last 12 months)0
- Downloads (Last 6 weeks)0
Reflects downloads up to 05 Mar 2025