research-article

Three-dimensional audio parametric encoding based on perceptual characteristics of spatial cue

Published: 21 November 2015

Abstract

Three-dimensional sound effects require a large number of audio channels. When the bit rate is restricted by transmission bandwidth and storage capacity, this distorts the listener's sense of audio-visual spatial orientation. As a result, existing 3D audio systems are ill-suited to real-time broadcasting and home theatre applications, which severely limits their adoption and development. By investigating the perceptual redundancy of orientation parameters, this paper studied the perceptual characteristics of 3D spatial orientation cues, established an orientation cue perceptual model, derived a non-uniform (heterogeneous) quantization table from it, and kept the difference between adjacent quantized values below the perception threshold. With this method, only the information perceptible to the human ear was quantized, and perceptual distortion was minimized. The experimental results showed that, compared with the SLQP method, the proposed method reduced the number of quantization bits by 8.66% at low resolution and 65.23% at high resolution. Its accuracy was also higher than that of the SLQP method, so it aligns better with human perceptual characteristics.
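The idea of a non-uniform quantization table bounded by a perception threshold can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the `jnd_deg` curve is a hypothetical placeholder for the orientation cue perceptual model, the 0 to 90 degree azimuth range is arbitrary, and the helper names `build_table`, `quantize`, and `bits_needed` are invented here.

```python
import bisect
import math


def jnd_deg(azimuth_deg):
    """Hypothetical just-noticeable difference (degrees) at a given azimuth.

    Placeholder curve, NOT the paper's model: 1 degree straight ahead,
    rising to 5 degrees at the side (90 degrees).
    """
    return 1.0 + 4.0 * abs(math.sin(math.radians(azimuth_deg)))


def build_table(start, stop, margin=0.9):
    """Build non-uniform levels whose spacing stays below the local JND.

    Each step is margin * JND at the current level, so with margin < 1
    the gap to the next level is smaller than the threshold there.
    """
    levels = [start]
    while levels[-1] < stop:
        step = margin * jnd_deg(levels[-1])
        levels.append(min(levels[-1] + step, stop))
    return levels


def quantize(value, levels):
    """Return the index of the level nearest to value."""
    i = bisect.bisect_left(levels, value)
    if i == 0:
        return 0
    if i == len(levels):
        return len(levels) - 1
    return i if levels[i] - value < value - levels[i - 1] else i - 1


def bits_needed(n_levels):
    """Bits required to index n_levels with a fixed-length code."""
    return max(1, math.ceil(math.log2(n_levels)))


table = build_table(0.0, 90.0)
# Uniform baseline: the finest step (at 0 degrees) used everywhere.
uniform_levels = math.ceil(90.0 / (0.9 * jnd_deg(0.0))) + 1

print(len(table), bits_needed(len(table)))
print(uniform_levels, bits_needed(uniform_levels))
```

Under these assumptions the table is dense where the placeholder threshold is small and sparse where it is large, so it needs fewer levels than a uniform table built with the finest step. The savings printed by this sketch depend entirely on the made-up curve and say nothing about the figures reported above.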



          Published In

          Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology  Volume 29, Issue 6
          The fuzzy system and its application in East Asia
          Oct 2015
          479 pages
          This is an open access article distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC 4.0) License, which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

          Publisher

          IOS Press

          Netherlands

          Author Tags

          1. 3D Audio
          2. orientation cue
          3. perceptual characteristic
          4. parameter coding
