Abstract
Binaural cue coding (BCC) was introduced as an efficient representation method for MPEG-4 Spatial Audio Coding (SAC). At low bit rates, however, the spectrum of the BCC output signal is perceptually degraded. The system proposed in this paper estimates virtual source location information (VSLI) as its side information. VSLI represents the spatial image between channels as an angle on the playback layout. Subjective assessment results show that the proposed method provides better audio quality than BCC when encoding multi-channel signals.
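The abstract does not say how the angle is computed, so the following is only a minimal sketch of the general idea of replacing an inter-channel level cue with a single angle. It assumes a constant-power panning law between two adjacent loudspeakers; that law, the function names `estimate_angle` and `gains_from_angle`, and the ±30° loudspeaker positions are illustrative assumptions and are not taken from the paper.

```python
import math

def estimate_angle(e_a, e_b, az_a, az_b):
    """Encoder side (assumed model): map the energy split between two
    adjacent channels to one azimuth in degrees on the playback layout.

    e_a, e_b: non-negative band energies of the two channels.
    az_a, az_b: loudspeaker azimuths in degrees (hypothetical layout values).
    """
    if e_a <= 0.0 and e_b <= 0.0:
        # Silent band: no cue available, fall back to the midpoint.
        return 0.5 * (az_a + az_b)
    # Constant-power panning: gains are (cos t, sin t) with t in [0, pi/2].
    t = math.atan2(math.sqrt(e_b), math.sqrt(e_a))
    # Stretch the panning angle linearly onto the arc between the speakers.
    return az_a + (t / (math.pi / 2)) * (az_b - az_a)

def gains_from_angle(azimuth, az_a, az_b):
    """Decoder side (assumed model): recover the two channel gains
    from the transmitted azimuth."""
    t = (azimuth - az_a) / (az_b - az_a) * (math.pi / 2)
    return math.cos(t), math.sin(t)

if __name__ == "__main__":
    # Energies 0.64 and 0.36 correspond to gains 0.8 and 0.6.
    angle = estimate_angle(0.64, 0.36, -30.0, 30.0)
    print(angle, gains_from_angle(angle, -30.0, 30.0))
```

Under this assumed model a single angle per band stands in for the level relationship between the two channels, and the decoder can rebuild the channel gains from that angle and the downmix energy alone.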
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Moon, Hg., Seo, Ji., Beak, S., Sung, KM. (2005). A Multi-channel Audio Compression Method with Virtual Source Location Information. In: Ho, YS., Kim, H.J. (eds) Advances in Multimedia Information Processing - PCM 2005. PCM 2005. Lecture Notes in Computer Science, vol 3767. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581772_65
DOI: https://doi.org/10.1007/11581772_65
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-30027-4
Online ISBN: 978-3-540-32130-9
eBook Packages: Computer Science (R0)