Abstract
With a standard compact disc (CD) audio player, the only possibility for the user is to listen to the recorded track, passively: the interaction is limited to changing the global volume or the track. Imagine now that the listener can turn into a musician, playing with the sound sources present in the stereo mix, changing their respective volumes and locations in space. For example, a given instrument or voice can be either muted, amplified, or more generally moved in the acoustic space. This will be a kind of generalized karaoke, useful for disc jockeys and also for music pedagogy (when practicing an instrument). Our system shows that this dream has come true, with active CDs fully backward compatible while enabling interactive music. The magic is that “the music is in the sound”: the structure of the mix is embedded in the sound signal itself, using audio watermarking techniques, and the embedded information is exploited by the player to perform the separation of the sources (patent pending) used in turn by a spatializer.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Algazi, V.R., Duda, R.O., Thompson, D.M., Avendano, C.: The CIPIC HRTF database. In: Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, pp. 99–102 (2001)
Araki, S., Sawada, H., Makino, S.: K-means based underdetermined blind speech separation. In: Makino, S., et al. (eds.) Blind Source Separation, pp. 243–270. Springer, Heidelberg (2007)
Araki, S., Sawada, H., Mukai, R., Makino, S.: Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors. Signal Processing 87(8), 1833–1847 (2007)
Bass, H., Sutherland, L., Zuckerwar, A., Blackstock, D., Hester, D.: Atmospheric absorption of sound: Further developments. Journal of the Acoustical Society of America 97(1), 680–683 (1995)
Berg, R.E., Stork, D.G.: The Physics of Sound, 2nd edn. Prentice Hall, Englewood Cliffs (1994)
Blauert, J.: Spatial Hearing. revised edn. MIT Press, Cambridge (1997); Translation by J.S. Allen
Bofill, P., Zibulevski, M.: Underdetermined blind source separation using sparse representations. Signal Processing 81(11), 2353–2362 (2001)
Chen, B., Wornell, G.: Quantization index modulation: A class of provably good methods for digital watermarking and information embedding. IEEE Transactions on Information Theory 47(4), 1423–1443 (2001)
Chowning, J.M.: The simulation of moving sound sources. Journal of the Acoustical Society of America 19(1), 2–6 (1971)
International Organization for Standardization, Geneva, Switzerland: ISO 9613-1:1993: Acoustics – Attenuation of Sound During Propagation Outdoors – Part 1: Calculation of the Absorption of Sound by the Atmosphere (1993)
ISO/IEC JTC1/SC29/WG11 MPEG: Information technology Generic coding of moving pictures and associated audio information Part 7: Advanced Audio Coding (AAC) IS13818-7(E) (2004)
ITU-R: Method for objective measurements of perceived audio quality (PEAQ) Recommendation BS1387-1 (2001)
Kuhn, G.F.: Model for the interaural time differences in the azimuthal plane. Journal of the Acoustical Society of America 62(1), 157–167 (1977)
Mouba, J., Marchand, S., Mansencal, B., Rivet, J.M.: RetroSpat: a perception-based system for semi-automatic diffusion of acousmatic music. In: Proceedings of the Sound and Music Computing (SMC) Conference, Berlin, pp. 33–40 (2008)
O’Grady, P., Pearlmutter, B.A., Rickard, S.: Survey of sparse and non-sparse methods in source separation. International Journal of Imaging Systems and Technology 15(1), 18–33 (2005)
Parvaix, M., Girin, L.: Informed source separation of underdetermined instantaneous stereo mixtures using source index embedding. In: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Dallas, Texas (2010)
Parvaix, M., Girin, L.: Informed source separation of linear instantaneous under-determined audio mixtures by source index embedding. IEEE Transactions on Audio, Speech, and Language Processing (accepted, pending publication, 2011)
Pinel, J., Girin, L., Baras, C.: A high-rate data hiding technique for uncompressed audio signals. IEEE Transactions on Audio, Speech, and Language Processing (submitted)
Pinel, J., Girin, L., Baras, C., Parvaix, M.: A high-capacity watermarking technique for audio signals based on MDCT-domain quantization. In: International Congress on Acoustics (ICA), Sydney, Australia (2010)
Plumbley, M.D., Blumensath, T., Daudet, L., Gribonval, R., Davies, M.E.: Sparse representations in audio and music: From coding to source separation. Proceedings of the IEEE 98(6), 995–1005 (2010)
Princen, J., Bradley, A.: Analysis/synthesis filter bank design based on time domain aliasing cancellation. IEEE Transactions on Acoustics, Speech, and Signal Processing 64(5), 1153–1161 (1986)
Strutt (Lord Rayleigh), J.W.: Acoustical observations i. Philosophical Magazine 3, 456–457 (1877)
Strutt (Lord Rayleigh), J.W.: On the acoustic shadow of a sphere. Philosophical Transactions of the Royal Society of London 203A, 87–97 (1904)
Thiede, T., Treurniet, W., Bitto, R., Schmidmer, C., Sporer, T., Beerends, J., Colomes, C.: PEAQ - the ITU standard for objective measurement of perceived audio quality. Journal of the Audio Engineering Society 48(1), 3–29 (2000)
Tournery, C., Faller, C.: Improved time delay analysis/synthesis for parametric stereo audio coding. Journal of the Audio Engineering Society 29(5), 490–498 (2006)
Vincent, E., Gribonval, R., Plumbley, M.D.: Oracle estimators for the benchmarking of source separation algorithms. Signal Processing 87, 1933–1950 (2007)
Viste, H.: Binaural Localization and Separation Techniques. Ph.D. thesis, École Polytechnique Fédérale de Lausanne, Switzerland (2004)
Woodworth, R.S.: Experimental Psychology. Holt, New York (1954)
Yılmaz, O., Rickard, S.: Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing 52(7), 1830–1847 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Marchand, S., Mansencal, B., Girin, L. (2011). Interactive Music with Active Audio CDs. In: Ystad, S., Aramaki, M., Kronland-Martinet, R., Jensen, K. (eds) Exploring Music Contents. CMMR 2010. Lecture Notes in Computer Science, vol 6684. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23126-1_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-23126-1_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23125-4
Online ISBN: 978-3-642-23126-1
eBook Packages: Computer ScienceComputer Science (R0)