Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1065385.1065432acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
Article

On the extraction of vocal-related information to facilitate the management of popular music collections

Published: 07 June 2005 Publication History

Abstract

With the explosive growth of networked collections of musical material, there is a need to establish a mechanism like a digital library to manage music data. This paper presents a content-based processing paradigm of popular song collections to facilitate the realization of a music digital library. The paradigm is built on the automatic extraction of information of interest from music audio signals. Because the vocal part is often the heart of a popular song, we focus on developing techniques to exploit the solo vocal signals underlying an accompanied performance. This supports the necessary functions of a music digital library, namely, music data organization, music information retrieval/recommendation, and copyright protection.

References

[1]
Akeroyd, M. A., Moore, B. C. J., and Moore, G. A. Melody Recognition Using Three Types of Dichotic-pitch Stimulus. Journal of the Acoustical Society of America, 110 (3), 2001, 1498-1504.
[2]
Bainbridge, D., Nevill-Manning, C. G., Witten, I. H., Smith, L. A., McNab, R. J. Towards A digital Library of Popular Music. In Proceedings of the ACM International Conference on Digital Libraries, 1999.
[3]
Cunningham, S. J., Reeves, N., and Britland M. An Ethnographic Study of Music Information Seeking: Implications for the Design of a Music Digital Library. In Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2003.
[4]
Davis, S. B., and Mermelstein, P. Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences. IEEE Transactions on Acoustics, speech, and Signal Processing, 28, 1980, 357-366.
[5]
Dempster, A., Laird, N., and Rubin, D. Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, 39, 1977, 1-38.
[6]
Durey, A. S., and Clements, M. A. Features for Melody Spotting Using Hidden Markov Models. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2002.
[7]
Eronen, A. Musical Instrument Recognition Using ICA-based Transform of Features and Discriminatively Trained HMMS. In Proceedings of the International Symposium on Signal Processing and Its Applications, 2003.
[8]
Hacker, S. MP3: The Definitive Guide. O'Reilly, 2000.
[9]
Haitsma, J., and Kalker, T. A Highly Robust Audio Fingerprinting System, In Proceedings of the International Conference on Music Information Retrieval, 2002.
[10]
Herrera, P., Amatriain, X., Batlle, E., and Serra. X. Towards Instrument Segmentation for Music Content Description: A Critical Review of Instrument Classification Techniques. In Proceedings of the International Symposium on Music Information Retrieval, 2000.
[11]
http://music-ir.org/evaluation/wp.html
[12]
ISO-IEC/JTC1 SC29 WG11 Moving Pictures Expert Group. Information technology - multimedia content description interface - part 4: Audio. Committee Draft 15938-4, ISO/IEC, 2000.
[13]
Kim, Y. E., and Whitman, B. Singer Identification in Popular Music Recordings Using Voice Coding Features. In Proceedings of the International Conference on Music Information Retrieval, 2002.
[14]
Li, T., Ogihara, M., and Li, Q. A Comparative Study on Content-Based Music Genre Classification. In Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval, 2003.
[15]
Liu, C. C., and Huang, C. S. A Singer Identification Technique for Content-based Classification of MP3 Music Objects. In Proceedings of the ACM International Conference on Information and Knowledge Management, 2002.
[16]
Liu, D., Lu, L., and Zhang, H. J. Automatic Mood Detection from Acoustic Music Data. In Proceedings of the International Conference on Music Information Retrieval, 2003.
[17]
Martin, A., Doddington, G., Kamm, T., Ordowski, M., and Przybocki, M. The DET Curve in Assessment of Detection Task Performance. In Proceedings of the European Conference on Speech Communication and Technology, 1997.
[18]
McNab, R. J., Smith, L. A., Witten, I. H. Towards the Digital Music Library: Tune Retrieval from Acoustic Input. In Proceedings of the ACM International Conference on Digital Libraries, 1996.
[19]
Oppenheim, A. V., and Schafer, R. W. Homomorphic Analysis of Speech. IEEE Transactions on Audio and Electroacoustics, 16, 1968, 221-226.
[20]
Reynolds, D. A., and Rose, R. C. Robust Text-independent Speaker Identification Using Gaussian Mixture Speaker Models. IEEE Transactions on Speech and Audio Processing, 3 (1), 1995, 72-83.
[21]
Reynolds, D. A., Quatieri, T. F., and Dunn, R. B. Speaker Verification Using Adapted Gaussian Mixture Models. Digital Signal Processing, 10, 2000, 19-41.
[22]
Salton, G. Automatic Text Processing. Addison-Wesley, Reading, MA, 1989.
[23]
Solomonoff, A., Mielke, A., Schmidt, M., and Gish, H. Clustering Speakers by Their Voices. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 1998.
[24]
Tsai, W. H., Wang, H. M., and Rodgers D. Blind Clustering of Popular Music Recordings Based on Singer Voice Characteristics. Computer Music Journal, 28 (3), 2004, 68-78.
[25]
Tsai, W. H., and Wang, H. M. A Query-by-example Framework to Retrieve Music Documents by Singer. In Proceedings of the IEEE International Conference on Multimedia and Expo, 2004.
[26]
Tzanetakis, G., and Cook, P. Musical Genre Classification of Audio Signals. IEEE Transactions on Speech and Audio Processing, 10 (5), 2002, 293-302.
[27]
Tzanetakis, G., Gao, J., and Steenkiste, P. A Scalable Peer-to-Peer System for Music Content and Information Retrieval. In Proceedings of the International Conference on Music Information Retrieval, 2003.
[28]
Venkatachalam, V., Cazzanti, L., Dhillon, N., and Wells, M. Automatic Identification of Sound Recordings. IEEE Signal Processing Magazine, March 2004, 92-99.
[29]
Wang, C. K., Lyu, R. Y., and Chiang, Y. C. An Automatic Singing Transcription System with Multilingual Singing Lyric Recognizer and Robust Melody Tracker. In Proceedings of the European Conference on Speech Communication and Technology, 2003.
[30]
Whitman, B., Flake, G., and Lawrence, S. Artist Detection in Music with Minnowmatch. In Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, 2001.
[31]
Yang, D., and Lee, W. Disambiguating Music Emotion Using Software Agents. In Proceedings of the International Conference on Music Information Retrieval, 2004.

Cited By

View all
  • (2016)Information Retrieval Technologies and the “Realities” of Music Information SeekingBusiness Intelligence10.4018/978-1-4666-9562-7.ch032(612-631)Online publication date: 2016
  • (2016)Information Retrieval Technologies and the “Realities” of Music Information SeekingExperimental Multimedia Systems for Interactivity and Strategic Innovation10.4018/978-1-4666-8659-5.ch005(102-121)Online publication date: 2016
  • (2011)Research and Realization of a SVS Algorithm Based on STFT and NDFTProceedings of the 2011 International Conference on Network Computing and Information Security - Volume 0210.1109/NCIS.2011.178(396-400)Online publication date: 14-May-2011
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
JCDL '05: Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
June 2005
450 pages
ISBN:1581138768
DOI:10.1145/1065385
  • General Chair:
  • Mary Marlino,
  • Program Chairs:
  • Tamara Sumner,
  • Frank Shipman
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 June 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. music digital library
  2. music information retrieval
  3. query-by-example
  4. solo voice modeling
  5. vocal/non-vocal segmentation

Qualifiers

  • Article

Conference

JCDL05

Acceptance Rates

Overall Acceptance Rate 415 of 1,482 submissions, 28%

Upcoming Conference

JCDL '24
The 2024 ACM/IEEE Joint Conference on Digital Libraries
December 16 - 20, 2024
Hong Kong , China

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2016)Information Retrieval Technologies and the “Realities” of Music Information SeekingBusiness Intelligence10.4018/978-1-4666-9562-7.ch032(612-631)Online publication date: 2016
  • (2016)Information Retrieval Technologies and the “Realities” of Music Information SeekingExperimental Multimedia Systems for Interactivity and Strategic Innovation10.4018/978-1-4666-8659-5.ch005(102-121)Online publication date: 2016
  • (2011)Research and Realization of a SVS Algorithm Based on STFT and NDFTProceedings of the 2011 International Conference on Network Computing and Information Security - Volume 0210.1109/NCIS.2011.178(396-400)Online publication date: 14-May-2011
  • (2011)Automatic singer identification based on auditory features2011 Seventh International Conference on Natural Computation10.1109/ICNC.2011.6022500(1624-1628)Online publication date: Jul-2011
  • (2010)Visual Expression for Organizing and Accessing Music Collections in MusicWizResearch and Advanced Technology for Digital Libraries10.1007/978-3-642-15464-5_10(80-91)Online publication date: 2010
  • (2009)Pitch Oriented Automatic Singer Identification in Pop MusicProceedings of the 2009 IEEE International Conference on Semantic Computing10.1109/ICSC.2009.28(161-166)Online publication date: 14-Sep-2009
  • (2006)Tree-based overlay networks for scalable applicationsProceedings of the 20th international conference on Parallel and distributed processing10.5555/1898699.1898719(223-223)Online publication date: 25-Apr-2006
  • (2006)Automatic categorization of figures in scientific documentsProceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries10.1145/1141753.1141778(129-138)Online publication date: 11-Jun-2006
  • (2006)Tree-based overlay networks for scalable applicationsProceedings 20th IEEE International Parallel & Distributed Processing Symposium10.1109/IPDPS.2006.1639493(8 pp.)Online publication date: 2006

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media