Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/354384.354520acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article
Free access

A practical query-by-humming system for a large music database

Published: 30 October 2000 Publication History

Abstract

A music retrieval system that accepts hummed tunes as queries is described in this paper. This system uses similarity retrieval because a hummed tune may contain errors. The retrieval result is a list of song names ranked according to the closeness of the match. Our ultimate goal is that the correct song should be first on the list. This means that eventually our system's similarity retrieval should allow for only one correct answer.
The most significant improvement our system has over general query-by-humming systems is that all processing of musical information is done based on beats instead of notes. This type of query processing is robust against queries generated from erroneous input. In addition, acoustic information is transcribed and converted into relative intervals and is used for making feature vectors. This increases the resolution of the retrieval system compared with other general systems, which use only pitch direction information.
The database currently holds over 10,000 songs, and the retrieval time is at most one second. This level of performance is mainly achieved through the use of indices for retrieval. In this paper, we also report on the results of music analyses of the songs in the database. Based on these results, new technologies for improving retrieval accuracy, such as partial feature vectors and or'ed retrieval among multiple search keys, are proposed. The effectiveness of these technologies is evaluated quantitatively, and it is found that the retrieval accuracy increases by more than 20% compared with the previous system [9]. Practical user interfaces for the system are also described.

References

[1]
S. Blackburn and D. DeRoure. A Tool for Content Based Navigation of Music. In Proc. ACM Multimedia 98, 1998.
[2]
K. Curtis, N. Taniguchi, J. Nakagawa, and M. Yamamuro. A comprehensive image similarity retrieval system that utilizes multiple feature vectors in high dimensional space. In Proceedings of International Conference on Inforraation, Communication and Signal Processing, pages 180-184, September 1997.
[3]
J. J. Dubnowski, R. W. Schafer, and L. R. Rabiner. Real-Time Digital Hardware Pitch Detector. IEEE Trans. on Acoustics, Speech, and Signal Processing, ASSP-24(1):2-8, February 1976.
[4]
J. Foote. An overview of audio information retrieval. In Multimedia System 7, pages 2-10. ACM, January 1999.
[5]
J. Foote. Visualizing Music and Audio using Self-Similarity. In Proc. A CM Multimedia 99, pages 77-80, November 1999.
[6]
J. T. Foote . Content-Based Retrieval of Music and Audio. In Proc. SPIE, vol3229, pages 138-147, 1997.
[7]
A. Ghias, J. Logan, and D. Chamberlin. Query By Humming. In Proc. ACM Multimedia 95, pages 231-236, November 1995.
[8]
N. Kosugi, Y. Nishihara, S. Kon'ya, M. Yamamuro, and K. Kushima. Let's Search for Songs by Humming! In Proc. ACM Multimedia 99 (Part 2), page 194, November 1999.
[9]
N. Kosugi, Y. Nishihara, S. Kon'ya, M. Yamamuro, and K. Kushima. Music Retrieval by Humming. In Proceedings of PACRIM'99, pages 404-407. IEEE, August 1999.
[10]
W. Y. Ma, B. S. Manjunath, Y. Luo, Y. Deng, and X. Sun. NETRA: A Content-Based Image Retrieval System. http://maya.ece.uesb.edu/Netra/.
[11]
R. J. MeNab, L. A. Smith, D. Bainbridge, and I. H. Witten. The New Zealand Digital Library MELody inDEX. http://www.dlib.org/dlib/may97/meldex/OSwritten.html, May 1997.
[12]
Muscle Fish LLC. http://www.musclefish.eom/.
[13]
Y. Nishihara, N. Kosugi, S. Kon'ya, and M. Yamamuro. Humming Query System Using Normalized Time Scale. In Proceedings of CODAS'99, March 1999.
[14]
P.Y. Rolland, G. Rasldnis, and J. G. Ganascia. Musical Content- Based Retrieval: an Overview of the Melodiscov Approach and System. In Proc. ACM Multimedia 99, pages 81-84, November 1999.
[15]
J. R. Smith and C. S. Li. Image classification and querying using composite region templates. In Journal of Computer Vision and Image Understanding, 1999.
[16]
WILDCAT CANYON SOFTWARE. AUTOSCORE. http://www.wildcat.com/Pages/AutoscoreMain.htm.
[17]
N. Taniguchi and M. Yamamuro. Multiple Inverted Array Structure for Similar Image Retrieval. In IEEE Multimedia '98, pages 160-169, 1998.
[18]
A. Uitdenbogerd and J. Zobel. Melodic Matching Techniques for Large Music Database. In Proc. A CM Multimedia 99, pages 57-66, November 1999.
[19]
S. Wu and U. Manber. Fast Text Searching Allowing Errors. Communications of the ACM, 35(10):83-91, October 1996.
[20]
M. Yamamuro, K. Knshima, H. Kimoto, H. Akama, S. Kon'ya, J. Nakagawa, K. Mii, N. Taniguchi, and K. Curtis. ExSight - Multimedia Information Retrieval System. In 2Oth Annual Pacific Telecommunications Conference, PTC'98 Proceedings, pages 734-739, 1998.
[21]
A. Yoshitaka and T. Ichikawa. A Survey on Content-Based Retrieval for Multimedia Databases. IEEE Trans. Knowled9e and Data Engineering, 11(1):81-93, Feb. 1999.

Cited By

View all
  • (2021)Learning to Rank-based Approach for Movie Search by Keyword Query and Example QueryThe 23rd International Conference on Information Integration and Web Intelligence10.1145/3487664.3487779(137-145)Online publication date: 29-Nov-2021
  • (2021)Similarity Analysis of Visual Sketch-based Search for SoundsProceedings of the 16th International Audio Mostly Conference10.1145/3478384.3478423(101-108)Online publication date: 1-Sep-2021
  • (2020)Accurate Onset Detection Algorithm using Feature-Layer-Based Deep Learning Architecture2020 IEEE International Symposium on Circuits and Systems (ISCAS)10.1109/ISCAS45731.2020.9181255(1-5)Online publication date: Oct-2020
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MULTIMEDIA '00: Proceedings of the eighth ACM international conference on Multimedia
October 2000
523 pages
ISBN:1581131984
DOI:10.1145/354384
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 October 2000

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

MM00: ACM Multimedia 2000
California, Marina del Rey, USA

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)130
  • Downloads (Last 6 weeks)22
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Learning to Rank-based Approach for Movie Search by Keyword Query and Example QueryThe 23rd International Conference on Information Integration and Web Intelligence10.1145/3487664.3487779(137-145)Online publication date: 29-Nov-2021
  • (2021)Similarity Analysis of Visual Sketch-based Search for SoundsProceedings of the 16th International Audio Mostly Conference10.1145/3478384.3478423(101-108)Online publication date: 1-Sep-2021
  • (2020)Accurate Onset Detection Algorithm using Feature-Layer-Based Deep Learning Architecture2020 IEEE International Symposium on Circuits and Systems (ISCAS)10.1109/ISCAS45731.2020.9181255(1-5)Online publication date: Oct-2020
  • (2020)Where does Haydn end and Mozart begin? Composer classification of string quartetsJournal of New Music Research10.1080/09298215.2020.1814822(1-20)Online publication date: 2-Sep-2020
  • (2018)On large-scale genre classification in symbolically encoded music by automatic identification of repeating patternsProceedings of the 5th International Conference on Digital Libraries for Musicology10.1145/3273024.3273035(34-37)Online publication date: 28-Sep-2018
  • (2016)A Scheme of MIDI Music Emotion Classification Based on Fuzzy Theme Extraction and Neural Network2016 12th International Conference on Computational Intelligence and Security (CIS)10.1109/CIS.2016.0079(323-326)Online publication date: Dec-2016
  • (2015)Design and Realization of Music Retrieval System Based on Feature ContentMATEC Web of Conferences10.1051/matecconf/2015220102422(01024)Online publication date: 9-Jul-2015
  • (2014)Music Information RetrievalFoundations and Trends in Information Retrieval10.1561/15000000428:2-3(127-261)Online publication date: 12-Sep-2014
  • (2014)Music Retrieval and Adjustment Technique to Support and Motivate Ergotherapy and Daily ExercisesProceedings of the 16th International Conference on Information Integration and Web-based Applications & Services10.1145/2684200.2684313(385-394)Online publication date: 4-Dec-2014
  • (2014)Popular music representationMultimedia Tools and Applications10.1007/s11042-013-1687-273:3(2103-2128)Online publication date: 1-Dec-2014
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media