Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2484028.2484048acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Competence-based song recommendation

Published: 28 July 2013 Publication History

Abstract

Singing is a popular social activity and a good way of expressing one's feelings. One important reason for unsuccessful singing performance is because the singer fails to choose a suitable song. In this paper, we propose a novel singing competence-based song recommendation framework. It is distinguished from most existing music recommendation systems which rely on the computation of listeners' interests or similarity. We model a singer's vocal competence as singer profile, which takes voice pitch, intensity, and quality into consideration. Then we propose techniques to acquire singer profiles. We also present a song profile model which is used to construct a human annotated song database. Finally, we propose a learning-to-rank scheme for recommending songs by singer profile. The experimental study on real singers demonstrates the effectiveness of our approach and its advantages over two baseline methods. To the best of our knowledge, our work is the first to study competence-based song recommendation.

References

[1]
http://www.fon.hum.uva.nl/praat/manual/Voice.html.
[2]
R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. Addison-Wesley, 1999.
[3]
E. Benetos, S. Dixon, D. Giannoulis, H. Kirchhoff, and A. Klapuri. Automatic music transcription: Breaking the glass ceiling. In ISMIR, pages 379--384, 2012.
[4]
P. Boersma and D. Weenink. Praat: doing phonetics by computer (Version 5.3.06) {Computer program}, Retrieved May 1, 200. from http://www.praat.org/.
[5]
Z. Cao, T. Qin, T.-Y. Liu, M.-F. Tsai, and H. Li. Learning to rank: from pairwise approach to listwise approach. In ICML, pages 129--136, 2007.
[6]
D. Deliyski. Acoustic model and evaluation of pathological voice production. In Proceedings of Eurospeech, pages 1969--1972, 1993.
[7]
A. Ghias, J. Logan, D. Chamberlin, and B. C. Smith. Query by humming: Musical information retrieval in an audio database. In ACM Multimedia, pages 231--236, 1995.
[8]
D. Goldberg, D. A. Nichols, B. M. Oki, and D. B. Terry. Using collaborative filtering to weave an information tapestry. Commun. ACM, 35(12):61--70, 1992.
[9]
L. Heylen, F. Wuyts, F. Mertens, M. D. Bodt, and P. V. de Heyning. Normative voice range profiles of male and female professional voice users. Journal of voice, 16:1--17, 2002.
[10]
L. G. Heylen, F. L. Wuyts, F. W. Mertens, and J. E. Pattyn. Phonetography in voice diagnoses. Acta Oto-Rhino-Laryngologica, 50:299--308, 1996.
[11]
K. Hoashi, K. Matsumoto, and N. Inoue. Personalization of user profiles for content-based music retrieval based on relevance feedback. In ACM Multimedia, pages 110--119, 2003.
[12]
K. Järvelin and J. Kekäläinen. Ir evaluation methods for retrieving highly relevant documents. In SIGIR, pages 41--48, 2000.
[13]
H. Kirchhoff, S. Dixon, and A. Klapuri. Multi-template shift-variant non-negative matrix deconvolution for semi-automatic music transcription. In ISMIR, pages 415--420, 2012.
[14]
K. Mao, X. Luo, K. Chen, G. Chen, and L. Shou. mydj: recommending karaoke songs from one's own voice. In SIGIR, page 1009, 2012.
[15]
Y. Maryn, P. Corthals, P. V. Cauwenberge, N. Roy, and M. D. Bodt. Toward improved ecological validity in the acoustic measurement of overall voice quality: combining continuous speech and sustained vowels. Journal of voice, 24:410--426, 2010.
[16]
J. P. Pabon and R. Plomp. Automatic phonetogram recording supplemented with acoustical voice-quality parameters. Journal of Speech and Hearing Research, 31:710--722, 1988.
[17]
G. Peeters. A large set of audio features for sound description. Technical report, IRCAM, 2004.
[18]
J. Peter and H. Pabon. Objective acoustic voice-quality parameters in the computer phonetogram. Journal of Voice, 5:203--216, 1991.
[19]
B. Schneider, M. Zumtobel, W. Prettenhofer, B. Aichstill, and W. Jocher. Normative voice range profiles in vocally trained and untrained children aged between 7 and 10 years. Journal of voice, 24:153--160, 2010.
[20]
H. Schutte and W. Seidner. Recommendation by the union of european phoniatricians (uep): Standardizing voice area measurement/phonetography. Folia Phoniatr (Basel), 35:286--288, 1983.
[21]
R. Speyer, G. H. Wieneke, I. van Wijck-Warnaar, and P. H. Dejonckere. Efficacy of voice therapy assessed with the voice range profile (phonetogram). Journal of Voice, 17:544--559, 2003.
[22]
A. M. Sulter, H. K. Schutte, and D. G. Miller. Differences in phonetogram features between male and female subjects with and without vocal training. Journal of voice, 9:363--377, 1995.
[23]
W.-H. Tsai and H.-C. Lee. Automatic evaluation of karaoke singing based on pitch, volume, and rhythm features. IEEE Transactions on Audio, Speech, and Language Processing, 20:1233--1243, 2012.
[24]
C. Watts, K. Barnes-Burroughs, J. Estis, and D. Blanton. The singing power ratio as an objective measure of singing voice quality in untrained talented and nontalented singers. Journal of voice, 20:82--88, 2006.
[25]
S. K. Wolf, D. Stanley, and W. J. Sette. Quantitative studies on the singing voice. The Journal of the Acoustical Society of America, 6:255--266, 1935.
[26]
E. Yumoto, W. Gould, and T. Baer. Harmonics-to-noise ratio as an index of the degree of hoarseness. The Journal of the Acoustical Society of America, 71:1544--1550, 1982.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '13: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
July 2013
1188 pages
ISBN:9781450320344
DOI:10.1145/2484028
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 July 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. learning-to-rank
  2. singing competence
  3. song recommendation

Qualifiers

  • Research-article

Conference

SIGIR '13
Sponsor:

Acceptance Rates

SIGIR '13 Paper Acceptance Rate 73 of 366 submissions, 20%;
Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 03 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2016)Music recommendation using graph based quality modelSignal Processing10.1016/j.sigpro.2015.03.026120:C(806-813)Online publication date: 1-Mar-2016
  • (2015)Competence-Based Song Recommendation: Matching Songs to One’s Singing SkillIEEE Transactions on Multimedia10.1109/TMM.2015.239256217:3(396-408)Online publication date: Mar-2015
  • (2015)Next-song recommendation with temporal dynamicsKnowledge-Based Systems10.1016/j.knosys.2015.07.03988:C(134-143)Online publication date: 1-Nov-2015
  • (2014)Song Recommendation for Social Singing CommunityProceedings of the 22nd ACM international conference on Multimedia10.1145/2647868.2654921(127-136)Online publication date: 3-Nov-2014

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media