Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1040830.1040904acmconferencesArticle/Chapter ViewAbstractPublication PagesiuiConference Proceedingsconference-collections
Article

Towards automatic transcription of expressive oral percussive performances

Published: 10 January 2005 Publication History

Abstract

We describe a tool for transcribing voice generated percussive rhythms. The system consists of: (a) a segmentation component which separates the monophonic input stream into percussive events (b) a descriptors generation component that computes a set of acoustic features from each of the extracted segments, (c) a machine learning component which assigns to each of the segmented sounds of the input stream a symbolic class. We describe each of these components and compare different machine learning strategies that can be used to obtain a symbolic representation of the oral percussive performance.

References

[1]
Herrera, P. and Dehamel, A. and Gouyon, F. (2003). Automatic labeling of unpitched percussion Sounds, Proceedings of Audio Engineering Society, 114th Convention, Amsterdam, The Netherlands.
[2]
Quinlan, J.R. (1993). C4.5: Programs for Machine Learning, San Francisco, Morgan Kaufmann.
[3]
Kapur, A., Benning, M., Tzanetakis, G. (2004) Query-By-Beat-Boxing: Music Retrieval for the DJ. Proc. of the 5th International Conference on Music Information Retrieval (ISMIR 2004), Barcelona, October 10-14.
[4]
Nakano, T., Ogata, J., Goto, M., Hiraga, Y. (2004) A Drum Pattern Retrieval Method by Voice Percussion. Proc. of the 5th International Conference on Music Information Retrieval (ISMIR 2004), Barcelona, October 10-14.
[5]
Brossier, P., Bello, J., Plumbley, M. (2004). Fast Labelling of Notes in Music Signals. Proc. of the 5th International Conference on Music Information Retrieval (ISMIR 2004), Barcelona, October 10-14.
[6]
Klapuri, A. (1999). Sound Onset Detection by Applying Psychoacoustic Knowledge, Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP.
[7]
Hazan, A. (2004) Interfaz oral para el reconocimiento de ritmos. Master thesis. Facultat d'Informatica de Barcelona, February 2004.
[8]
Gouyon, F. Herrera, P. (2003). Determination of the Meter of musical audio signals: Seeking recurrences in beat segment descriptors. Proceedings of Audio Engineering Society, 114th Convention Amsterdam, The Netherlands.
[9]
Gillet, O. Richard,G. (2003). Automatic Labelling of Tabla Signals, Proc of ISMIR 2003, Baltimore, USA Oct. 2003.
[10]
Paulus, J., Klapuri, A. (2003) Model-Based Event Labeling in the Transcription Of Percussive Audio Signals Proc. of the 6th Int. Conference on Digital Audio Effects (DAFX-03), London, UK, September 8-11.
[11]
Ramirez, R., Hazan, A., Gómez, E., Maestre, E. (2004). Understanding Expressive Transformations in Monophonic Musical Phrases. Proc. of Sound and Music Computing (SMC'04), Paris, October 20-22.

Cited By

View all
  • (2021)Human beatbox sound recognition using an automatic speech recognition toolkitBiomedical Signal Processing and Control10.1016/j.bspc.2021.10246867(102468)Online publication date: May-2021
  • (2019)A New Dataset for Amateur Vocal Percussion AnalysisProceedings of the 14th International Audio Mostly Conference: A Journey in Sound10.1145/3356590.3356844(17-23)Online publication date: 18-Sep-2019
  • (2018)A Review of Automatic Drum TranscriptionIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2018.283011326:9(1457-1483)Online publication date: 1-Sep-2018
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
IUI '05: Proceedings of the 10th international conference on Intelligent user interfaces
January 2005
344 pages
ISBN:1581138946
DOI:10.1145/1040830
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 10 January 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. knowledge-based approaches
  2. performance transcription
  3. speech processing

Qualifiers

  • Article

Conference

IUI05
IUI05: Tenth International Conference on Intelligent User Interfaces
January 10 - 13, 2005
California, San Diego, USA

Acceptance Rates

Overall Acceptance Rate 746 of 2,811 submissions, 27%

Upcoming Conference

IUI '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)5
  • Downloads (Last 6 weeks)1
Reflects downloads up to 23 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Human beatbox sound recognition using an automatic speech recognition toolkitBiomedical Signal Processing and Control10.1016/j.bspc.2021.10246867(102468)Online publication date: May-2021
  • (2019)A New Dataset for Amateur Vocal Percussion AnalysisProceedings of the 14th International Audio Mostly Conference: A Journey in Sound10.1145/3356590.3356844(17-23)Online publication date: 18-Sep-2019
  • (2018)A Review of Automatic Drum TranscriptionIEEE/ACM Transactions on Audio, Speech and Language Processing10.1109/TASLP.2018.283011326:9(1457-1483)Online publication date: 1-Sep-2018
  • (2014)Singing information processing2014 12th International Conference on Signal Processing (ICSP)10.1109/ICOSP.2014.7015431(2431-2438)Online publication date: Oct-2014

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media