Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2976796.2988171acmotherconferencesArticle/Chapter ViewAbstractPublication PageswebmediaConference Proceedingsconference-collections
short-paper

A Deep Approach for Handwritten Musical Symbols Recognition

Published: 08 November 2016 Publication History

Abstract

Preserving the world musical heritage comes down to digitalising and provision of music works to further query on the acquired data. However, to do the processing it is necessary an Optical Music Recognition (OMR) system capable of decoding the original manuscripts into a machine-readable data. Developing a precise and robust OMR system for handwritten musical scores is still an open issue. A fundamental step of improve such task is to recognise musical notes. Hence, trying to provide ways to produce a truly robust OMR sys- tem, we present in this paper a new methodology applying deep learning techniques to recognise musical notes in digitalised handwritten musical scores. The proposed methodology has been tested on a ground truth dataset of music scores reaching a minimum error rate of 3.99%, 96.46% of precision and 96.56% of recall on the HOMUS dataset.

References

[1]
J. Calvo-Zaragoza and J. Oncina. Recognition of pen-based music notation: the homus dataset. In 2014 22nd International Conference on Pattern Recognition (ICPR), pages 3038--3043. IEEE, 2014.
[2]
A. Fornés, J. Lladós, and G. Sánchez. Old handwritten musical symbol classification by a dynamic time warping based method. In Graphics Recognition. Recent Advances and New Opportunities, pages 51--60. Springer, 2007.
[3]
D. H. Hubel and T. N. Wiesel. Receptive fields and functional architecture of monkey striate cortex. The Journal of physiology, 195(1):215--243, 1968.
[4]
A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, pages 1097--1105, 2012.
[5]
S. V. Lab. Large scale visual recognition challenge 2012 (ilsvrc2012), 2012.
[6]
Y. LeCun, L. Bottou, Y. Bengio, and P. Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278--2324, 1998.
[7]
Y. LeCun, C. Cortes, and C. J. Burges. The mnist database. URL http://yann. lecun. com/exdb/mnist, 1998.
[8]
H. Miyao and M. Maruyama. An online handwritten music score recognition system. In Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on, volume 1, pages 461--464. IEEE, 2004.
[9]
A. Rebelo, G. Capela, and J. S. Cardoso. Optical recognition of music symbols. International Journal on Document Analysis and Recognition (IJDAR), 13(1):19--31, 2010.
[10]
O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, et al. Imagenet large scale visual recognition challenge. International Journal of Computer Vision, 115(3):211--252, 2015.
[11]
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1--9, 2015.
[12]
J. J. Valero-Mas, J. Calvo-Zaragoza, J. R. Rico-Juan, and J. M. I\ nesta. An experimental study on rank methods for prototype selection. Soft Computing, pages 1--13, 2016.
[13]
C. Wen, A. Rebelo, J. Zhang, and J. Cardoso. A new optical music recognition system based on combined neural network. Pattern Recognition Letters, 58:1--7, 2015.
[14]
S. Zhang, C. Zhang, Z. You, R. Zheng, and B. Xu. Asynchronous stochastic gradient descent for dnn training. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pages 6660--6663. IEEE, 2013.

Cited By

View all
  • (2024)Deep Multilevel Cascade Residual Recurrent Framework (MCRR) for Sheet Music RecognitionIEEE Access10.1109/ACCESS.2024.335088012(6941-6960)Online publication date: 2024
  • (2020)Offline music symbol recognition using Daisy feature and quantum Grey wolf optimization based feature selectionMultimedia Tools and Applications10.1007/s11042-020-09638-3Online publication date: 25-Aug-2020
  • (2019)OMR metrics and evaluation: a systematic reviewMultimedia Tools and Applications10.1007/s11042-019-08200-0Online publication date: 14-Dec-2019
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
Webmedia '16: Proceedings of the 22nd Brazilian Symposium on Multimedia and the Web
November 2016
384 pages
ISBN:9781450345125
DOI:10.1145/2976796
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • FAPEPI: Fundacao de Amparo a Pesquisa do Estado do Piaui
  • SBC: Brazilian Computer Society
  • CNPq: Conselho Nacional de Desenvolvimento Cientifico e Tecn
  • CGIBR: Comite Gestor da Internet no Brazil
  • CAPES: Brazilian Higher Education Funding Council

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 November 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. convolutional neural network
  2. deep learning
  3. document analyses
  4. optical musical recognition

Qualifiers

  • Short-paper

Funding Sources

Conference

Webmedia '16
Sponsor:
  • FAPEPI
  • SBC
  • CNPq
  • CGIBR
  • CAPES
Webmedia '16: 22nd Brazilian Symposium on Multimedia and the Web
November 8 - 11, 2016
Piauí State, Teresina, Brazil

Acceptance Rates

Webmedia '16 Paper Acceptance Rate 29 of 94 submissions, 31%;
Overall Acceptance Rate 270 of 873 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Deep Multilevel Cascade Residual Recurrent Framework (MCRR) for Sheet Music RecognitionIEEE Access10.1109/ACCESS.2024.335088012(6941-6960)Online publication date: 2024
  • (2020)Offline music symbol recognition using Daisy feature and quantum Grey wolf optimization based feature selectionMultimedia Tools and Applications10.1007/s11042-020-09638-3Online publication date: 25-Aug-2020
  • (2019)OMR metrics and evaluation: a systematic reviewMultimedia Tools and Applications10.1007/s11042-019-08200-0Online publication date: 14-Dec-2019
  • (2018)End-to-End Neural Optical Music Recognition of Monophonic ScoresApplied Sciences10.3390/app80406068:4(606)Online publication date: 11-Apr-2018
  • (2017)Towards Self-Learning Optical Music Recognition2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA)10.1109/ICMLA.2017.00-60(795-800)Online publication date: Dec-2017
  • (2017)Recognition of Handwritten Music Symbols with Convolutional Neural Codes2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)10.1109/ICDAR.2017.118(691-696)Online publication date: Nov-2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media