Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2502081.2502229acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

ESSENTIA: an open-source library for sound and music analysis

Published: 21 October 2013 Publication History

Abstract

We present Essentia 2.0, an open-source C++ library for audio analysis and audio-based music information retrieval released under the Affero GPL license. It contains an extensive collection of reusable algorithms which implement audio input/output functionality, standard digital signal processing blocks, statistical characterization of data, and a large set of spectral, temporal, tonal and high-level music descriptors. The library is also wrapped in Python and includes a number of predefined executable extractors for the available music descriptors, which facilitates its use for fast prototyping and allows setting up research experiments very rapidly. Furthermore, it includes a Vamp plugin to be used with Sonic Visualiser for visualization purposes. The library is cross-platform and currently supports Linux, Mac OS X, and Windows systems. Essentia is designed with a focus on the robustness of the provided music descriptors and is optimized in terms of the computational cost of the algorithms. The provided functionality, specifically the music descriptors included in-the-box and signal processing algorithms, is easily expandable and allows for both research experiments and development of large-scale industrial applications.

References

[1]
D. Bogdanov, M. Haro, F. Fuhrmann, A. Xambó, E. Gómez, and P. Herrera. Semantic audio content-based music recommendation and visualization based on user preference examples. Inf. Process. & Management, 49(1):13--33, 2013.
[2]
D. Bogdanov, J. Serrà, N. Wack, P. Herrera, and X. Serra. Unifying low-level and high-level music similarity measures. IEEE Trans. on Multimedia, 13(4):687--701, 2011.
[3]
D. Bogdanov, N. Wack, E. Gómez, S. Gulati, P. Herrera, O. Mayor, G. Roma, J. Salamon, J. Zapata, and X. Serra. ESSENTIA: an audio analysis library for music information retrieval. In Int. Soc. for Music Inf. Retrieval Conf. (ISMIR'13), 2013.
[4]
C. Cannam, C. Landone, and M. Sandler. Sonic visualiser: An open source application for viewing, analysing, and annotating music audio files. In ACM Int. Conf. on Multimedia (MM'05), page 1467--1468, 2010.
[5]
F. Eyben, M. Wöllmer, and B. Schuller. Opensmile: the munich versatile and fast open-source audio feature extractor. In ACM Int. Conf. on Multimedia (MM'10), page 1459--1462, 2010.
[6]
F. Fuhrmann, P. Herrera, and X. Serra. Detecting solo phrases in music using spectral and pitch-related descriptors. Journal of New Music Research, 38(4):343--356, 2009.
[7]
C. F. Julià and S. Jordà. SongExplorer: a tabletop application for exploring large collections of songs. In Int. Soc. for Music Inf. Retrieval Conf. (ISMIR'09), 2009.
[8]
S. Koelsch, S. Skouras, T. Fritz, P. Herrera, C. Bonhage, M. Kuessner, and A. M. Jacobs. Neural correlates of music-evoked fear and joy: The roles of auditory cortex and superficial amygdala. Neuroimage. In press.
[9]
K. R. Page, B. Fields, D. De Roure, T. Crawford, and J. S. Downie. Reuse, remix, repeat: the workflows of MIR. In Int. Soc. for Music Inf. Retrieval Conf. (ISMIR'12), 2012.
[10]
G. Roma, J. Janer, S. Kersten, M. Schirosa, P. Herrera, and X. Serra. Ecological acoustics perspective for content-based retrieval of environmental sounds. EURASIP Journal on Audio, Speech, and Music Process., 2010.
[11]
J. Serrà, E. Gómez, P. Herrera, and X. Serra. Chroma binary similarity and local alignment applied to cover song identification. IEEE Trans. on Audio, Speech, and Language Process., 16(6):1138--1151, 2008.
[12]
M. Sordo. Semantic Annotation of Music Collections: A Computational Approach. PhD thesis, UPF, Barcelona, Spain, 2012.
[13]
N. Wack, C. Laurier, O. Meyers, R. Marxer, D. Bogdanov, J. Serra, E. Gomez, and P. Herrera. Music classification using high-level models. In Music Inf. Retrieval Evaluation Exchange (MIREX'10), 2010.

Cited By

View all
  • (2024)The Emotion-to-Music Mapping Atlas (EMMA): A systematically organized online database of emotionally evocative music excerptsBehavior Research Methods10.3758/s13428-024-02336-056:4(3560-3577)Online publication date: 30-Jan-2024
  • (2024)Ensemble of Multimodal Deep Learning Models for Violin Bowing Techniques ClassificationJournal of Advances in Information Technology10.12720/jait.15.1.40-4815:1(40-48)Online publication date: 2024
  • (2024)Study protocol of a randomized control trial on the effectiveness of improvisational music therapy for autistic childrenBMC Psychiatry10.1186/s12888-024-06086-324:1Online publication date: 27-Sep-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MM '13: Proceedings of the 21st ACM international conference on Multimedia
October 2013
1166 pages
ISBN:9781450324045
DOI:10.1145/2502081
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 October 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. audio analysis
  2. music information retrieval
  3. open source
  4. signal processing
  5. sound and music computing

Qualifiers

  • Research-article

Conference

MM '13
Sponsor:
MM '13: ACM Multimedia Conference
October 21 - 25, 2013
Barcelona, Spain

Acceptance Rates

MM '13 Paper Acceptance Rate 47 of 235 submissions, 20%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)53
  • Downloads (Last 6 weeks)3
Reflects downloads up to 25 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)The Emotion-to-Music Mapping Atlas (EMMA): A systematically organized online database of emotionally evocative music excerptsBehavior Research Methods10.3758/s13428-024-02336-056:4(3560-3577)Online publication date: 30-Jan-2024
  • (2024)Ensemble of Multimodal Deep Learning Models for Violin Bowing Techniques ClassificationJournal of Advances in Information Technology10.12720/jait.15.1.40-4815:1(40-48)Online publication date: 2024
  • (2024)Study protocol of a randomized control trial on the effectiveness of improvisational music therapy for autistic childrenBMC Psychiatry10.1186/s12888-024-06086-324:1Online publication date: 27-Sep-2024
  • (2024)MARingBA: Music-Adaptive Ringtones for Blended Audio Notification DeliveryProceedings of the 2024 CHI Conference on Human Factors in Computing Systems10.1145/3613904.3642376(1-15)Online publication date: 11-May-2024
  • (2024)Emotion Recognition of Playing Musicians From EEG, ECG, and Acoustic SignalsIEEE Transactions on Human-Machine Systems10.1109/THMS.2024.343032754:5(619-629)Online publication date: Oct-2024
  • (2024)Artist Similarity Based on Heterogeneous Graph Neural NetworksIEEE/ACM Transactions on Audio, Speech, and Language Processing10.1109/TASLP.2024.343717032(3717-3729)Online publication date: 2024
  • (2024)Musical Genre Classification Using Advanced Audio Analysis and Deep Learning TechniquesIEEE Open Journal of the Computer Society10.1109/OJCS.2024.34312295(457-467)Online publication date: 2024
  • (2024)SoundSignature: What Type of Music do you Like?2024 IEEE 5th International Symposium on the Internet of Sounds (IS2)10.1109/IS262782.2024.10704174(1-10)Online publication date: 30-Sep-2024
  • (2024)AudioInsight: Online Exploration of Large Audio Datasets for Musical Acoustics2024 IEEE 5th International Symposium on the Internet of Sounds (IS2)10.1109/IS262782.2024.10704135(1-10)Online publication date: 30-Sep-2024
  • (2024)MusicoNet: A Social Network for Musicians Based on the Internet of Musical Things and People Paradigm2024 IEEE 5th International Symposium on the Internet of Sounds (IS2)10.1109/IS262782.2024.10704117(1-9)Online publication date: 30-Sep-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media