
1 Introduction

The Semantic Web is all about data diversity. The use of Linked Data principles [4, a.o.] has boosted the growth of the Web of Data in a wide variety of domains. At the heart of Linked Data lies the Resource Description Framework (RDF)Footnote 1, a data model for expressing metadata about any resource. The Linked Open Data Cloud [6] and the LOD Laundromat [1] consist of billions of such metadata statements. Focusing on what these RDF statements are about, we can distinguish two types of Linked Data datasets. The first type directly describes a domain: the statements are about real-world facts. Examples are Bio2RDF, GeoNames and DBpedia.Footnote 2 The second type captures metadata only: the statements are about data artifacts that themselves describe data. Examples are L3S DBLP and Semantic Web DogfoodFootnote 3 on bibliographic data, and – in our case – Linked Data about music.

In the case of the latter type of Linked Data, the actual digital objects represented by RDF statements are not (yet) machine-interpretable from a semantic perspective (texts, images, video, audio), or require the use of legacy tooling to be so. Thus, although the resources that describe digital objects are promoted to first-class citizens of the Web, the objects themselves remain, to a large extent, non-interoperable.

One of the most prominent types of digital objects are music pieces. Music metadata has received a lot of attention from the Linked Data community. For example, DBTune.org links several music-related data sources on the Semantic Web [7], among them MusicBrainz [8], MySpace, and BBC Music. Other works have looked into publishing music recording metadata and results from audio analysis algorithms [2]. Although the publication of music metadata about artists, songs, albums, musical events, and experimental results is an obvious contribution to the variety of Linked Data, music itself is currently composed, published, and exchanged only offline or within monolithic systems.

Currently, musicians compose and interpret music in a data-silo setup, without scalable, global, machine-readable, or resource-linkable formats. Consequently, existing musical data is neither shared nor reused when musicians create, mix, combine, and publish music. Reuse only happens at an abstract, hardly reproducible level: listening, reading scores, transcribing, etc. With current musical representations, a large-scale analysis of musical compositions and artifacts, of the relationships between them, and of their properties, fundamental nature, and intended meaning is impossible. Because Linked Music Data is limited to metadata proper, musicians cannot exploit Web technologies to address these issues.

Many works have studied symbolic representations of music suitable for the Web. The Notation Interchange File Format (NIFF), the Music Encoding Initiative (MEI), and MusicXML [3] are standards aimed at exchanging digital sheet music in a machine-readable way. Beyond music scores, MIDI (Musical Instrument Digital Interface) allows machines to exchange, manipulate and interpret musical events, and is to date the only symbolic music interchange format with wide adoption. The W3C Audio Working Group is working on a Web MIDI APIFootnote 4 for “enabling web applications to enumerate and select MIDI input and output devices on the client system, and send and receive MIDI messages”. This relates to the actual music in the same way that Semantic Web services relate to Linked Data. Some works presented at the International Workshop on Semantic Music and MediaFootnote 5 (SMAM) specifically address the task of broadcasting chords and other information recognized from analog musical events as RDF on the Web.

In this paper, we describe the first steps towards representing music in MIDI format as Linked Data. We investigate whether RDF is a suitable format to represent the content of digital music, and we explore the potential of such a representation. The novelty of the demonstration lies in three aspects. First, we study the essential concepts of MIDI to create a conversion workflow of MIDI music to RDF. Second, we invert this transformation, finding that it is possible to recompose the original MIDI from its RDF representation, thus providing a lossless round-trip. Third, we reuse a novel generative audio matching algorithm [5] to create RDF streams of music from any analog instrument, using MIDI as a discretization proxy.

The rest of the paper is organized as follows. In Sect. 2 we describe the basic concepts of MIDI, and we propose two methods: one to transform MIDI files to RDF and back; and another to stream live music from analog instruments encoded in RDF, using a novel generative audio matching algorithm. We also detail the key technology used in their implementations. In Sect. 3 we illustrate the contents of the demonstration, focusing on the lessons that can be learned from it, and we briefly discuss future work.

Fig. 1. Data model of MIDI concepts and their relations: patterns, tracks, events, and their attributes.

2 midi2rdf

MIDI is a standard that allows communication between a wide variety of electronic musical instruments, computers, and other devices. As a universal synthesizer interface, it abstracts musical events from the hardware, facilitating music exchange in a hardware-independent manner.

Figure 1 depicts the fundamental MIDI concepts of patterns, tracks and events. A pattern acts as a high-level container of a musical work (e.g. a song). It contains global MIDI metadata, such as resolution and format, and a list of tracks. Tracks are the logical divisions of a pattern, and typically represent musical instruments that play simultaneously. Finally, tracks contain events, which are sequential occurrences of instrumental actions, such as “a note starts playing” (mid:NoteOnEvent) or “a note stops playing” (mid:NoteOffEvent). MIDI defines 28 different types of events that control various aspects of the musical play. All MIDI events have an associated tick offset, which indicates the relative distance between events in discrete units. Each event type has its own attributes. For example, a mid:NoteOnEvent has a pitch (the note to be played), a velocity (how forcefully it is played), and a channel (selecting the tone or “instrument”).
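To make this hierarchy concrete, the following is a minimal sketch that reads a MIDI file with the python-midi library used below and prints the pattern/track/event structure of Fig. 1; the file name is a placeholder and error handling is omitted.

```python
# Sketch: inspect the pattern/track/event hierarchy of a MIDI file.
# Assumes the python-midi API (midi.read_midifile, Pattern, Track, Event).
import midi

pattern = midi.read_midifile("song.mid")          # placeholder file name
print("resolution:", pattern.resolution, "format:", pattern.format)

for i, track in enumerate(pattern):               # a Pattern is a list of Tracks
    print("track", i, "with", len(track), "events")
    for event in track:                           # a Track is a list of Events
        if isinstance(event, midi.NoteOnEvent):
            # tick: offset from the previous event in discrete units;
            # pitch, velocity, channel: attributes of this event type
            print(event.tick, event.pitch, event.velocity, event.channel)
```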

We use the data model of Fig. 1 to implement a conversion workflow that transforms the internal components of a MIDI file into an RDF model. midi2rdf Footnote 6 is an open-source suite of programs for encoding and decoding digital music between the MIDI and RDF formats. midi2rdf consists of several tools addressing two tasks: conversion and streaming.

Conversion. The conversion programs are midi2rdf and rdf2midi. Both are Python scripts written on top of the python-midi Footnote 7 (a comprehensive abstraction over MIDI that facilitates reading contents from, and writing contents to, MIDI files) and rdflib Footnote 8 libraries. midi2rdf iterates over all tracks, and the events they contain, in the pattern of a MIDI file, and transforms them into an RDF graph as shown in Fig. 1. The inverse process is carried out by rdf2midi, which transforms any RDF graph compliant with the model of Fig. 1 back into MIDI. Auxiliary tools for displaying the contents of MIDI files, and for playing RDF files with MIDI contents in a single command, are also supplied.
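The sketch below illustrates the MIDI-to-RDF direction under the model of Fig. 1. The mid: namespace URI and the URI scheme for patterns, tracks and events are assumptions made for this example only; the vocabulary and identifiers produced by midi2rdf may differ in detail.

```python
# Sketch: map a MIDI pattern to RDF triples following the model of Fig. 1.
# The mid: namespace and URI scheme here are hypothetical.
import midi
from rdflib import Graph, Namespace, Literal, RDF, URIRef

MID = Namespace("http://example.org/midi#")       # hypothetical namespace
g = Graph()
g.bind("mid", MID)

pattern = midi.read_midifile("song.mid")          # placeholder file name
pattern_uri = URIRef("http://example.org/pattern/song")
g.add((pattern_uri, RDF.type, MID.Pattern))
g.add((pattern_uri, MID.resolution, Literal(pattern.resolution)))
g.add((pattern_uri, MID.format, Literal(pattern.format)))

for t, track in enumerate(pattern):
    track_uri = URIRef("%s/track%02d" % (pattern_uri, t))
    g.add((pattern_uri, MID.hasTrack, track_uri))
    g.add((track_uri, RDF.type, MID.Track))
    for e, event in enumerate(track):
        # the event index in the URI preserves ordering, which an RDF
        # graph does not guarantee by itself
        event_uri = URIRef("%s/event%04d" % (track_uri, e))
        g.add((track_uri, MID.hasEvent, event_uri))
        g.add((event_uri, RDF.type, MID[type(event).__name__]))
        g.add((event_uri, MID.tick, Literal(event.tick)))
        if isinstance(event, midi.NoteEvent):
            g.add((event_uri, MID.pitch, Literal(event.pitch)))
            g.add((event_uri, MID.velocity, Literal(event.velocity)))
            g.add((event_uri, MID.channel, Literal(event.channel)))

print(g.serialize(format="turtle"))
```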

Streaming. To stress the importance of a symbolic and Web-friendly representation of music during a performance, we propose the workflow depicted in Fig. 2. We implement this workflow in the program stream-midi-rdf, also included in midi2rdf. The basic idea is to create a stream of RDF data according to the model of Fig. 1, using a discretization of input analog instruments through MIDIFootnote 9. A key step here is the generative audio matching system: an algorithm that translates analog, continuous input audio into digital, discrete MIDI events. For this, we use the algorithm provided in Guitar MIDI 2Footnote 10 [5]. We route the output of this algorithm to an IAC (Inter-Application Communication Driver) virtual MIDI device, which dumps raw MIDI event data. To read these events, we use pygame Footnote 11, a set of Python modules designed for writing video games with good support for MIDI interfaces. Finally, we again use rdflib to send streams of RDF triples according to Fig. 1 to the standard output.
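The following sketch illustrates the streaming direction, assuming pygame.midi for reading raw events from the default MIDI input device; the mid: namespace and the URI scheme for streamed events are hypothetical, and this is not the actual stream-midi-rdf implementation.

```python
# Sketch: read raw MIDI events from an input device and emit RDF triples
# describing them on standard output. Namespace and URIs are hypothetical.
import time
import pygame.midi
from rdflib import Graph, Namespace, Literal, RDF, URIRef

MID = Namespace("http://example.org/midi#")       # hypothetical namespace

pygame.midi.init()
inp = pygame.midi.Input(pygame.midi.get_default_input_id())  # assumes a device exists

counter = 0
while True:
    if inp.poll():
        # each entry is [[status, data1, data2, data3], timestamp]
        for (status, data1, data2, _), timestamp in inp.read(16):
            if status & 0xF0 == 0x90:             # 0x9n = Note On, channel n
                event_uri = URIRef("http://example.org/stream/event%d" % counter)
                counter += 1
                g = Graph()
                g.add((event_uri, RDF.type, MID.NoteOnEvent))
                g.add((event_uri, MID.pitch, Literal(data1)))
                g.add((event_uri, MID.velocity, Literal(data2)))
                g.add((event_uri, MID.tick, Literal(timestamp)))
                print(g.serialize(format="nt"))
    time.sleep(0.01)                              # avoid busy-waiting
```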

Fig. 2. Workflow for converting instrument analog signals to MIDI-like RDF streams.

3 Demonstration

The demonstration of the midi2rdf suite will consist of two parts: a validation of the lossless round-trip conversion between MIDI and RDF; and a live performance, with real instruments, showing the RDF streaming workflow of Fig. 2. In the first part, visitors of the demonstration will evaluate whether the round-trip conversion is lossless for a variety of MIDI files. They will learn the basic features of symbolic music representation, as well as the atomic concepts of MIDI, and why the two representations are equivalent. We will show why the translation of these concepts to RDF is challenging, especially regarding issues like preserving the order of MIDI events. Furthermore, visitors will be challenged to think about a Web of Linked Music Data beyond musical metadata. We are interested in the advantages that Linked Data provides over current approaches such as MusicXML: uniquely (and globally) representing notes with URIs, which enables a more standard way of comparing and combining music; the content returned when dereferencing those URIs; and the ability to query across (streamed) music, something that is not possible with current standards.
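As an illustration of the kind of cross-music queries we have in mind, the following hypothetical SPARQL query, phrased over the assumed mid: vocabulary of the earlier sketches rather than the exact vocabulary of midi2rdf, asks which tracks play middle C (pitch 60) and how often.

```python
# Sketch: query converted MIDI data with SPARQL via rdflib.
# File name, namespace and vocabulary are hypothetical.
from rdflib import Graph

g = Graph()
g.parse("song.ttl", format="turtle")              # assumed output of the conversion

query = """
PREFIX mid: <http://example.org/midi#>
SELECT ?track (COUNT(?event) AS ?hits)
WHERE {
  ?track mid:hasEvent ?event .
  ?event a mid:NoteOnEvent ;
         mid:pitch 60 .
}
GROUP BY ?track
"""
for track, hits in g.query(query):
    print(track, hits)
```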

In the second part, visitors will see the state of the art in generative audio matching in action, witnessing how a live performance with a real instrument is turned into an RDF stream of triples describing MIDI events of the music they hear. The audience will learn that, despite the precision and performance of these methods, detecting the correct notes at real-time latency remains a challenge that affects the usefulness of such RDF streams. We hope to engage in discussions about the pros and cons of discretizing music this way.

We plan to improve this work in several ways. First, we plan to issue PROV triples indicating which prov:Agent performed which prov:Activity on what prov:Entity, to better document performances. Second, we will extend midi2rdf to better support less common types of MIDI events. Finally, we plan to use the proposed tools to deploy a Linked Dataset of MIDI music, link it to related resources in the LOD cloud, compose new pieces by combining existing ones with SPARQL, and explore the semantics of music.
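As a hypothetical illustration of the provenance documentation mentioned above, such triples could be expressed with the standard PROV-O vocabulary along the following lines; the agent, activity, and entity URIs are purely illustrative.

```python
# Sketch: PROV-O triples documenting a performance. All URIs are illustrative.
from rdflib import Graph, Namespace, RDF, URIRef

PROV = Namespace("http://www.w3.org/ns/prov#")

g = Graph()
g.bind("prov", PROV)
performer = URIRef("http://example.org/agent/guitarist-1")
performance = URIRef("http://example.org/activity/live-session-1")
stream = URIRef("http://example.org/entity/rdf-stream-1")

g.add((performer, RDF.type, PROV.Agent))
g.add((performance, RDF.type, PROV.Activity))
g.add((stream, RDF.type, PROV.Entity))
g.add((performance, PROV.wasAssociatedWith, performer))   # who performed
g.add((stream, PROV.wasGeneratedBy, performance))          # what was produced
print(g.serialize(format="turtle"))
```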