
Data preparation

This directory contains scripts and other files needed to prepare the training and test datasets. Before starting, make sure that the groove2groove package is installed in your environment.

The scripts download the data and preprocess it. Preprocessing includes conversion to the format used by Groove2Groove: an LMDB database of Magenta note sequences plus a JSON file with metadata.

Synthetic data

Go to the synth directory and run ./download.sh. This will download the synthetic dataset from Zenodo and extract the relevant files to the train, val and test subdirectories.

Preprocess each part of the dataset:

./prepare.sh train
./prepare.sh val
./prepare.sh test
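
The three calls above can also be wrapped in a small loop that stops at the first failure. This is only a sketch: the `prepare_all` function and the `PREPARE` variable are illustrative names that are not part of the repository, and the only thing assumed about `prepare.sh` is what is shown above, namely that it takes the split name as its single argument.

```shell
#!/usr/bin/env bash

# Run the preprocessing script once per dataset split.
# PREPARE is a hypothetical override for the script path (not part of the
# repository); it defaults to ./prepare.sh in the current directory.
prepare_all() {
    local script="${PREPARE:-./prepare.sh}"
    local split
    for split in train val test; do
        echo "Preprocessing ${split}..."
        # Abort on the first failing split so later splits are not
        # processed after an error.
        if ! "$script" "$split"; then
            echo "Preprocessing failed for ${split}" >&2
            return 1
        fi
    done
}
```

After sourcing this snippet inside the synth directory, calling `prepare_all` should be equivalent to running the three commands one after another.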

Bodhidharma

Go to the bodhidharma directory and run ./prepare.sh. This will download, extract and preprocess the dataset.

Additionally, perform velocity normalization by running the vel_norm.ipynb notebook, which is located in this directory (next to the bodhidharma directory).
