MINGUS is a Transformer-based Seq2Seq architecture for modelling and generating monophonic jazz melodic lines. MINGUS relies on two dedicated embedding models (respectively for pitch and duration) and exploits in prediction features such as chords (current and following), bass line, position inside the measure. The obtained results are comparable with the state of the art of music generation with neural models, with particularly good performances on jazz music.
More information in the paper (bib):
Vincenzo Madaghiele, Pasquale Lisena, Raphaël Troncy. MINGUS: Melodic Improvisation Neural Generator Using Seq2Seq. In Proceedings of 22nd International Society of Music Information Retrieval Conference (ISMIR), 8-12 November 2021, Online.
The web application for user evaluation is available at mingus.tools.eurecom.fr.
Supplementary material with additional results about the paper is available here.
You can listen to jazz melodies generated by MINGUS here
An interactive browser demo of the midi generation is available here
To use the model run the following code from the main project directory. After downloading this repository, run the following code in the terminal to install the required libraries in a new conda environment named MINGUS and activate it:
$ conda env create -f environment.yml
$ conda activate MINGUS
If this installation does not work correctly you can install the dependencies listed in the file requirement.txt individually. It will probably be necessary to run the following command:
$ export PYTHONPATH="$PWD"
To train MINGUS with a custom dataset place your musicXML data in 00_preprocessData/data/xml and run the following command in your terminal:
$ python3 A_preprocessData/data_preprocessing.py --format xml
This will generate the file 00_preprocessData/data/DATA.json which will be used for training. This file should be the same for all the following commands (even generation and evaluation). WjazzDB requires specific pre-processing from the csv files provided by its authors. To train with WjazzDB run:
$ python3 A_preprocessData/wjazzDB_csv_to_xml.py
$ python3 A_preprocessData/data_preprocessing.py --format xml
To train MINGUS from scratch run:
$ python3 B_train/train.py
The saved model will be in B_train/models. It is possible to visualize training details with tensorboard by running:
$ tensorboard --logdir=B_train/runs
The results are showed in your browser at http://localhost:6006/.
To generate music with MINGUS you can run the following command. Please ensure that the arguments correspond to an already trained model. You can use the files in C_generate/xml4gen as a starting point. These files are also used for evaluation.
$ python3 C_generate/generate.py --xmlSTANDARD <path_to_xml_standard>
Output midi files will be in C_generate/generated_tunes.
To generate with our pre-trained model run:
$ python3 C_generate/generate.py --EPOCHS 100 --xmlSTANDARD <path_to_xml_standard>
The evaluation metrics are computed by comparing the melodies generated by MINGUS and a corpus of original samples, so to evaluate the model you should first choose some reference xml files and put them in D_eval/reference (you can randomly select some files from your original dataset). Then you should generate a corpus of melodies with MINGUS and evaluate the results:
$ python3 C_generate/generate.py --GENERATE_CORPUS
$ python3 D_evaluate/evaluate.py
The evaluation metrics are available in D_evaluate/metrics.
To evaluate our pre-trained model run:
$ python3 C_generate/generate.py --GENERATE_CORPUS --EPOCHS 100
$ python3 D_evaluate/evaluate.py --EPOCHS 100
@inproceedings{madaghiele2021mingus,
author = {Madaghiele, Vincenzo and Lisena, Pasquale and Troncy, Raphael},
title = {{MINGUS: Melodic Improvisation Neural Generator Using Seq2Seq}},
booktitle = {22$^{nd}$ International Society for Music Information Retrieval Conference (ISMIR)},
year = {2021},
month = {11},
address = {Online}
}