MuseCoco: generating symbolic music from text

Environment

conda create -n MuseCoco python=3.8
conda activate MuseCoco
pip install -r requirements.txt

Training

Text-to-Attribute Understanding

1. Construct attribute-text pairs

Attribute: We provide attributes of the standard test set in text.bin.
Construct Text:

cd 1-text2attribute_dataprepare
bash run.sh

This produces the attribute-text pairs (the input dataset for the text-to-attribute understanding model), consisting of att_key.json and test.json. An off-the-shelf standard test set is also provided in the folder.

2. Train the model

cd 1-text2attribute_model
bash train.sh

Training produces the checkpoint of the fine-tuned model and num_labels.json.

Attribute-to-Music Generation

1. Data processing

Switch to the 2-attribute2music_dataprepare folder and set midi_data_extractor_path in config.py to the path that contains midi_data_extractor.
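
As a minimal sketch, the entry in config.py might look like the following. Only the variable name midi_data_extractor_path comes from this README; the example path and the extractor_is_reachable helper are hypothetical, and whether midi_data_extractor is a package directory or a single module is not stated here, so the check accepts either.

```python
import os

# Assumed example value: replace it with the directory on your machine
# that actually contains midi_data_extractor.
midi_data_extractor_path = "/path/to/parent/of/midi_data_extractor"


def extractor_is_reachable(base_path: str) -> bool:
    """Hypothetical sanity check, not part of the repository.

    Returns True if base_path contains either a midi_data_extractor
    directory or a midi_data_extractor.py module.
    """
    as_package = os.path.isdir(os.path.join(base_path, "midi_data_extractor"))
    as_module = os.path.isfile(os.path.join(base_path, "midi_data_extractor.py"))
    return as_package or as_module
```

Calling extractor_is_reachable(midi_data_extractor_path) before running the extraction step gives an early signal that the path is set incorrectly.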

Then, in the data_tool folder, run the following command to obtain the packed data.

python extract_data.py path/to/the/folder/containing/midi/files path/to/save/the/dataset

Note: The tool can only extract the values of objective attributes from MIDI files automatically. To supply values for subjective attributes, enter them manually at L40-L42 of extract_data.py.

Prepare Token.bin, Token_index.json, RID.bin, and RID_index.json in the data/ folder. Then run the following commands to split the data into training, validation, and test sets.

cd data_process

# The following script splits the midi corpus into "train.txt", "valid.txt" and "test.txt", using "5120" as the maximum length of the token sequence.
python split_data.py

# The following script binarizes the data in fairseq format.
python util.py
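
For orientation, the kind of split and length cap described in the comment above can be sketched as follows. This is not the code of split_data.py: the split ratios, the fixed seed, the choice to truncate rather than drop long sequences, the one-sequence-per-line output, and the function names are all assumptions. Only the 5120-token limit and the three-way split come from this README.

```python
import random
from typing import Dict, List

MAX_LEN = 5120  # maximum token-sequence length stated in this README


def split_corpus(
    samples: List[List[str]],
    valid_ratio: float = 0.1,  # assumed ratio, not taken from the repository
    test_ratio: float = 0.1,   # assumed ratio, not taken from the repository
    seed: int = 0,
) -> Dict[str, List[List[str]]]:
    """Cap each token sequence, shuffle, and split into three subsets."""
    # Truncation is an assumption; the real script may filter instead.
    capped = [tokens[:MAX_LEN] for tokens in samples]
    rng = random.Random(seed)
    rng.shuffle(capped)

    n_valid = int(len(capped) * valid_ratio)
    n_test = int(len(capped) * test_ratio)
    return {
        "valid": capped[:n_valid],
        "test": capped[n_valid:n_valid + n_test],
        "train": capped[n_valid + n_test:],
    }


def write_split(path: str, sequences: List[List[str]]) -> None:
    """Write one space-separated token sequence per line (assumed layout)."""
    with open(path, "w", encoding="utf-8") as f:
        for tokens in sequences:
            f.write(" ".join(tokens) + "\n")
```

Fixing the seed keeps the three subsets reproducible across runs, which matters when checkpoints are later compared on the same validation set.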

2. Training

Run the following command to train a model with approximately 200M parameters.

bash train-xl.sh

Inference

I. Text-to-Attribute Understanding

Switch to the 1-text2attribute_model folder.

Set model_name_or_path to the checkpoint path and num_labels to the path of num_labels.json in predict.sh.
Prepare the text from which attribute values will be extracted, following the format of predict.json.
Set test_file to the path of predict.json in predict.sh.
Then run:
```
bash predict.sh
```
This produces predict_attributes.json and softmax_probs.json.
Preprocess the input for the attribute-to-music generation stage: after inference, set the paths of predict.json, predict_attributes.json, softmax_probs.json, and att_key.json in stage2_pre.py, then run:
```
python stage2_pre.py
```
This produces stage1.bin, the inference input for the attribute-to-music generation stage.
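
Before running stage2_pre.py, it can help to confirm that all four inputs exist and parse as JSON. The sketch below is a hypothetical helper, not part of the repository. The file names come from the steps above; the function name and the assumption that all four files sit in one directory are illustrative.

```python
import json
import os
from typing import List

# File names taken from the steps above.
REQUIRED_FILES = [
    "predict.json",
    "predict_attributes.json",
    "softmax_probs.json",
    "att_key.json",
]


def missing_or_invalid(directory: str) -> List[str]:
    """Return the required files that are absent or not valid JSON."""
    problems = []
    for name in REQUIRED_FILES:
        path = os.path.join(directory, name)
        try:
            with open(path, encoding="utf-8") as f:
                json.load(f)
        except (OSError, ValueError):
            # OSError covers missing or unreadable files;
            # ValueError covers malformed JSON and decoding errors.
            problems.append(name)
    return problems
```

An empty return value means every input is present and well-formed; it does not verify that the contents match what stage2_pre.py expects.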

II. Attribute-to-Music Generation

Switch to the 2-attribute2music_model folder.

Prepare the model checkpoint:

checkpoint/linear_mask-xl-truncated_5120/checkpoint_best.pt

Prepare the input for inference in the folder data/infer_input from the output of the text-to-attribute understanding stage.
Run the following command to generate 64 samples for each input.

# The following script takes "data/infer_input/infer_test.bin" as input.
bash interactive.sh 0 10 infer_test
# bash interactive.sh start_idx end_idx input_name

The generated results are located in the generation/ folder.
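
To cover a larger input over several runs, the start_idx and end_idx arguments can be generated in chunks. The sketch below only builds command strings in the form documented above. This README does not say whether end_idx is inclusive or exclusive, so the half-open ranges used here are an assumption to check against interactive.sh. The helper name, total count, and chunk size are hypothetical.

```python
from typing import List


def chunked_commands(total: int, chunk: int, input_name: str) -> List[str]:
    """Build one interactive.sh command per consecutive index range.

    Ranges are half-open, [start, end); this is an assumption about
    how interactive.sh interprets end_idx.
    """
    commands = []
    for start in range(0, total, chunk):
        end = min(start + chunk, total)
        commands.append(f"bash interactive.sh {start} {end} {input_name}")
    return commands


if __name__ == "__main__":
    for command in chunked_commands(30, 10, "infer_test"):
        print(command)
```

Printing the commands rather than executing them keeps the sketch safe to run and makes it easy to inspect the ranges first.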