Supervised Symbolic Music Style Translation Using Synthetic Data

Abstract

Research on style transfer and domain translation has clearly demonstrated the ability of deep learning-based algorithms to manipulate images in terms of artistic style. More recently, several attempts have been made to extend such approaches to music (both symbolic and audio) in order to enable transforming musical style in a similar manner. In this study, we focus on symbolic music with the goal of altering the 'style' of a piece while keeping its original 'content'. As opposed to the current methods, which are inherently restricted to be unsupervised due to the lack of 'aligned' data (i.e. the same musical piece played in multiple styles), we develop the first fully supervised algorithm for this task. At the core of our approach lies a synthetic data generation scheme which allows us to produce virtually unlimited amounts of aligned data, and hence avoid the above issue. In view of this data generation scheme, we propose an encoder-decoder model for translating symbolic music accompaniments between a number of different styles. Our experiments show that our models, although trained entirely on synthetic data, are capable of producing musically meaningful accompaniments even for real (non-synthetic) MIDI recordings.
