Beatty: Automatic Tempo Curve Synthesis for Expressive MIDI Track Playback

Wong, Madeline

Author(s)

Wong, Madeline

DownloadThesis PDF (1.177Mb)

Advisor

Egozy, Eran

Terms of use

In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/

Metadata

Show full item record

Abstract

Beatty is a sequence-to-sequence machine learning model to predict expressive timing decisions for excerpts of classical solo piano music. Composed of a bidirectional encoder LSTM and decoder LSTM with attention, Beatty predicts tempo labels based on input note sequences. The input note sequence is obtained by transforming a MIDI file representation of the musical score into a series of one-hot note vectors, which encode the MIDI note pitches, velocities, and durations, and are augmented with additional harmonic tension information. The target output is a sequence of tempo labels, represented as ratios of the sequence’s initial starting tempo. We demonstrate that the harmonic tension augmentation, as well as learning from filtered tempo label sequences, improve model performance. In qualitative evaluation, the model output receives positive feedback when its predicted tempo sequence is subtle and smooth and criticism when it fluctuates too greatly, suggesting areas for future exploration and improvement.

Date issued

2022-05

URI

https://hdl.handle.net/1721.1/145078

Department

Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science

Publisher

Massachusetts Institute of Technology

Collections

Graduate Theses