Hannah Davis edited this page Nov 9, 2018 · 28 revisions

EARLY MODELS, NON-MAGENTA MODELS:

Feynman Liang, BachBot: http://bachbot.com

FolkRNN: https://folkrnn.org/, https://github.com/IraKorshunova/folk-rnn

Wavenet (DeepMind version): https://deepmind.com/blog/wavenet-generative-model-raw-audio/

Audio texture synthesis and style transfer: https://dmitryulyanov.github.io/audio-texture-synthesis-and-style-transfer/

SampleRNN: https://github.com/soroushmehr/sampleRNN_ICLR2017

Tacotron: https://google.github.io/tacotron/

MAGENTA MODELS:

Drums RNN: Applies language modeling to drum track generation using an LSTM.

Image Stylization: A "Multistyle Pastiche Generator" that generates artistic representations of photographs. (non-music)

Improv RNN: Generates melodies a la Melody RNN, but conditions the melodies on an underlying chord progression.

Melody RNN: Applies language modeling to melody generation using an LSTM.

Music VAE: A hierarchical recurrent variational autoencoder for music.

NSynth: "Neural Audio Synthesis" as described in "NSynth: Neural Audio Synthesis with WaveNet Autoencoders".

Onsets and Frames: Automatic piano music transcription model as described in "Onsets and Frames: Dual-Objective Piano Transcription".

Performance RNN: Applies language modeling to polyphonic music using a combination of note on/off, timeshift, and velocity change events.

Pianoroll RNN-NADE: Applies language modeling to polyphonic music generation using an LSTM combined with a NADE, an architecture called an RNN-NADE. Based on the architecture described in "Modeling Temporal Dependencies in High-Dimensional Sequences: Application to Polyphonic Music Generation and Transcription".

Polyphony RNN: Applies language modeling to polyphonic music generation using an LSTM. Based on the BachBot architecture described in "Automatic Stylistic Composition of Bach Chorales with Deep LSTM".

RL Tuner: Takes an LSTM that has been trained to predict the next note in a monophonic melody and enhances it using reinforcement learning (RL).

Sketch RNN: A recurrent neural network model for generating sketches. (non-music)
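
The Performance RNN entry above mentions an event vocabulary of note on/off, timeshift, and velocity change events. The sketch below illustrates roughly what such an encoding could look like. It is not Magenta's actual code or API: the note tuple layout, the event names, and the `notes_to_events` helper are all assumptions made for illustration, and time is measured in abstract integer steps.

```python
from typing import List, Tuple

# Hypothetical note layout (an assumption): (pitch, velocity, start_step, end_step)
Note = Tuple[int, int, int, int]
# Hypothetical event layout (an assumption): (event_name, value)
Event = Tuple[str, int]


def notes_to_events(notes: List[Note]) -> List[Event]:
    """Flatten polyphonic notes into a single stream of events."""
    # Collect every note boundary. The second field orders NOTE_OFF (0)
    # before NOTE_ON (1) when both fall on the same step.
    boundaries = []
    for pitch, velocity, start, end in notes:
        boundaries.append((start, 1, "NOTE_ON", pitch, velocity))
        boundaries.append((end, 0, "NOTE_OFF", pitch, 0))
    boundaries.sort()

    events: List[Event] = []
    current_step = 0
    current_velocity = None
    for step, _, kind, pitch, velocity in boundaries:
        # Advance the clock with a TIME_SHIFT event when time has passed.
        if step > current_step:
            events.append(("TIME_SHIFT", step - current_step))
            current_step = step
        if kind == "NOTE_ON":
            # Emit a VELOCITY event only when the velocity changes.
            if velocity != current_velocity:
                events.append(("VELOCITY", velocity))
                current_velocity = velocity
            events.append(("NOTE_ON", pitch))
        else:
            events.append(("NOTE_OFF", pitch))
    return events


# Two overlapping notes: C4 from step 0 to 4, E4 from step 2 to 4.
example = [(60, 80, 0, 4), (64, 80, 2, 4)]
for event in notes_to_events(example):
    print(event)
```

A language model such as an LSTM can then be trained to predict the next event in a stream like this, in the same way a text model predicts the next token.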