Skip to content
Hannah Davis edited this page Nov 11, 2018 · 28 revisions

EARLY MODELS, NON-MAGENTA MODELS:

Feynman Liang, BachBot:
http://bachbot.com

FolkRNN:
https://folkrnn.org/, https://github.com/IraKorshunova/folk-rnn

Wavenet (DeepMind version):
https://deepmind.com/blog/wavenet-generative-model-raw-audio/

Audio texture synthesis and style transfer:
https://dmitryulyanov.github.io/audio-texture-synthesis-and-style-transfer/

SampleRNN:
https://github.com/soroushmehr/sampleRNN_ICLR2017

Tacotron, Speech Synthesis:
https://google.github.io/tacotron/

MAGENTA MODELS:

https://github.com/tensorflow/magenta/tree/master/magenta/models

Drums RNN:
Applies language modeling to drum track generation using an LSTM.

Image Stylization:
A "Multistyle Pastiche Generator" that generates artistics representations of photographs. non-music

Improv RNN:
Generates melodies a la Melody RNN, but conditions the melodies on an underlying chord progression.

Melody RNN:
Applies language modeling to melody generation using an LSTM.

Music VAE:
A hierarchical recurrent variational autoencoder for music.

NSynth:
"Neural Audio Synthesis" as described in NSynth: Neural Audio Synthesis with WaveNet Autoencoders.

Onsets and Frames:
Automatic piano music transcription model as described in Onsets and Frames: Dual-Objective Piano Transcription

Performance RNN:
Applies language modeling to polyphonic music using a combination of note on/off, timeshift, and velocity change events.

Pianoroll RNN-NADE:
Applies language modeling to polyphonic music generation using an LSTM combined with a NADE, an architecture called an RNN-NADE. Based on the architecture described in Modeling Temporal Dependencies in High-Dimensional Sequences: Application to Polyphonic Music Generation and Transcription.

Polyphony RNN:
Applies language modeling to polyphonic music generation using an LSTM. Based on the BachBot architecture described in Automatic Stylistic Composition of Bach Choralies with Deep LSTM.

RL Tuner:
Takes an LSTM that has been trained to predict the next note in a monophonic melody and enhances it using reinforcement learning (RL).

MH
Sketch RNN:
A recurrent neural network model for generating sketches. non-music