Implements Reformer: The Efficient Transformer in pytorch. (Work in progress)
- Tested with Python 3.7.5, Pytorch 1.4.0.
- This code is built upon the pytorch-lightning framework.
- `pip install -r requirements.txt`
- If you want to modify `trainer.py` or `model\model.py`, it is recommended that you familiarize yourself with the pytorch-lightning library beforehand; a minimal sketch of the `LightningModule` pattern is shown below this list.
- A custom copy task & music dataset have been implemented under `datasets\dataloader.py`. Modify as needed.
- A config yaml file must be placed under `config`. See the provided yaml files for the basic framework.
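
For orientation, here is a minimal sketch of the `LightningModule` pattern that pytorch-lightning imposes (the pattern `trainer.py` and `model\model.py` are built around). The module, layer sizes, and data below are placeholders, not this repo's actual model, and the exact hooks may differ slightly between pytorch-lightning versions.

```python
# Minimal pytorch-lightning pattern, illustrative only; not this repo's model.
import torch
import pytorch_lightning as pl
from torch.utils.data import DataLoader, TensorDataset


class ToyModule(pl.LightningModule):
    def __init__(self):
        super().__init__()
        self.net = torch.nn.Linear(16, 4)  # placeholder for the actual Reformer model

    def forward(self, x):
        return self.net(x)

    def training_step(self, batch, batch_idx):
        x, y = batch
        loss = torch.nn.functional.cross_entropy(self(x), y)
        return {'loss': loss}  # dict return works across pytorch-lightning versions

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters(), lr=1e-3)

    def train_dataloader(self):
        # random placeholder data so the sketch runs end to end
        x = torch.randn(256, 16)
        y = torch.randint(0, 4, (256,))
        return DataLoader(TensorDataset(x, y), batch_size=32)


if __name__ == "__main__":
    # fast_dev_run=True is the Trainer option behind fast-dev-run style debugging:
    # a single batch of each loop is run and the process exits.
    trainer = pl.Trainer(max_epochs=1, fast_dev_run=True)
    trainer.fit(ToyModule())
```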
- Run `python3 trainer.py -c \path\to\config\yaml -n [name of run] -b [batch size] -f [fast dev run] -v [version number]`
- The `-f` flag is used for debugging; only a single batch of training, validation, and testing will be run.
- The `-v` flag is used for resuming from checkpoints; leave it empty for a new version.
- A toy copy task of length 32 and vocab size 128 converges in roughly 6k steps with a batch size of 1024, a learning rate of 1e-3, and Adam. The checkpoint is located under `checkpoints\`.
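
The toy copy task above is simply "reproduce the input sequence"; a synthetic dataset for it can be sketched as below. This is an illustration using the same length and vocab numbers, not the dataset actually implemented in `datasets\dataloader.py`.

```python
# Illustrative synthetic copy task (length 32, vocab 128); not this repo's dataloader.
import torch
from torch.utils.data import Dataset


class CopyTaskDataset(Dataset):
    """Each item is (input, target), where the target is an exact copy of the input."""

    def __init__(self, num_samples=10000, seq_len=32, vocab_size=128):
        self.num_samples = num_samples
        self.seq_len = seq_len
        self.vocab_size = vocab_size

    def __len__(self):
        return self.num_samples

    def __getitem__(self, idx):
        seq = torch.randint(1, self.vocab_size, (self.seq_len,))  # 0 left free, e.g. for padding
        return seq, seq.clone()


if __name__ == "__main__":
    src, tgt = CopyTaskDataset()[0]
    print(src.shape, tgt.shape)  # torch.Size([32]) torch.Size([32])
```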
- A complete checkpoint folder must be placed under `logs\`. Use the entire folder pytorch-lightning automatically saves.
- A corresponding version number must be provided with the `-v` flag.
- Run the code with the `-s` flag set to `True`. This will generate 1 sample under `sample\`, if using the music dataset.
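
Under the hood, generation is a standard autoregressive loop. The sketch below shows the generic (greedy) version, assuming only a model that maps a token prefix of shape `(1, t)` to logits of shape `(1, t, vocab)`; it is not this repo's sampling code.

```python
# Generic greedy autoregressive sampling loop; illustrative only.
import torch


@torch.no_grad()
def sample_greedy(model, start_tokens, max_len=128):
    """Grow the sequence one token at a time by appending the argmax prediction.

    Assumes model(tokens) returns logits of shape (1, seq_len, vocab_size).
    """
    tokens = start_tokens  # shape (1, t0)
    while tokens.size(1) < max_len:
        logits = model(tokens)                                    # (1, t, vocab)
        next_token = logits[:, -1].argmax(dim=-1, keepdim=True)   # (1, 1)
        tokens = torch.cat([tokens, next_token], dim=1)
    return tokens
```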
- Implement general framework of Reformer
- Rewrite using pytorch-lightning framework
- Implement Label Smoothing
- Implement LSH attention
- Implement reversible layer
- Implement autoregressive sampling
- Implement various datasets
- June Young Yi @ MINDsLab Inc. ([email protected], [email protected])
MIT License
- The general structure of this code is based on The Annotated Transformer, albeit heavily modified.
- I am aware that reformer-lm exists. However, I was frustrated with the original trax implementation that the authors provided, and decided to rewrite the entire thing from the ground up. Naturally, expect bugs everywhere.
- Thanks to MINDsLab for providing training resources.