OpenSeq2Seq: toolkit for distributed and mixed precision training of sequence-to-sequence models

OpenSeq2Seq main goal is to allow researchers to most effectively explore various sequence-to-sequence models. The efficiency is achieved by fully supporting distributed and mixed-precision training. OpenSeq2Seq is built using TensorFlow and provides all the necessary building blocks for training encoder-decoder models for neural machine translation, automatic speech recognition, speech synthesis, and language modeling.

Documentation and installation instructions

https://nvidia.github.io/OpenSeq2Seq/

Features

Models for:
1. Neural Machine Translation
2. Automatic Speech Recognition
3. Speech Synthesis
4. Language Modeling
5. NLP tasks (sentiment analysis)
Data-parallel distributed training
1. Multi-GPU
2. Multi-node
Mixed precision training for NVIDIA Volta/Turing GPUs

Software Requirements

Python >= 3.5
TensorFlow >= 1.10
CUDA >= 9.0, cuDNN >= 7.0
Horovod >= 0.13 (using Horovod is not required, but is highly recommended for multi-GPU setup)

Acknowledgments

Speech-to-text workflow uses some parts of Mozilla DeepSpeech project.

Text-to-text workflow uses some functions from Tensor2Tensor and Neural Machine Translation (seq2seq) Tutorial.

Disclaimer

This is a research project, not an official NVIDIA product.

Related resources

Paper

If you use OpenSeq2Seq, please cite this paper

@article{openseq2seq,
  title={
OpenSeq2Seq: extensible toolkit for distributed and mixed precision training of sequence-to-sequence models},
  author={Kuchaiev, Oleksii and Ginsburg, Boris and Gitman, Igor and Lavrukhin, Vitaly and  Case, Carl and Micikevicius, Paulius},
  journal={arXiv preprint arXiv:1805.10387},
  year={2018}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1,467 Commits
ctc_decoder_with_lm		ctc_decoder_with_lm
docker		docker
docs		docs
example_configs		example_configs
open_seq2seq		open_seq2seq
scripts		scripts
.gitignore		.gitignore
.pylintrc		.pylintrc
.style.yapf		.style.yapf
AUTHORS		AUTHORS
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
Dockerfile_public		Dockerfile_public
Interactive_Infer_example.ipynb		Interactive_Infer_example.ipynb
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
run.py		run.py
tokenizer_wrapper.py		tokenizer_wrapper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenSeq2Seq: toolkit for distributed and mixed precision training of sequence-to-sequence models

Documentation and installation instructions

Features

Software Requirements

Acknowledgments

Disclaimer

Related resources

Paper

About

Releases

Packages

Languages

License

ka-bu/OpenSeq2Seq

Folders and files

Latest commit

History

Repository files navigation

OpenSeq2Seq: toolkit for distributed and mixed precision training of sequence-to-sequence models

Documentation and installation instructions

Features

Software Requirements

Acknowledgments

Disclaimer

Related resources

Paper

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages