OpenSeq2Seq by NVIDIA

TensorFlow toolkit for sequence-to-sequence model experimentation

Created 8 years ago

1,557 stars

Top 26.5% on SourcePulse

View on GitHub

6 Experts Love This Project

Piotr Dąbkowski

Cofounder of ElevenLabs

Andreas Jansson

Cofounder of Replicate

and 2 more!

Project Summary

OpenSeq2Seq is a research toolkit designed for efficient experimentation with sequence-to-sequence models across speech recognition, text-to-speech, and natural language processing tasks. It empowers researchers to explore various model architectures by providing robust support for distributed and mixed-precision training.

How It Works

Built on TensorFlow, OpenSeq2Seq offers pre-built components for common encoder-decoder architectures. Its core advantage lies in enabling efficient training through data-parallel distributed training across multiple GPUs and nodes, coupled with mixed-precision training capabilities optimized for NVIDIA Volta and Turing architectures. This approach significantly accelerates the experimentation cycle for complex sequence-to-sequence tasks.

Quick Start & Requirements

Install: pip install openseq2seq (or via source)
Prerequisites: Python >= 3.5, TensorFlow >= 1.10, CUDA >= 9.0, cuDNN >= 7.0, Horovod >= 0.13 (recommended for multi-GPU).
Documentation: https://nvidia.github.io/OpenSeq2Seq/

Highlighted Details

Supports models for Neural Machine Translation, Automatic Speech Recognition, Speech Synthesis, Language Modeling, and NLP tasks like sentiment analysis.
Features data-parallel distributed training for multi-GPU and multi-node setups.
Enables mixed-precision training for enhanced performance on compatible NVIDIA hardware.

Maintenance & Community

This is a research project and not an official NVIDIA product. No community links or active maintenance signals are provided in the README.

Licensing & Compatibility

The project is released under a permissive license, allowing for commercial use and integration with closed-source projects.

Limitations & Caveats

The toolkit is designated as a research project, implying potential for rapid changes, incomplete features, or lack of long-term support. The TensorFlow 1.x dependency may pose compatibility challenges with newer TensorFlow versions.

Health Check

Last Commit

4 years ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History

2 stars in the last 30 days