OpenNMT-tf by OpenNMT

Sequence learning toolkit for neural machine translation

Created 8 years ago

1,486 stars

Top 27.4% on SourcePulse

View on GitHub

3 Experts Love This Project

Ross Taylor

Cofounder of General Reasoning; Cocreator of Papers with Code

Chaoyu Yang

Founder of Bento

Sasha Rush

Research Scientist at Cursor; Professor at Cornell Tech

Project Summary

Neural machine translation and sequence learning using TensorFlow 2. OpenNMT-tf is a versatile, production-oriented toolkit for neural machine translation and general sequence learning tasks, built on TensorFlow 2. It empowers researchers and developers by offering a modular architecture, seamless integration with the TensorFlow ecosystem, and compatibility with optimized inference engines like CTranslate2, facilitating efficient deployment and experimentation.

How It Works

This toolkit leverages TensorFlow 2's capabilities, providing reusable Keras layers, multi-GPU/distributed training support, mixed-precision, and TensorBoard visualization. Its core design emphasizes modularity, allowing users to define custom sequence-to-sequence models, encoders, and decoders with ease, as demonstrated by its support for complex architectures like self-attentional encoders and RNN decoders. A key advantage is its dynamic data pipeline, which enables on-the-fly preprocessing and data augmentation without prior compilation, streamlining the training workflow.

Quick Start & Requirements

Installation: pip install OpenNMT-tf
Requirements: Python 3.7+, TensorFlow 2.6-2.13.
Resources: Official documentation, forum, and Gitter channel are available for further guidance.

Highlighted Details

Modular architecture supports custom model designs, multiple input features, and hybrid architectures.
Full TensorFlow 2 integration includes tf.distribute, Horovod, mixed precision, and SavedModel export for TensorFlow Serving.
Optimized inference compatibility with CTranslate2 for fast CPU/GPU execution and quantization.
Dynamic data pipeline allows on-the-fly tokenization and data augmentation.
Supports advanced training techniques like model fine-tuning, guided alignment, and various decoding strategies (e.g., beam search).

Maintenance & Community

The project provides access to a forum and a Gitter channel for community support and discussion. (Specific details on contributors, sponsorships, or roadmap are not provided in the README excerpt.)

Licensing & Compatibility

The license type and any specific compatibility restrictions for commercial use or closed-source linking are not explicitly stated in the provided README content.

Limitations & Caveats

No specific limitations, known bugs, or alpha status are mentioned in the provided README excerpt.

Health Check

Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

1 stars in the last 30 days