Sequence modeling toolkit for content generation research
Top 37.3% on sourcepulse
fairseq2 is a modular and extensible toolkit for training custom sequence modeling models, primarily for content generation tasks. It is designed for researchers and engineers working on advanced AI projects, offering a clean API and supporting large-scale, multi-GPU/multi-node training.
How It Works
fairseq2 is a complete rewrite of the original fairseq, adopting a less intrusive, extensible architecture. It leverages modern PyTorch features like torch.compile
and FSDP, and includes a C++ based streaming data pipeline for high throughput. Its extensibility is managed via a setuptools extension mechanism, allowing easy registration of new components without forking.
Quick Start & Requirements
pip install fairseq2
(ensure PyTorch is installed first, matching the fairseq2 variant).libsndfile
(install via system package manager or Homebrew).Highlighted Details
Maintenance & Community
Developed by Meta AI (FAIR). Contribution guidelines are available.
Licensing & Compatibility
MIT licensed. Compatible with commercial use.
Limitations & Caveats
No native Windows support; WSL2 is recommended. Pre-built packages require strict PyTorch/CUDA version matching due to C++ API compatibility issues. ARM64 macOS requires building from source for non-PyPI variants.
1 day ago
1 week