Discover and explore top open-source AI tools and projects—updated daily.
ByteDance-SeedFramework for scaling multimodal model training across accelerators
Top 31.3% on SourcePulse
VeOmni is a PyTorch-native framework designed for scaling large model training across diverse accelerators. It targets researchers and engineers working with single- or multi-modal models, offering flexibility and control by avoiding rigid trainer classes and exposing the full training logic.
How It Works
VeOmni emphasizes a modular, trainer-free design, allowing users to integrate custom components and maintain linear training scripts for maximum transparency. It leverages PyTorch's native functions for broad compatibility and performance, supporting advanced parallelism strategies like DeviceMesh, FSDP1/2, and experimental expert parallelism. Features such as activation offloading and checkpointing are integrated to manage memory and improve efficiency.
Quick Start & Requirements
pip3 install veomni or pip3 install -e . for source installation.bash train.sh $TRAIN_SCRIPT $CONFIG.yamlHighlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The "veScale" component for FSDP is not yet available. Some advanced features like expert parallelism are marked as experimental. Performance benchmarks are pending a technical report.
13 hours ago
1 day
microsoft
volcengine
tunib-ai
ELS-RD
huggingface
facebookresearch
pytorch