Framework for scaling multimodal model training across accelerators
Top 74.0% on sourcepulse
VeOmni is a PyTorch-native framework designed for scaling large model training across diverse accelerators. It targets researchers and engineers working with single- or multi-modal models, offering flexibility and control by avoiding rigid trainer classes and exposing the full training logic.
How It Works
VeOmni emphasizes a modular, trainer-free design, allowing users to integrate custom components and maintain linear training scripts for maximum transparency. It leverages PyTorch's native functions for broad compatibility and performance, supporting advanced parallelism strategies like DeviceMesh, FSDP1/2, and experimental expert parallelism. Features such as activation offloading and checkpointing are integrated to manage memory and improve efficiency.
Quick Start & Requirements
pip3 install veomni
or pip3 install -e .
for source installation.bash train.sh $TRAIN_SCRIPT $CONFIG.yaml
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The "veScale" component for FSDP is not yet available. Some advanced features like expert parallelism are marked as experimental. Performance benchmarks are pending a technical report.
1 day ago
1 day