lightning-hydra-template by ashleve

ML experimentation template using PyTorch Lightning + Hydra

Created 5 years ago

5,089 stars

Top 9.8% on SourcePulse

View on GitHub

5 Experts Love This Project

Jiayi Pan

Author of SWE-Gym; MTS at xAI

Christian Laforte

Distinguished Engineer at NVIDIA; Former CTO at Stability AI

Chuan Li

Chief Scientific Officer at Lambda

Jeff Hammerbacher

Cofounder of Cloudera

and 1 more!

Project Summary

This template provides a robust and user-friendly structure for deep learning projects, leveraging PyTorch Lightning and Hydra for efficient experimentation and configuration management. It's designed for researchers and engineers who need to quickly set up, manage, and scale their ML experiments, offering a clean boilerplate and MLOps best practices.

How It Works

The core of the template relies on Hydra for flexible configuration management, allowing dynamic composition and overriding of settings via YAML files and the command line. PyTorch Lightning handles the training loop, device management, and logging, abstracting away much of the boilerplate. Modules (models, datasets, callbacks) are dynamically instantiated using hydra.utils.instantiate based on paths defined in the configuration files, enabling easy swapping and iteration.

Quick Start & Requirements

Install: Clone the repository and install dependencies using pip install -r requirements.txt. PyTorch installation instructions should be followed separately.
Prerequisites: Python 3.8-3.10, PyTorch 2.0+, PyTorch Lightning 2.0+.
Resources: A basic MNIST example is provided. For larger models and datasets, GPU acceleration is recommended.
Docs: PyTorch Lightning, Hydra

Highlighted Details

Dynamic instantiation of PyTorch Lightning modules via Hydra configs.
Extensive command-line interface for controlling training, debugging, and hyperparameter sweeps.
Support for multiple experiment trackers (Tensorboard, W&B, Neptune, etc.).
Integrated testing framework with pytest and @RunIf decorator.
Built-in CI workflows for testing and code quality.

Maintenance & Community

This is an unofficial community project. Contributions are welcome via issues and pull requests. Links to community channels are not explicitly provided in the README.

Licensing & Compatibility

License: MIT License.
Compatibility: Permissive MIT license allows for commercial use and integration with closed-source projects.

Limitations & Caveats

The template notes that integration of evolving libraries like Lightning and Hydra can occasionally lead to breaking changes. It's also noted that the configuration setup is primarily for simple Lightning training and may require adjustments for more complex use cases like Lightning Fabric. Resuming Hydra multiruns or hyperparameter searches is not supported.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

46 stars in the last 30 days