determined by determined-ai

Open-source ML platform for simplifying training, tuning, tracking, and resource management

created 5 years ago
3,165 stars

Top 15.6% on sourcepulse

Project Summary

Determined AI is an open-source platform designed to streamline deep learning workflows for researchers and engineers. It simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management, aiming to accelerate model development and reduce cloud costs.

How It Works

Determined is used through a Python library, a command-line interface (CLI), and a web UI. The Python library lets users adapt existing PyTorch or TensorFlow code, either by organizing it into a trial class such as PyTorchTrial or TFKerasTrial or by calling the lower-level Core API directly. With the code in that form, Determined takes over distributed execution and hyperparameter search, which speeds up training and automates tuning. The CLI manages cluster deployment (local, AWS, GCP) and experiment execution, while the web UI visualizes training progress, resource utilization, and model artifacts.
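
As a rough illustration of how these pieces meet, an experiment is typically described by a configuration file that the CLI submits together with the model code. The sketch below builds such a configuration in Python and writes it out as JSON, which YAML parsers also accept. The field names follow Determined's experiment configuration schema as commonly documented and may differ between versions, so they should be checked against the documentation for the installed release. The experiment name, the `model_def:MyTrial` entrypoint, the metric name, and the hyperparameter ranges are hypothetical.

```python
import json
from pathlib import Path


def build_experiment_config(max_trials: int = 16) -> dict:
    """Return an illustrative experiment configuration as a plain dict.

    Assumption: keys mirror Determined's documented experiment config
    (searcher, hyperparameters, resources); verify against your version.
    """
    return {
        "name": "example-adaptive-search",   # hypothetical experiment name
        "entrypoint": "model_def:MyTrial",   # hypothetical module:class
        "hyperparameters": {
            # Sampled on a log scale between 10^-4 and 10^-1.
            "learning_rate": {"type": "log", "base": 10, "minval": -4, "maxval": -1},
            # Sampled as an integer in [32, 256].
            "hidden_size": {"type": "int", "minval": 32, "maxval": 256},
            # A bare value is treated as a constant.
            "global_batch_size": 64,
        },
        "searcher": {
            "name": "adaptive_asha",
            "metric": "validation_loss",     # hypothetical metric name
            "smaller_is_better": True,
            "max_trials": max_trials,
            "max_length": {"batches": 1000},
        },
        # Number of GPUs (slots) each trial trains on.
        "resources": {"slots_per_trial": 2},
    }


def write_config(path: Path, config: dict) -> Path:
    """Serialize the config as JSON and return the path written."""
    path.write_text(json.dumps(config, indent=2))
    return path
```

A file produced this way could then be submitted with a command of the form `det experiment create <config-file> <model-directory>`, assuming a cluster is already running and the model directory contains the referenced entrypoint.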

Quick Start & Requirements

  • Install CLI: pip install determined
  • Deployment: Use det deploy local cluster-up for local setup or det deploy aws up / det deploy gcp up for cloud.
  • Prerequisites: Python, Docker (for local deployment). Cloud deployment requires cloud provider credentials.
  • Resources: Cloud GPU instances are recommended for training.
  • Links: Examples, Documentation, Quick Start Guide

Highlighted Details

  • Supports distributed training across multiple GPUs and nodes.
  • Integrates advanced hyperparameter search algorithms such as Adaptive ASHA (based on asynchronous successive halving).
  • Provides robust experiment tracking, including code snapshots and configuration history.
  • Offers performance profiling and resource management for cost optimization.
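
To make the hyperparameter search point more concrete, here is a minimal sketch of synchronous successive halving, the idea behind the successive-halving family of search algorithms mentioned above. Many configurations start on a small budget, only the best fraction survives each round, and survivors receive a larger budget. This is a simplified teaching example, not Determined's implementation, which is asynchronous and adaptive. `successive_halving`, `toy_loss`, and the candidate ranges are hypothetical names and values chosen for illustration.

```python
import math
import random
from typing import Callable, Dict, List, Tuple

Config = Dict[str, float]


def successive_halving(
    configs: List[Config],
    evaluate: Callable[[Config, int], float],
    min_budget: int = 1,
    eta: int = 2,
) -> Tuple[Config, float]:
    """Return the surviving config and its last measured loss (lower is better)."""
    if not configs:
        raise ValueError("need at least one configuration")
    survivors = list(configs)
    budget = min_budget
    while True:
        # Score every survivor at the current budget; the index breaks ties
        # so that dicts are never compared directly.
        scored = sorted(
            ((evaluate(c, budget), i, c) for i, c in enumerate(survivors)),
            key=lambda item: item[:2],
        )
        if len(scored) == 1:
            loss, _, best = scored[0]
            return best, loss
        # Keep the top 1/eta fraction and multiply their budget by eta.
        keep = max(1, len(scored) // eta)
        survivors = [c for _, _, c in scored[:keep]]
        budget *= eta


def toy_loss(config: Config, budget: int) -> float:
    """Hypothetical objective: best at lr = 0.01, improving with more budget."""
    return abs(math.log10(config["lr"]) + 2.0) + 1.0 / budget


random.seed(0)
candidates = [{"lr": 10 ** random.uniform(-4, -1)} for _ in range(8)]
best, loss = successive_halving(candidates, toy_loss)
print(f"best lr={best['lr']:.5f}, loss={loss:.4f}")
```

With eight candidates and `eta=2`, the rounds evaluate 8, 4, 2, and 1 configurations at budgets 1, 2, 4, and 8, so most of the compute goes to the configurations that looked promising early.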

Maintenance & Community

Health Check

  • Last commit: 4 months ago
  • Responsiveness: Inactive
  • Pull requests (30d): 1
  • Issues (30d): 1
  • Star history: 37 stars in the last 90 days
