clearml  by clearml

MLOps suite for experiment tracking, automation, and data management

created 6 years ago
6,184 stars

Top 8.5% on sourcepulse

GitHubView on GitHub
Project Summary

ClearML is an open-source MLOps/LLMOps platform designed to streamline AI development and production workflows. It offers automated experiment tracking, data management, pipeline orchestration, and model serving, targeting ML engineers and researchers seeking a unified solution for managing complex AI projects.

How It Works

ClearML integrates seamlessly into existing Python code with minimal modification, automatically logging experiments, code versions, environments, hyperparameters, and outputs. Its architecture comprises a Python SDK for instrumentation, a central ClearML Server for data storage and UI, and ClearML Agents for remote execution and orchestration. This approach automates reproducibility and scalability, reducing manual configuration and potential errors.

Quick Start & Requirements

  • Install via pip: pip install clearml
  • Initialize: clearml-init
  • Add two lines to Python code: from clearml import Task; task = Task.init(...)
  • Prerequisites: Python 3.x. Optional: GPU for agent execution.
  • Setup time: Under 2 minutes for basic experiment tracking.
  • Documentation: https://clear.ml/docs/

Highlighted Details

  • Comprehensive experiment tracking: logs code, environment, hyperparameters, resources, and artifacts.
  • MLOps/LLMOps automation: Orchestrates pipelines, remote execution, and scheduling.
  • Data management: Version control and management for datasets on object storage (S3, GCS, Azure).
  • Model serving: Cloud-ready, scalable serving with NVIDIA-Triton integration.
  • Fractional GPUs: Container-based GPU memory limitation.

Maintenance & Community

Licensing & Compatibility

  • License: Apache License, Version 2.0.
  • Compatible with commercial use and closed-source linking.

Limitations & Caveats

The demo server is public and not recommended for sensitive experiments. While the core SDK is straightforward, advanced orchestration and self-hosting require deeper understanding of its components.

Health Check
Last commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)
2
Issues (30d)
14
Star History
242 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Daniel Han Daniel Han(Cofounder of Unsloth), and
1 more.

airweave by airweave-ai

0.6%
3k
Semantic MCP server for AI agents
created 7 months ago
updated 2 days ago
Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), and
8 more.

higgsfield by higgsfield-ai

0.3%
3k
ML framework for large model training and GPU orchestration
created 7 years ago
updated 1 year ago
Starred by John Yang John Yang(Author of SWE-bench, SWE-agent), Lysandre Debut Lysandre Debut(Chief Open-Source Officer at Hugging Face), and
3 more.

cleanrl by vwxyzjn

0.5%
8k
RL algorithms implementation with research-friendly features
created 6 years ago
updated 3 weeks ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Ben Firshman Ben Firshman(Cofounder of Replicate), and
6 more.

Made-With-ML by GokuMohandas

0.4%
41k
ML course for production-grade applications
created 6 years ago
updated 11 months ago
Feedback? Help us improve.