huggingface/ml-intern: AI engineer for autonomous ML research, training, and deployment
Top 30.3% on SourcePulse
Summary
The huggingface/ml-intern project offers an open-source AI agent designed to automate complex machine learning workflows. It autonomously researches ML papers, writes and trains models, and deploys them, leveraging deep access to the Hugging Face ecosystem, including documentation, papers, datasets, and cloud compute. This tool is targeted at ML engineers and researchers seeking to accelerate their development cycles and streamline the process of bringing ML models to production.
How It Works
The agent operates through an agentic loop, managed by agent_loop.py, which processes user requests via a submission queue. Core components include a ContextManager that maintains message history and automatically compacts context at 170k tokens, with optional session uploads to Hugging Face. A ToolRouter orchestrates interactions with various resources: Hugging Face docs and research, repositories, datasets, jobs, papers, GitHub code search, local tools, and external MCP servers. Each loop iteration makes an LLM call, parses any tool calls, obtains user approval for sensitive operations (such as sandbox or destructive actions), executes tools via the ToolRouter, and feeds results back into the context. A Doom Loop Detector monitors for repetitive tool patterns and injects corrective prompts to prevent infinite loops; each agentic session is capped at 300 iterations.
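The doom-loop detection described above could be sketched roughly as follows. This is an illustrative assumption, not the project's actual code: the class name, window size, and repeat threshold are all hypothetical.

```python
from collections import deque

class DoomLoopDetector:
    """Illustrative sketch of repetitive-tool-call detection.

    The real project's detector internals are not documented in the
    summary; the window size and repeat threshold are assumptions.
    """

    def __init__(self, window: int = 6, max_repeats: int = 3):
        # Keep only the most recent `window` tool calls.
        self.recent = deque(maxlen=window)
        self.max_repeats = max_repeats

    def record(self, tool_name: str, args_signature: str) -> bool:
        """Record a tool call; return True if a loop is suspected."""
        call = (tool_name, args_signature)
        self.recent.append(call)
        return self.recent.count(call) >= self.max_repeats

# When record() returns True, the agent loop could inject a
# corrective prompt instead of executing the same tool again.
detector = DoomLoopDetector()
for _ in range(3):
    looping = detector.record("hf_docs_search", "query=transformers")
print(looping)  # True after three identical calls
```

Keying on both the tool name and an argument signature avoids flagging legitimate repeated use of the same tool with different inputs.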
Quick Start & Requirements
Installation involves cloning the repository, navigating into the directory, and executing uv sync followed by uv tool install -e .. The project requires a .env file in the root directory containing essential API keys: ANTHROPIC_API_KEY (if using Anthropic models), HF_TOKEN (for Hugging Face access, prompted on first launch if not set), and GITHUB_TOKEN (for GitHub integration). Usage can be interactive via the ml-intern command or in headless mode with a specific prompt, e.g., ml-intern "fine-tune llama on my dataset". Optional flags include --model to specify the LLM, --max-iterations to set the loop limit, and --no-stream to disable token streaming.
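Put together, the setup steps above look like the following. The repository URL is an assumption inferred from the project name; the commands, environment variables, and flags are taken from the description above.

```shell
# Clone and install (repository URL assumed from the project name)
git clone https://github.com/huggingface/ml-intern.git
cd ml-intern
uv sync
uv tool install -e .

# Required API keys in a .env file at the repository root
cat > .env <<'EOF'
ANTHROPIC_API_KEY=...   # if using Anthropic models
HF_TOKEN=...            # prompted on first launch if unset
GITHUB_TOKEN=...        # for GitHub integration
EOF

# Interactive session
ml-intern

# Headless mode with a specific prompt and optional flags
ml-intern "fine-tune llama on my dataset" --max-iterations 100 --no-stream
```

The `--model` flag can likewise be passed in either mode to select the LLM backend.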
Highlighted Details
Doom Loop Detector and context auto-compaction (170k tokens).
Maintenance & Community
No specific details regarding maintainers, community channels (e.g., Discord, Slack), or project roadmap were provided in the README excerpt.
Licensing & Compatibility
The license type is not explicitly stated in the provided README excerpt.
Limitations & Caveats
The agentic loop is capped at a maximum of 300 iterations. User approval is mandatory for executing sensitive operations, including sandbox interactions and destructive actions. The functionality is dependent on the availability and correct configuration of external API keys for LLM providers and GitHub.