OpenBMB: End-to-end LLM agent infrastructure
Top 52.2% on SourcePulse
OpenBMB/AgentCPM provides end-to-end, open-source infrastructure for training and evaluating LLM agents. Aimed at researchers and developers, it offers a 4B-parameter agent model (AgentCPM-Explore) and a unified tool sandbox to speed up agent development and benchmarking on long-horizon tasks.
How It Works
The project centers on AgentCPM-Explore, a 4B-parameter LLM agent built for deep exploration: it sustains 100+ interaction turns, adjusts its strategy dynamically, and validates information across multiple sources. Its key advantage is state-of-the-art performance among models of its size, rivaling larger models. The accompanying infrastructure comprises AgentDock (tool sandbox), AgentRL (asynchronous training), and AgentToLeaP (evaluation), which together form a complete ecosystem for agent research.
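To make the long-horizon pattern described above concrete, here is a minimal, generic sketch of a turn-budgeted tool loop in which a toy policy consults two sources and finishes only when they agree. It is not AgentCPM's implementation: `run_agent`, `toy_policy`, the tool names, and the transcript format are all hypothetical and chosen purely for illustration.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical interfaces (not from AgentCPM): a policy reads the transcript
# and returns (tool_name, argument), or ("finish", answer) when it is done.
Policy = Callable[[List[str]], Tuple[str, str]]
Tool = Callable[[str], str]


def run_agent(policy: Policy, tools: Dict[str, Tool], task: str,
              max_turns: int = 100) -> Tuple[str, int]:
    """Run a tool-use loop until the policy finishes or the turn budget ends."""
    transcript = [f"task: {task}"]
    for turn in range(1, max_turns + 1):
        action, arg = policy(transcript)
        if action == "finish":
            return arg, turn
        tool = tools.get(action)
        observation = tool(arg) if tool else f"unknown tool: {action}"
        transcript.append(f"{action}({arg}) -> {observation}")
    return "", max_turns  # budget exhausted without an answer


def toy_policy(transcript: List[str]) -> Tuple[str, str]:
    """Ask two sources in turn, then answer only if both agree."""
    if len(transcript) == 1:
        return ("source_a", "capital of France")
    if len(transcript) == 2:
        return ("source_b", "capital of France")
    answers = [line.split(" -> ")[1] for line in transcript[1:]]
    return ("finish", answers[0] if answers[0] == answers[1] else "")


tools = {"source_a": lambda q: "Paris", "source_b": lambda q: "Paris"}
answer, turns = run_agent(toy_policy, tools, "capital of France")
print(answer, turns)
```

In a real agent the policy would be a model call and the tools would be served by a sandbox; the point of the sketch is only the shape of the loop: a bounded number of turns, observations fed back into the transcript, and an answer accepted after cross-checking sources.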
Quick Start & Requirements
Setup starts by launching the AgentDock tool sandbox (docker compose up -d). For evaluation, pull and start the Docker image yuyangfu/agenttoleap-eval:v1.0 (docker pull, then docker run -dit --gpus all ...). To run custom tasks, set the API keys, model details, and AgentDock URL in quickstart.py, then run python quickstart.py. Prerequisites are Docker and GPU access. Links to the models on Hugging Face and ModelScope are available.
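The steps above can be collected into a small dry-run helper. This is a sketch under stated assumptions: the source elides the remaining `docker run` arguments ("..."), so `extra_run_args` is a hypothetical stand-in for them, and placing the image name last is an assumption. The working directory for each command (for example, where the AgentDock compose file lives) is not specified in the source either.

```python
import shlex
import subprocess
from typing import List, Sequence

EVAL_IMAGE = "yuyangfu/agenttoleap-eval:v1.0"  # image name given in the source


def quickstart_commands(extra_run_args: Sequence[str] = ()) -> List[List[str]]:
    """Return the quick-start commands, in order, as argv lists.

    extra_run_args is a hypothetical placeholder for the `docker run`
    arguments that the source elides; supply them from the project README.
    """
    return [
        ["docker", "compose", "up", "-d"],   # launch the AgentDock sandbox
        ["docker", "pull", EVAL_IMAGE],      # fetch the evaluation image
        # Assumption: the image name is the final argument.
        ["docker", "run", "-dit", "--gpus", "all", *extra_run_args, EVAL_IMAGE],
        ["python", "quickstart.py"],         # run a custom task
    ]


def run_all(commands: List[List[str]], dry_run: bool = True) -> None:
    """Print each command; execute it only when dry_run is False."""
    for argv in commands:
        print("$ " + shlex.join(argv))
        if not dry_run:
            subprocess.run(argv, check=True)


run_all(quickstart_commands())  # dry run: prints the commands only
```

Keeping `dry_run=True` by default lets you review the exact commands before anything touches Docker; edit quickstart.py with your API keys, model details, and AgentDock URL before running the last step for real.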
Highlighted Details
Maintenance & Community
Developed collaboratively by THUNLP, Renmin University of China, ModelBest, and OpenBMB. No community channels (such as Discord or Slack) or public roadmap are listed. The "Latest News" date of 2026-01-12 appears to be a future placeholder.
Licensing & Compatibility
Released under the permissive Apache-2.0 license, generally allowing commercial use and integration into closed-source projects without significant restrictions.
Limitations & Caveats
Key components, including the Technical Report and the AgentRL framework, are marked "Coming Soon." By default, the QuickStart script skips automatic scoring and only demonstrates execution. The future-dated "Latest News" entry suggests that some of the listed information may be aspirational rather than current.