InternBootcamp by InternLM

Scale LLM reasoning with diverse, verifiable tasks

Created 4 months ago
308 stars

Top 87.1% on SourcePulse

View on GitHub
Project Summary

InternBootcamp is an open-source framework providing 1000+ diverse, verifiable task environments for LLM reasoning research. It simplifies integrating varied reasoning tasks for model optimization, synthetic data generation, and evaluation. Its core benefit is improved LLM reasoning performance and training efficiency through "Task Scaling", which exposes models to a wide spectrum of tasks.

How It Works

The framework standardizes task integration with a unified interface for RL or synthetic data pipelines. Its key innovation, "Task Scaling," uses an automated agent workflow to synthesize 1000+ diverse, verifiable reasoning tasks across 8 domains. The workflow covers task collection, evolutionary code generation, and unittest filtering, enabling scalable expansion. The approach supports broad experiential learning and emergent abilities: tasks that are unsolvable in isolation become learnable through cross-task exposure.
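
The unified interface is easiest to picture as a small contract that every task environment satisfies: generate a verifiable instance, render it as a prompt, and score a model response. The sketch below is illustrative only; the names TaskEnvironment, generate_case, prompt, and verify are assumptions made for exposition, not InternBootcamp's actual API.

```python
# Hypothetical sketch of a verifiable task environment; these names are
# illustrative assumptions, not InternBootcamp's actual API.
import random
from abc import ABC, abstractmethod


class TaskEnvironment(ABC):
    """Unified contract: generate a case, render a prompt, verify a response."""

    @abstractmethod
    def generate_case(self, rng: random.Random) -> dict:
        """Produce a task instance carrying everything needed for verification."""

    @abstractmethod
    def prompt(self, case: dict) -> str:
        """Render the instance as a prompt for the model."""

    @abstractmethod
    def verify(self, case: dict, response: str) -> float:
        """Score the model's response, e.g. 1.0 for correct and 0.0 otherwise."""


class SumTask(TaskEnvironment):
    """Toy arithmetic bootcamp standing in for one of the 1000+ environments."""

    def generate_case(self, rng: random.Random) -> dict:
        a, b = rng.randint(1, 99), rng.randint(1, 99)
        return {"a": a, "b": b, "answer": a + b}

    def prompt(self, case: dict) -> str:
        return f"What is {case['a']} + {case['b']}? Answer with a number only."

    def verify(self, case: dict, response: str) -> float:
        return 1.0 if response.strip() == str(case["answer"]) else 0.0
```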

Quick Start & Requirements

  • Installation: Clone the repository (git clone https://github.com/InternLM/InternBootcamp.git), enter the directory (cd InternBootcamp), and install in editable mode (pip install -e .); a usage sketch follows this list.
  • Prerequisites: A Python environment and Git. No specific hardware or Python version requirements are explicitly stated.
  • Resources: Links to the project's paper, GitHub repository, and evaluation benchmarks are available.
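
As a rough picture of how an environment might be used after installation, the sketch below wires one into a simple generate-query-verify loop. It is a minimal sketch under stated assumptions: call_model is a placeholder for whatever LLM is being trained or evaluated, and env is assumed to follow the illustrative interface from "How It Works" above, not the project's documented API.

```python
# Hypothetical usage loop; call_model and the env interface are assumptions,
# not part of InternBootcamp's documented API.
import random


def call_model(prompt: str) -> str:
    # Placeholder: in practice this would query the LLM being trained or evaluated.
    return "42"


def collect_verified_samples(env, n: int, seed: int = 0) -> list[dict]:
    """Generate n task instances, query the model, and attach verifier scores."""
    rng = random.Random(seed)
    samples = []
    for _ in range(n):
        case = env.generate_case(rng)
        prompt = env.prompt(case)
        response = call_model(prompt)
        samples.append({
            "prompt": prompt,
            "response": response,
            "reward": env.verify(case, response),  # usable as an RL reward or a data filter
        })
    return samples
```

Scores collected this way can serve directly as rewards in an RL pipeline or as a filter when assembling synthetic training data.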

Highlighted Details

  • Features 1000+ complex reasoning tasks across 8 domains, including algorithms, puzzles, scientific reasoning, and benchmarks like ARC-AGI and BBEH.
  • Over 90% of tasks are automatically synthesized via an evolutionary pipeline, enabling rapid, scalable expansion.
  • Demonstrates "Task Scaling" improves LLM performance and training efficiency, fostering emergent abilities where tasks unsolvable in isolation become learnable.
  • InternThinker-GO, trained using the framework, achieves performance comparable to professional Go players, surpassing current LLM reasoning models.

Maintenance & Community

  • Recent updates include a v1.0 release and a technical report, both dated August 2025.
  • The project encourages community contributions for expanding task scope and verifying generated bootcamps.
  • Acknowledgments mention integrations with projects like Intern-S1, VeRL, Xtuner, and OpenCompass.

Licensing & Compatibility

  • The provided README does not specify a license; this needs clarification before adoption, particularly for commercial use or derivative works.

Limitations & Caveats

  • The framework excludes tasks requiring specific world-view knowledge, such as counterfactual reasoning.
  • The automated generation pipeline filters out bootcamps whose measured accuracy falls outside the range [0.03, 0.85], potentially excluding edge cases (see the sketch after this list).
  • Future-dated release information suggests the project may still be under active and potentially unstable development.
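
A minimal sketch of that filtering rule; how the pass rate is measured (which reference model, how many sampled attempts per bootcamp) is an assumption and not specified here.

```python
# Illustrative accuracy-band filter; the measurement procedure is assumed.
def keep_bootcamp(pass_rate: float, lo: float = 0.03, hi: float = 0.85) -> bool:
    """Keep a generated bootcamp only if the measured pass rate lies in [lo, hi]:
    neither nearly unsolvable nor trivially easy."""
    return lo <= pass_rate <= hi


assert keep_bootcamp(0.40)       # learnable difficulty -> kept
assert not keep_bootcamp(0.01)   # nearly unsolvable -> filtered out
assert not keep_bootcamp(0.95)   # trivially easy -> filtered out
```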

Health Check

  • Last Commit: 2 weeks ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 4
  • Issues (30d): 3
  • Star History: 123 stars in the last 30 days

Explore Similar Projects

Starred by Shizhe Diao (Author of LMFlow; Research Scientist at NVIDIA), Eric Zhu (Coauthor of AutoGen; Research Scientist at Microsoft Research), and 7 more.

reasoning-gym by open-thought

  • Procedural dataset generator for reasoning models
  • 1.2% · 1k stars
  • Created 7 months ago; updated 3 days ago
  • Starred by Eric Zhu (Coauthor of AutoGen; Research Scientist at Microsoft Research) and Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems").

PromptWizard by microsoft

  • Agent-driven framework for task-aware prompt optimization
  • 0.4% · 4k stars
  • Created 1 year ago; updated 1 month ago