codescientist by allenai

Automated system for code-based scientific discovery

created 4 months ago
287 stars

Top 92.3% on sourcepulse

View on GitHub
Project Summary

CodeScientist is an end-to-end system for automating scientific discovery through code-based experiments. It targets researchers and engineers who want to automate the design, execution, and analysis of experiments, leveraging LLMs to generate novel hypotheses and implement them via a robust experiment builder. The system aims to accelerate scientific progress by reducing the manual effort in experimental setup and iteration.

How It Works

CodeScientist employs a "genetic mutation" approach, using LLMs to mutate combinations of scientific articles and code examples to generate novel experiment ideas. These ideas are then realized by an "Experiment Builder" that automatically creates, runs, and debugs the experiment code within containers. The system supports both human-in-the-loop and fully-automatic modes, generating reports and meta-analyses of experimental outcomes.
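
The summary above describes this loop only at a high level, so the following is a minimal, hypothetical Python sketch of the mutate-build-debug cycle. Every name here (call_llm, run_in_container, propose_idea, build_and_run), the prompt wording, and the stubbed behavior are illustrative assumptions and do not reflect CodeScientist's actual code or interfaces.

```python
import random

# Hypothetical stand-ins for an LLM client and a container runner; in the real
# system the LLM calls go to a provider API and execution happens on Modal.com.
def call_llm(prompt: str) -> str:
    """Placeholder: a real implementation would call an LLM provider API."""
    return f"IDEA/CODE derived from a {len(prompt)}-character prompt"

def run_in_container(code: str) -> dict:
    """Placeholder: a real implementation would execute the code in a container."""
    return {"ok": True, "log": "experiment finished"}

def propose_idea(papers: list[str], codeblocks: list[str]) -> str:
    """'Genetic mutation' step: pair a paper with a code example and ask the
    LLM to propose a novel experiment that combines them."""
    paper, block = random.choice(papers), random.choice(codeblocks)
    prompt = (
        "Propose a novel, code-based experiment that combines the idea of the "
        f"paper '{paper}' with the capability of the code example '{block}'."
    )
    return call_llm(prompt)

def build_and_run(idea: str, max_debug_rounds: int = 3) -> dict:
    """Experiment-builder step: generate code for the idea, run it in a
    container, and iteratively ask the LLM to repair failures."""
    code = call_llm(f"Write experiment code for: {idea}")
    for _ in range(max_debug_rounds):
        result = run_in_container(code)
        if result["ok"]:
            return result
        code = call_llm(f"Fix this code given the log {result['log']}:\n{code}")
    return result

if __name__ == "__main__":
    papers = ["paper on chain-of-thought prompting"]
    codeblocks = ["codeblock: simple LLM agent loop"]
    idea = propose_idea(papers, codeblocks)
    print(idea)
    print(build_and_run(idea))
```

In the real system, the container step corresponds to execution on Modal.com, and the repair loop is what the summary calls automatic building and debugging of experiment code.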

Quick Start & Requirements

  • Installation: Clone the repository, create a conda environment (conda create --name codescientist python=3.12), activate it, and install dependencies (pip install -r requirements.txt).
  • Prerequisites:
    • Python 3.12
    • Modal.com account for containerized experiment execution.
    • API keys for LLM providers (OpenAI, Anthropic, etc.) configured in api_keys.donotcommit.json.
    • LaTeX distribution (e.g., texlive-full on Ubuntu) for report generation.
  • Setup: Requires signing up for Modal.com and configuring API keys. Processing the paper corpus (for ideation) takes approximately 40 minutes.
  • Running: Start the backend server (python src/CodeScientistWebServer.py) and the frontend GUI (python src/CodeScientistWebInterface.py); see the consolidated command sketch after this list.
  • Documentation: Quick Start, Installation and Running, Usage.
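
Assuming conda is installed, the Modal.com account and API keys are already configured, and the repository lives at github.com/allenai/codescientist (URL inferred from the project and organization names, not stated above), the steps in this list amount to roughly the following shell session:

```bash
# Clone and set up the environment (repository URL assumed from project name).
git clone https://github.com/allenai/codescientist.git
cd codescientist
conda create --name codescientist python=3.12
conda activate codescientist
pip install -r requirements.txt

# Add LLM provider keys to api_keys.donotcommit.json before running.

# Start the backend server, then the frontend GUI in a second terminal.
python src/CodeScientistWebServer.py
python src/CodeScientistWebInterface.py
```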

Highlighted Details

  • Generates experiment ideas by mutating scientific papers and code examples using LLMs.
  • Automatically builds, runs, and debugs experiments in Modal.com containers.
  • Supports human-in-the-loop refinement of ideas and experiment plans.
  • Includes a "Hello World" example and a more complex LLM-based addition problem experiment.
  • Allows adding custom codeblocks to extend the system's capabilities.

Maintenance & Community

The project is developed by the Allen Institute for AI (AI2). For questions, contact Peter Jansen (peterj@allenai.org). For issues, bugs, or feature requests, submit a GitHub issue.

Licensing & Compatibility

  • License: Apache 2.0.
  • Compatibility: Designed for Ubuntu 22.04 containers; may work with modifications on macOS and Windows.

Limitations & Caveats

  • LLM-generated code can occasionally fail or require manual intervention to debug.
  • Cost estimates for LLM calls are approximate and not foolproof; users must monitor API key usage and set hard limits.
  • PDF report generation may not work in all browsers (e.g., Firefox).
  • The system relies on Modal.com for container execution, which has associated costs and free tier limitations.

Health Check

  • Last commit: 1 month ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 48 stars in the last 90 days
