Automated system for code-based scientific discovery
Top 92.3% on sourcepulse
CodeScientist is an end-to-end system for automating scientific discovery through code-based experiments. It targets researchers and engineers who want to automate the design, execution, and analysis of experiments, leveraging LLMs to generate novel hypotheses and implement them via a robust experiment builder. The system aims to accelerate scientific progress by reducing the manual effort in experimental setup and iteration.
How It Works
CodeScientist employs a "genetic mutation" approach, using LLMs to mutate combinations of scientific articles and code examples to generate novel experiment ideas. These ideas are then realized by an "Experiment Builder" that automatically creates, runs, and debugs the experiment code within containers. The system supports both human-in-the-loop and fully-automatic modes, generating reports and meta-analyses of experimental outcomes.
Quick Start & Requirements
conda create --name codescientist python=3.12
), activate it, and install dependencies (pip install -r requirements.txt
).api_keys.donotcommit.json
.texlive-full
on Ubuntu) for report generation.python src/CodeScientistWebServer.py
) and the frontend GUI (python src/CodeScientistWebInterface.py
).Highlighted Details
Maintenance & Community
The project is from Allen Institute for AI (AI2). For questions, contact Peter Jansen (peterj@allenai.org). For issues, bugs, or feature requests, submit a GitHub issue.
Licensing & Compatibility
Limitations & Caveats
1 month ago
Inactive