chemcrow-public  by ur-whitelab

Open-source package for reasoning-intensive chemical tasks

created 2 years ago
792 stars

Top 45.2% on sourcepulse

GitHubView on GitHub
1 Expert Loves This Project
Project Summary

ChemCrow is an open-source Python package designed to solve complex, reasoning-intensive chemistry tasks by augmenting large language models (LLMs) with specialized chemical tools. It targets researchers and practitioners in chemistry who need to leverage LLMs for tasks like molecular property prediction, retrosynthesis, and literature searching. The primary benefit is enabling LLMs to perform accurate chemical reasoning and operations through a unified interface.

How It Works

ChemCrow leverages the Langchain framework to orchestrate interactions between LLMs and a suite of chemical tools. It integrates RDKit for molecular manipulation, paper-qa for literature retrieval, and databases like PubChem. The system dynamically selects and utilizes these tools based on the user's query, enabling LLMs to perform tasks that would otherwise be beyond their inherent capabilities. This approach allows for accurate, tool-assisted chemical problem-solving.

Quick Start & Requirements

  • Install via pip: pip install chemcrow
  • Requires OpenAI API key. Optionally, a Serp API key can be used.
  • Self-hosting of RXN4Chem tools (retrosynthesis, reaction prediction) is supported via Docker containers, requiring GPU access for optimal performance.
  • Official documentation and HuggingFace demo are available.

Highlighted Details

  • Augments LLMs (e.g., GPT-4) with chemical tools like RDKit, paper-qa, PubChem, and chem-space.
  • Supports self-hosting of reaction prediction and retrosynthesis tools via Docker.
  • Aims to provide accurate solutions for reasoning-intensive chemical tasks.

Maintenance & Community

The project is associated with the paper "ChemCrow: Augmenting large-language models with chemistry tools" (arXiv:2304.05376). Further details on experiments are available under "ChemCrow runs".

Licensing & Compatibility

The README does not explicitly state the license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

This public repository does not include all tools described in the original paper due to API usage restrictions, meaning results may differ from the published work. The reliance on external APIs (like RXN4Chem by default) can lead to performance variability and dependency on API key availability.

Health Check
Last commit

7 months ago

Responsiveness

1+ week

Pull Requests (30d)
0
Issues (30d)
1
Star History
56 stars in the last 90 days

Explore Similar Projects

Starred by Jason Liu Jason Liu(Author of Instructor) and Ross Taylor Ross Taylor(Cofounder of General Reasoning; Creator of Papers with Code).

Search-R1 by PeterGriffinJin

1.3%
3k
RL framework for training LLMs to use search engines
created 5 months ago
updated 3 weeks ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Calvin French-Owen Calvin French-Owen(Coounder of Segment), and
2 more.

ReAct by ysymyth

0.8%
3k
GPT-3 prompting code for ReAct research paper
created 2 years ago
updated 1 year ago
Feedback? Help us improve.