cumulative-reasoning by iiis-ai

Research paper implementation for cumulative reasoning with LLMs

Created 2 years ago

308 stars

Top 87.3% on SourcePulse

View on GitHub

3 Experts Love This Project

Yiran Wu

Coauthor of AutoGen

Benny Chen

Cofounder of Fireworks AI

Jeff Hammerbacher

Cofounder of Cloudera

Project Summary

This repository provides the official implementation for "Cumulative Reasoning With Large Language Models," a method designed to enhance large language model performance on complex reasoning tasks, particularly mathematics. It targets researchers and practitioners seeking to improve LLM accuracy and efficiency in problem-solving, offering significant gains over existing techniques like Tree-of-Thoughts and Program-Aided Language models.

How It Works

The core innovation is "Cumulative Reasoning" (CR), a technique that iteratively builds upon previous reasoning steps by accumulating context. This approach, implemented in the CR Agent, leverages a minimalist Python string concatenation strategy without external frameworks like Langchain. It achieves state-of-the-art results on the MATH dataset, demonstrating superior performance, especially on challenging Level 5 problems, by effectively managing and expanding the reasoning context.

Quick Start & Requirements

Install via Conda: conda create -n cr python==3.10 followed by conda activate cr and pip install -r requirements.txt.
Requires Python 3.10.
Official demo for CR Agent Assistant: https://chat.openai.com/g/g-L3a4ZCIHx-cr-agent-v0-1
Further details in subdirectory READMEs.

Highlighted Details

Achieved 72.2% accuracy on the MATH dataset with a code environment using GPT-4-1106-preview, a +20.2% improvement over PAL.
Demonstrated 100% success rate and 0.08s per sample on the Game of 24 using MP-CR-Agent-XML v0.2.
Outperformed ToRA by 12.7% relative on Level 5 MATH problems.
Utilizes "Meta Prompting," focusing on the structure and syntax of examples.

Maintenance & Community

The project is based on Guidance, HuggingFace, Tree of Thoughts, and ToRA. Contact information for questions is provided.

Licensing & Compatibility

The repository does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The CR Agent v0.1 implementation is described as minimalist. Performance claims are based on specific GPT-4 versions and experimental setups, and may vary with different models or environments.

Health Check

Last Commit

5 months ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History

2 stars in the last 30 days