cumulative-reasoning  by iiis-ai

Research paper implementation for cumulative reasoning with LLMs

Created 2 years ago
302 stars

Top 88.2% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides the official implementation for "Cumulative Reasoning With Large Language Models," a method designed to enhance large language model performance on complex reasoning tasks, particularly mathematics. It targets researchers and practitioners seeking to improve LLM accuracy and efficiency in problem-solving, offering significant gains over existing techniques like Tree-of-Thoughts and Program-Aided Language models.

How It Works

The core innovation is "Cumulative Reasoning" (CR), a technique that iteratively builds upon previous reasoning steps by accumulating context. This approach, implemented in the CR Agent, leverages a minimalist Python string concatenation strategy without external frameworks like Langchain. It achieves state-of-the-art results on the MATH dataset, demonstrating superior performance, especially on challenging Level 5 problems, by effectively managing and expanding the reasoning context.

Quick Start & Requirements

  • Install via Conda: conda create -n cr python==3.10 followed by conda activate cr and pip install -r requirements.txt.
  • Requires Python 3.10.
  • Official demo for CR Agent Assistant: https://chat.openai.com/g/g-L3a4ZCIHx-cr-agent-v0-1
  • Further details in subdirectory READMEs.

Highlighted Details

  • Achieved 72.2% accuracy on the MATH dataset with a code environment using GPT-4-1106-preview, a +20.2% improvement over PAL.
  • Demonstrated 100% success rate and 0.08s per sample on the Game of 24 using MP-CR-Agent-XML v0.2.
  • Outperformed ToRA by 12.7% relative on Level 5 MATH problems.
  • Utilizes "Meta Prompting," focusing on the structure and syntax of examples.

Maintenance & Community

The project is based on Guidance, HuggingFace, Tree of Thoughts, and ToRA. Contact information for questions is provided.

Licensing & Compatibility

The repository does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The CR Agent v0.1 implementation is described as minimalist. Performance claims are based on specific GPT-4 versions and experimental setups, and may vary with different models or environments.

Health Check
Last Commit

3 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
0 stars in the last 30 days

Explore Similar Projects

Starred by Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), Eric Zhu Eric Zhu(Coauthor of AutoGen; Research Scientist at Microsoft Research), and
7 more.

reasoning-gym by open-thought

0.5%
1k
Procedural dataset generator for reasoning models
Created 9 months ago
Updated 4 weeks ago
Feedback? Help us improve.