SwiftSage by SwiftSage

Agent system for reasoning with LLMs via in-context reinforcement learning

Created 2 years ago

319 stars

Top 84.9% on SourcePulse

1 Expert Loves This Project

winglian

Founder of Axolotl AI

Project Summary

SwiftSage is a generative agent system designed for complex interactive reasoning tasks, mimicking human fast and slow thinking processes. It targets researchers and developers working with Large Language Models (LLMs) who need a flexible framework for enhancing LLM reasoning capabilities through in-context reinforcement learning. The system aims to improve LLM performance on tasks requiring planning, execution, and iterative refinement.

How It Works

SwiftSage v2 employs a "plan-ground-execute" paradigm, unifying task formats and using a Python executor for code-based solutions. It leverages in-context reinforcement learning, a tuning-free, prompting-based strategy, to adapt reasoning strategies. Feedback is generated by LLMs, acting as critics and reward signals to update the agent's approach. The system comprises a Swift Agent (smaller LM for intuitive reasoning), a Feedback Agent (larger LM for critique), and a Sage Agent (even larger LM for analytical thinking if the Swift Agent fails). The workflow involves the Swift Agent generating a plan and code, the executor running it, the Feedback Agent critiquing the output, and iterating or escalating to the Sage Agent if necessary.

Quick Start & Requirements

Install via pip: pip install git+https://github.com/SwiftSage/SwiftSage.git
Requires API keys for LLM providers (e.g., Together AI, Groq, SambaNova).
Environment variables for API keys and model IDs (e.g., TOGETHER_API_KEY, ENGINE, SWIFT_MODEL_ID, FEEDBACK_MODEL_ID, SAGE_MODEL_ID) must be set.
Example usage: swiftsage --problem "QUERY" --api_provider ${ENGINE} ...
Demo available: https://hf.co/spaces/swiftsage-ai/SwiftSage

Highlighted Details

Mimics human fast (intuitive) and slow (analytical) thinking.
Tuning-free, prompting-based in-context reinforcement learning.
Utilizes a Python executor for code-based problem-solving.
Iterative refinement loop with LLM-generated feedback.
Escalation path to a more powerful "Sage Agent" for complex tasks.

Maintenance & Community

Beta version (v2) is under active development; v1 code is available on the science_world branch.
Contact: Bill Yuchen Lin (email provided).
A retriever module is planned for future implementation.

Licensing & Compatibility

MIT License.
Compatible with commercial use and closed-source linking.

Limitations & Caveats

This is a beta version and may not be stable. The retriever module is not yet implemented, which is expected to further improve reasoning.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

3 stars in the last 30 days

Explore Similar Projects

sweet_rl by facebookresearch

LLM agents trained for collaborative reasoning

Created 8 months ago

Updated 6 months ago

Tool-Star by RUC-NLPIR

LLM multi-tool reasoning powered by reinforcement learning

Created 6 months ago

Updated 1 month ago

LLM-Planning-Papers by AGI-Edgerunners

Paper list for LLM planning research

Created 2 years ago

Updated 1 year ago

Starred by

Georgios Konstantopoulos

Georgios Konstantopoulos(CTO, General Partner at Paradigm).

agent-actors by shaman-ai

Agentic framework for parallelized LLM agent trees

Created 2 years ago

Updated 2 years ago

o1 by win4r

LLM reasoning chains via prompting strategies

Created 1 year ago

Updated 1 year ago

Starred by

Jason Knight

Jason Knight(Director AI Compilers at NVIDIA; Cofounder of OctoML),

Tim J. Baek

Tim J. Baek(Founder of Open WebUI), and

6 more.

awesome-o1 by srush

Bibliography for OpenAI's o1 project

Created 1 year ago

Updated 1 year ago

ReCode by FoundationAgents

LLM agent framework for unified planning and action via recursive code

Created 1 month ago

Updated 4 weeks ago

Awesome-System2-Reasoning-LLM by zzli2022

Survey paper for System 2 reasoning in LLMs

Created 9 months ago

Updated 5 months ago

M_GRPO by baibizhe

Stabilizing LLM reasoning with self-supervised RL

Created 2 months ago

Updated 1 month ago

Starred by

Casper Hansen

Casper Hansen(Author of AutoAWQ),

Pawel Garbacki

Pawel Garbacki(Cofounder of Fireworks AI), and

2 more.

rStar by zhentingqi

Research paper for improving small LLM reasoning via mutual reasoning

Created 1 year ago

Updated 10 months ago

thinkgpt by jina-ai

Python library for augmenting LLMs with agentic techniques

Created 2 years ago

Updated 1 year ago

Starred by

Vincent Weisser

Vincent Weisser(Cofounder of Prime Intellect),

Travis Fischer

Travis Fischer(Founder of Agentic), and

2 more.

mini-agi by muellerberndt

Simple autonomous agent for GPT-3.5/4

Created 2 years ago

Updated 2 years ago

Feedback? Help us improve.