saplings by shobrook

Reasoning library for agentic tree search & tool use

Created 2 years ago

271 stars

Top 95.1% on SourcePulse

Project Summary

Saplings is a Python library for building AI agents that leverage tree search algorithms to improve reasoning and tool use. It targets developers and researchers aiming to create more capable agents for complex tasks like coding, Q&A, and web navigation, offering state-of-the-art performance on benchmarks.

How It Works

Saplings enables agents to explore multiple tool-use trajectories using algorithms like Monte Carlo Tree Search (MCTS), A*, and greedy best-first search. This allows agents to look ahead, evaluate different paths, and backtrack, reducing errors and enhancing decision-making compared to simpler chain-of-thought or ReAct approaches. The library integrates with LiteLLM for broad LLM support and allows customization of evaluation functions, prompts, and search parameters.

Quick Start & Requirements

Install via pip: pip install saplings
Requires Python.
Supports 100+ LLMs via LiteLLM (e.g., Model(model="openai/gpt-4o")).
Official Docs: https://github.com/shobrook/saplings

Highlighted Details

Achieves SOTA on HumanEval (92.7% coding), HotPotQA (63% Q&A/RAG), and VisualWebArena (26.4% web navigation) using MCTS.
Plug-and-play design allows integrating tree search with minimal code changes.
Supports MCTS, A*, and greedy best-first search algorithms, plus a baseline COTAgent.
Customizable evaluators, tool definitions (with JSON schema parameters), and agent configurations.

Maintenance & Community

Developed by shobrook.
Roadmap includes support for chat history, vision agents, and LLM call budgets.

Licensing & Compatibility

MIT License. Permissive for commercial use and integration with closed-source projects.

Limitations & Caveats

MCTS agent can be slow and expensive due to extensive LLM calls in worst-case scenarios.
The COTAgent provides a baseline but lacks the advanced reasoning capabilities of the search agents.

Health Check

Last Commit

5 months ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

2 stars in the last 30 days

Explore Similar Projects

Awesome-Agent-RL by 0russwest0

Curated collection of papers and resources for training LLM agents with RL

Created 10 months ago

Updated 3 months ago

LLM-Agent-Survey by xinzhel

Reading list for LLM agent research

Created 1 year ago

Updated 3 months ago

awesome-autonomous-gpt by ScarletPan

Curated list of autonomous AI agent projects and resources

Created 2 years ago

Updated 2 years ago

Tree-GRPO by AMAP-ML

LLM agent reinforcement learning with tree search

Created 3 months ago

Updated 3 months ago

awesome-deep-research-agent by ai-agents-2030

Curated research on deep research agents

Created 9 months ago

Updated 3 months ago

Agent_Foundation_Models by OPPO-PersonalAI

Agent foundation models for complex problem-solving

Created 5 months ago

Updated 4 months ago

Starred by

Phil Wang

Phil Wang(Prolific Research Paper Implementer),

Vincent Weisser

Vincent Weisser(Cofounder of Prime Intellect), and

1 more.

system-2-research by open-thought

Reasoning resources for AI systems, agents, and cognitive architectures

Created 1 year ago

Updated 10 months ago

LanguageAgentTreeSearch by lapisrocks

ICML 2024 research paper implementation

Created 2 years ago

Updated 1 year ago

agent-as-a-judge by metauto-ai

Agent-as-a-Judge framework for agentic system evaluation

Created 1 year ago

Updated 8 months ago

Starred by

Jon Bratseth

Jon Bratseth(Cofounder of Vespa).

KwaiAgents by KwaiKEG

Agent framework for information-seeking using LLMs

Created 2 years ago

Updated 1 year ago

Starred by

Omar Sanseviero

Omar Sanseviero(DevRel at Google DeepMind).

vscode-ai-toolkit by microsoft

VS Code extension for agent development

Created 2 years ago

Updated 5 days ago

Starred by

Toran Bruce Richards

Toran Bruce Richards(Founder of AutoGPT),

Elvis Saravia

Elvis Saravia(Founder of DAIR.AI), and

2 more.

agent-lightning by microsoft

Train any AI agent with rollouts and feedback

Created 6 months ago

Updated 2 days ago

Feedback? Help us improve.