textgrad by zou-group

Autograd engine for textual gradients, enabling LLM-driven optimization

Created 1 year ago
2,927 stars

Top 16.2% on SourcePulse

Project Summary

TextGrad enables automatic differentiation for text-based tasks by leveraging Large Language Models (LLMs) to provide gradient feedback. This framework allows users to define loss functions and optimize textual outputs, such as reasoning steps, code snippets, or prompts, using a PyTorch-like API. It's designed for researchers and developers working with LLMs who need to fine-tune or improve the quality of generated text through an iterative optimization process.

How It Works

TextGrad implements a novel "textual gradient" concept, where LLMs act as differentiators. Instead of numerical gradients, LLMs provide textual feedback on the quality or correctness of an output. This feedback is then used by a Textual Gradient Descent (TGD) optimizer to iteratively refine the textual variable, guided by a natural-language loss function. This approach allows optimization of complex, unstructured data like natural language, code, or even multimodal inputs.
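
Concretely, the abstractions mirror PyTorch: a Variable holds text instead of a tensor, TextLoss wraps a natural-language evaluation instruction, and TGD (Textual Gradient Descent) plays the optimizer's role. Below is a minimal sketch of the loop in the style of the project's README; the engine name and prompts are illustrative stand-ins:

```python
import textgrad as tg

# An LLM acts as the "differentiator" that writes textual feedback.
tg.set_backward_engine("gpt-4o", override=True)

# The text being optimized; requires_grad=True marks it as trainable.
solution = tg.Variable(
    "It takes 1.2 hours to dry 30 shirts.",
    role_description="solution to a word problem",
    requires_grad=True,
)

# The loss is a natural-language instruction for critiquing the variable.
loss_fn = tg.TextLoss("Critique this solution: is the reasoning correct?")

# backward() collects the LLM's textual feedback (the "gradients");
# step() rewrites the variable guided by that feedback.
optimizer = tg.TGD(parameters=[solution])
loss = loss_fn(solution)
loss.backward()
optimizer.step()
print(solution.value)  # the revised text
```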

Quick Start & Requirements

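The package installs from PyPI with pip install textgrad, and running it requires credentials for at least one supported LLM backend in the environment (for example, OPENAI_API_KEY for OpenAI models). A minimal inference sketch in the same PyTorch-like style, with the model name and question as illustrative stand-ins; the optimization loop sketched under How It Works then applies to the returned answer:

```python
import textgrad as tg

# Query an LLM directly; the question itself stays frozen (requires_grad=False).
model = tg.BlackboxLLM("gpt-4o")
question = tg.Variable(
    "If it takes 1 hour to dry 25 shirts under the sun, "
    "how long will it take to dry 30 shirts?",
    role_description="question to the LLM",
    requires_grad=False,
)
answer = model(question)  # returns a Variable holding the model's response
print(answer.value)
```
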
Highlighted Details

  • Published in Nature (March 2025).
  • Supports multiple LLM backends via litellm, including Bedrock, Together, and Gemini (see the engine sketch after this list).
  • Enables optimization of text, code, prompts, and multimodal inputs.
  • Features a PyTorch-like API for intuitive usage.

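As referenced above, backend selection goes through engine strings. A hedged sketch: the README describes the newer litellm-based engines as selected via an "experimental:" prefix on the model string; the get_engine helper and cache flag follow the README's usage, and the model identifier below is illustrative:

```python
import textgrad as tg

# Default engines are addressed directly by model name.
tg.set_backward_engine("gpt-4o", override=True)

# The newer litellm-based engines use an "experimental:" prefix; litellm then
# routes the request to the named provider (Bedrock, Together, Gemini, ...).
# The model identifier here is illustrative.
engine = tg.get_engine("experimental:gemini/gemini-1.5-pro", cache=False)
tg.set_backward_engine(engine, override=True)
```

Since the litellm engines are flagged experimental (see Limitations & Caveats below), the non-prefixed default engines remain the safer choice for production use.
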
Maintenance & Community

  • Active development with recent updates introducing new litellm-based engines.
  • Key contributors include Federico Bianchi and Mert Yuksekgonul.
  • Inspiration drawn from PyTorch, DSPy, Micrograd, ProTeGi, and Reflexion.

Licensing & Compatibility

  • The provided README does not state a license; check the repository itself for licensing terms and commercial-use compatibility.

Limitations & Caveats

  • The new litellm engines are experimental and may have issues.
  • The effectiveness of optimization is highly dependent on the quality of the LLM feedback and the defined loss function.
  • Requires access to LLM APIs, which may incur costs.
Health Check

  • Last Commit: 1 month ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 2
  • Issues (30d): 0
  • Star History: 97 stars in the last 30 days

Explore Similar Projects

Starred by Shizhe Diao (Author of LMFlow; Research Scientist at NVIDIA), Yineng Zhang (Inference Lead at SGLang; Research Scientist at Together AI), and 8 more.

EAGLE by SafeAILab

2k stars · Top 10.6% on SourcePulse
Speculative decoding research paper for faster LLM inference
Created 1 year ago · Updated 1 week ago
Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems") and Elvis Saravia (Founder of DAIR.AI).

NExT-GPT by NExT-GPT

4k stars · Top 0.1% on SourcePulse
Any-to-any multimodal LLM research paper
Created 2 years ago · Updated 4 months ago
Starred by Yineng Zhang (Inference Lead at SGLang; Research Scientist at Together AI), Lewis Tunstall (Research Engineer at Hugging Face), and 15 more.

torchtune by pytorch

5k stars · Top 0.2% on SourcePulse
PyTorch library for LLM post-training and experimentation
Created 1 year ago · Updated 1 day ago