promptfoo  by promptfoo

CLI tool for LLM prompt/agent/RAG testing and red-teaming

created 2 years ago
7,798 stars

Top 6.8% on sourcepulse

GitHubView on GitHub
Project Summary

Promptfoo is a developer-focused, local tool designed to streamline the testing, evaluation, and security of Large Language Model (LLM) applications. It enables users to move beyond trial-and-error by providing automated testing, red teaming capabilities, and side-by-side model comparisons, ultimately aiming to help ship more secure and reliable AI applications.

How It Works

Promptfoo operates via a declarative configuration system, allowing users to define test cases, prompts, and evaluation metrics. It supports a wide array of LLM providers, including OpenAI, Anthropic, Azure, Bedrock, and Ollama, facilitating direct comparison of model performance. The tool emphasizes a developer-first approach with features like live reload and caching for rapid iteration, and it runs entirely locally to ensure prompt privacy.

Quick Start & Requirements

Highlighted Details

  • Facilitates red teaming and vulnerability scanning for LLMs.
  • Supports comparison of multiple LLM providers (OpenAI, Anthropic, Azure, Bedrock, Ollama, etc.).
  • Integrates with CI/CD pipelines for automated checks.
  • Offers features like live reload and caching for developer efficiency.

Maintenance & Community

  • Active community on Discord.
  • Open source with a contributing guide.

Licensing & Compatibility

  • MIT License.
  • Permissive license suitable for commercial use and closed-source linking.

Limitations & Caveats

The tool is primarily command-line driven, with a focus on local execution. While it supports numerous LLM providers, integration with specific or self-hosted models not exposed via standard APIs might require custom configurations.

Health Check
Last commit

14 hours ago

Responsiveness

1 day

Pull Requests (30d)
381
Issues (30d)
34
Star History
1,491 stars in the last 90 days

Explore Similar Projects

Starred by Peter Norvig Peter Norvig(Author of Artificial Intelligence: A Modern Approach; Research Director at Google), Michael Han Michael Han(Cofounder of Unsloth), and
15 more.

open-interpreter by openinterpreter

0.1%
60k
Natural language interface for computers
created 2 years ago
updated 4 days ago
Feedback? Help us improve.