Discover and explore top open-source AI tools and projects—updated daily.
promptfooCLI tool for LLM prompt/agent/RAG testing and red-teaming
Top 5.7% on SourcePulse
Promptfoo is a developer-focused, local tool designed to streamline the testing, evaluation, and security of Large Language Model (LLM) applications. It enables users to move beyond trial-and-error by providing automated testing, red teaming capabilities, and side-by-side model comparisons, ultimately aiming to help ship more secure and reliable AI applications.
How It Works
Promptfoo operates via a declarative configuration system, allowing users to define test cases, prompts, and evaluation metrics. It supports a wide array of LLM providers, including OpenAI, Anthropic, Azure, Bedrock, and Ollama, facilitating direct comparison of model performance. The tool emphasizes a developer-first approach with features like live reload and caching for rapid iteration, and it runs entirely locally to ensure prompt privacy.
Quick Start & Requirements
npx promptfoo@latest init followed by npx promptfoo evalHighlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The tool is primarily command-line driven, with a focus on local execution. While it supports numerous LLM providers, integration with specific or self-hosted models not exposed via standard APIs might require custom configurations.
7 hours ago
1 day
teknium1
ianarawjo
qodo-ai
comet-ml