promptbeat  by tophant-ai

LLM Red-teaming and Safety Evaluation Toolkit

Created 2 weeks ago

New!

324 stars

Top 83.9% on SourcePulse

GitHubView on GitHub
Project Summary

<2-3 sentences summarising what the project addresses and solves, the target audience, and the benefit.> Promptbeat is a red-teaming toolkit designed for LLM safety evaluation. It enables users to generate adversarial test cases from curated seeds and assess model robustness against various attacks. The tool targets AI developers and researchers seeking to proactively identify and mitigate vulnerabilities in their large language models before deployment.

How It Works

The core approach leverages seed-driven generation of adversarial prompts, powered by the promptfoo backend. It facilitates multi-provider evaluation, allowing side-by-side comparisons of LLM responses to the same adversarial inputs. Promptbeat then calculates Attack Success Rate metrics, categorized by risk and provider, to quantify model weaknesses and generate interactive HTML reports for detailed analysis.

Quick Start & Requirements

Installation involves downloading a platform-specific release tarball, extracting it, and running the promptbeat binary.

  • Primary Install/Run: Download tarball, extract, then use ./bin/promptbeat <command>.
  • Prerequisites: An LLM provider API key (e.g., OPENAI_API_KEY).
  • Dependencies: No local Python, Node.js, or promptfoo installation is required as these are bundled (Python 3.12.13, Node.js 22.22.2, promptfoo 0.121.9).
  • Links: Example configurations are available in the examples/bootstrap directory.

Highlighted Details

  • Automated adversarial test case generation from curated seeds.
  • Multi-provider LLM evaluation with comparative analysis.
  • Attack Success Rate metrics broken down by risk category and provider.
  • Interactive HTML report generation for visualized results.
  • Self-contained release package eliminating local environment setup.

Maintenance & Community

No information on contributors, sponsorships, community channels, or roadmap is provided in the README.

Licensing & Compatibility

No license information is specified in the README. Compatibility for commercial use or closed-source linking cannot be determined without a license.

Limitations & Caveats

The tool requires external LLM provider API keys for operation. The current release (0.1.0) suggests it may be in an early development stage. Support is limited to the specific platforms for which release tarballs are provided.

Health Check
Last Commit

2 weeks ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
326 stars in the last 18 days

Explore Similar Projects

Feedback? Help us improve.