promptimize by preset-io

Prompt engineering toolkit for evaluating and testing prompts

Created 2 years ago

488 stars

Top 63.2% on SourcePulse

View on GitHub

1 Expert Loves This Project

Maxime Beauchemin

Author of Apache Airflow, Superset; Founder of Preset

Project Summary

Promptimize is a Python toolkit for structured prompt engineering and evaluation, targeting developers building AI-powered products. It brings test-driven development (TDD) principles to prompt engineering, enabling users to define, execute, and compare prompt performance across various models and parameters with confidence.

How It Works

Promptimize treats prompts as "prompt cases," similar to unit tests, each associated with evaluation functions that score responses on a scale of 0 to 1. These cases are organized into "suites" for batch execution. The framework supports dynamic prompt generation, hyperparameter tuning (temperature, max tokens), and efficient re-runs of only changed or failed cases to minimize API calls. It leverages Langchain for LLM interactions, offering integrations for Langchain-specific prompt structures.

Quick Start & Requirements

Install via pip: pip install promptimize
Requires an OpenAI API key set as an environment variable (OPENAI_API_KEY).
Example usage: p9e run ./examples --output ./report.yaml
Official Docs: Preset Blog Promptimize DOCS

Highlighted Details

Configuration as code for prompt cases, suites, and evaluations.
Expressive DSL for defining prompts and assertions.
Supports prompt weighting and categorization for nuanced reporting.
Includes pre_run and post_run hooks for advanced response processing (e.g., executing generated code).
AI-powered suite expansion to generate new prompt cases.

Maintenance & Community

Project creator is Maxime Beauchemin (Apache Superset, Apache Airflow).
Described as in "super early stages" (as of 0.2.0), encouraging contributions.
Blog post available: Mastering AI-Powered Product Development: Introducing Promptimize for Test-Driven Prompt Engineering

Licensing & Compatibility

License not explicitly stated in the README. Compatibility for commercial use or closed-source linking is therefore undetermined.

Limitations & Caveats

The project is in its early stages, indicating potential for rapid changes and evolving APIs. The license is not specified, which may pose a barrier for commercial adoption or integration into closed-source projects.

Health Check

Last Commit

1 month ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

3 stars in the last 30 days