The_Prompt_Report by trigaten

Research paper code for structured understanding of prompts via taxonomy

Created 2 years ago

381 stars

Top 74.9% on SourcePulse

1 Expert Loves This Project

Shyamal Anadkat

Research Scientist at OpenAI

Project Summary

This repository provides the code for "The Prompt Report," a research project aiming to establish a structured understanding of prompt engineering in Generative AI. It offers tools for automated paper review, data collection, and experiment execution, targeting researchers and developers in the GenAI space.

How It Works

The project automates a systematic review of research papers on prompt engineering. Scripts collect papers, deduplicate and filter them, and then run experiments that analyze prompting techniques. The core logic lives in src/prompt_systematic_review; configuration is managed in config_data.py, and the review keywords are defined in keywords.py.
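The deduplicate-and-filter step can be pictured with a minimal sketch. This is not the repository's implementation: the `KEYWORDS` list, the paper dictionaries with `title` and `abstract` fields, and every function name below are hypothetical, chosen only to illustrate the idea of dropping duplicate titles and keeping papers that mention a review keyword.

```python
import re

# Hypothetical keyword list for illustration; the project's real list lives in keywords.py.
KEYWORDS = ["prompt engineering", "few-shot prompting", "chain-of-thought"]


def normalize_title(title):
    """Lowercase and collapse punctuation/whitespace so near-identical titles compare equal."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def deduplicate(papers):
    """Keep the first paper seen for each normalized title."""
    seen = set()
    unique = []
    for paper in papers:
        key = normalize_title(paper["title"])
        if key not in seen:
            seen.add(key)
            unique.append(paper)
    return unique


def matches_keywords(paper, keywords):
    """Return True if any keyword appears in the title or abstract (case-insensitive)."""
    text = f"{paper['title']} {paper.get('abstract', '')}".lower()
    return any(keyword in text for keyword in keywords)


def review(papers, keywords=KEYWORDS):
    """Deduplicate first, then keep only papers that match at least one keyword."""
    return [p for p in deduplicate(papers) if matches_keywords(p, keywords)]
```

Deduplicating before filtering means each distinct title is checked against the keywords only once; a real pipeline would likely also compare identifiers such as DOIs, which this sketch leaves out.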

Quick Start & Requirements

Install: pip install -r requirements.txt
Prerequisites: API keys for OpenAI, Hugging Face, and Semantic Scholar; Git LFS.
Setup: Create a .env file containing the API keys. Install pytest-dotenv to run the tests.
Data: Clone the dataset from Hugging Face (datasets/PromptSystematicReview/ThePromptReport) and move it into data/.
Run: python main.py (downloads the papers, runs the review, and runs the experiments).
Docs: Website, Paper, Dataset
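
Before running the pipeline, it can help to confirm that the .env file actually supplies every key. The following is a small standard-library sketch of such a check. The variable names in `REQUIRED_KEYS` are assumptions, not names confirmed by the README, and `parse_env_file` and `missing_keys` are hypothetical helpers rather than part of the project.

```python
import os
from pathlib import Path

# Assumed variable names; check the repository's README for the exact ones it expects.
REQUIRED_KEYS = ("OPENAI_API_KEY", "HF_TOKEN", "SEMANTIC_SCHOLAR_API_KEY")


def parse_env_file(path):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    env_path = Path(path)
    if not env_path.exists():
        return values
    for line in env_path.read_text().splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Strip surrounding whitespace and optional single or double quotes.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def missing_keys(env_values, required=REQUIRED_KEYS):
    """Return required keys that are empty or absent in both the file and the process environment."""
    return [k for k in required if not (env_values.get(k) or os.environ.get(k))]


if __name__ == "__main__":
    missing = missing_keys(parse_env_file(".env"))
    if missing:
        print("Missing API keys:", ", ".join(missing))
    else:
        print("All expected API keys are set.")
```

The parser handles only plain KEY=VALUE lines; multi-line values and variable interpolation are out of scope for this sketch.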

Highlighted Details

Automates systematic review of prompt engineering literature.
Includes a taxonomy of prompting techniques.
Allows customization of review keywords.
Experiments can be run individually.

Maintenance & Community

No specific contributors, sponsorships, or community links (Discord/Slack) are mentioned in the README.

Licensing & Compatibility

The repository's license is not explicitly stated in the provided README text.

Limitations & Caveats

The README notes that paper titles returned by the arXiv API may differ from the titles in the papers themselves, which can affect automated retrieval. Some experiments, such as graph_internal_references, have parallelism issues and are better run individually.
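
One common way to tolerate small title discrepancies is fuzzy matching on normalized strings. The sketch below uses Python's standard `difflib` for this; it is an illustration under assumptions, not the approach the repository takes. The function names and the 0.9 threshold are hypothetical and would need tuning against real data.

```python
import re
from difflib import SequenceMatcher


def normalize(title):
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    return re.sub(r"[^a-z0-9]+", " ", title.lower()).strip()


def titles_match(api_title, paper_title, threshold=0.9):
    """Return True when two titles are similar enough to treat as the same paper.

    The threshold is an assumed value for illustration only.
    """
    ratio = SequenceMatcher(None, normalize(api_title), normalize(paper_title)).ratio()
    return ratio >= threshold
```

For example, `titles_match("A Survey of Prompting Methods", "A Survey of Prompting  Methods.")` treats the two strings as the same title because they are identical after normalization, while unrelated titles fall well below the threshold.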

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Star History

1 star in the last 30 days