The_Prompt_Report  by trigaten

Research paper code for structured understanding of prompts via taxonomy

Created 1 year ago
375 stars

Top 75.7% on SourcePulse

GitHubView on GitHub
Project Summary

This repository provides the code for "The Prompt Report," a research project aiming to establish a structured understanding of prompt engineering in Generative AI. It offers tools for automated paper review, data collection, and experiment execution, targeting researchers and developers in the GenAI space.

How It Works

The project automates a systematic review of research papers related to prompt engineering. It utilizes scripts to collect papers, deduplicate and filter them, and then run various experiments to analyze prompting techniques. The core logic resides in src/prompt_systematic_review, with configurations managed in config_data.py and keywords for review in keywords.py.

Quick Start & Requirements

  • Install: pip install -r requirements.txt
  • Prerequisites: API keys for OpenAI, Hugging Face, and Semantic Scholar. Requires git lfs.
  • Setup: Create a .env file with API keys. Install pytest-dotenv for testing.
  • Data: Clone dataset from Hugging Face (datasets/PromptSystematicReview/ThePromptReport) and move to data/.
  • Run: python main.py (downloads papers, runs review, and experiments).
  • Docs: Website, Paper, Dataset

Highlighted Details

  • Automates systematic review of prompt engineering literature.
  • Includes a taxonomy of prompting techniques.
  • Allows customization of review keywords.
  • Experiments can be run individually.

Maintenance & Community

No specific contributors, sponsorships, or community links (Discord/Slack) are mentioned in the README.

Licensing & Compatibility

The repository's license is not explicitly stated in the provided README text.

Limitations & Caveats

The README notes potential discrepancies in paper titles between the arXiv API and actual paper content, which might affect automated retrieval. Some experiments, like graph_internal_references, are noted to have parallelism issues and are better run individually.

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
6 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Eugene Yan Eugene Yan(AI Scientist at AWS), and
1 more.

obsidian-copilot by eugeneyan

0.2%
553
Prototype assistant for writing and thinking
Created 2 years ago
Updated 1 year ago
Starred by Luca Soldaini Luca Soldaini(Research Scientist at Ai2), Shizhe Diao Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), and
1 more.

s2orc by allenai

0.3%
967
Corpus for NLP/text mining research on scientific papers
Created 5 years ago
Updated 1 year ago
Feedback? Help us improve.