promptsource  by bigscience-workshop

Toolkit for creating, sharing, and using natural language prompts

created 4 years ago
2,914 stars

Top 16.8% on sourcepulse

GitHubView on GitHub
Project Summary

PromptSource is a toolkit for creating, sharing, and utilizing natural language prompts for large language models, enabling better zero-shot generalization. It targets NLP researchers and engineers seeking to leverage or contribute to a growing repository of prompts.

How It Works

Prompts are defined as functions mapping dataset examples to natural language inputs and target outputs, written in Jinja templating language. This structured approach allows for systematic prompt creation, sharing, and application, facilitating reproducible research and efficient LLM task adaptation.

Quick Start & Requirements

Highlighted Details

  • Features a Public Pool of Prompts (P3) with ~2,000 English prompts for 170+ datasets as of January 2022.
  • Includes a web-based GUI for prompt creation and visualization.
  • Integrates seamlessly with the Hugging Face Datasets library.
  • Supports datasets requiring manual downloads via a specified cache directory.

Maintenance & Community

Developed as part of the BigScience project, a large collaborative initiative for open research on large language models.

Licensing & Compatibility

The repository does not explicitly state a license in the README.

Limitations & Caveats

Local installation for prompt creation requires Python 3.7, though this constraint can be modified for usage-only scenarios. Occasional ConnectionRefusedError may require restarting the application.

Health Check
Last commit

1 year ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
86 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.