awesome-prompt-injection by Joe-B-Security

Resource list for prompt injection attacks on ML models

Created 2 years ago

396 stars

Top 72.9% on SourcePulse

Project Summary

This repository serves as a curated collection of resources for understanding and mitigating prompt injection vulnerabilities in machine learning models, particularly those employing prompt-based learning. It targets AI researchers, security engineers, and developers working with LLMs, offering a centralized hub for articles, tutorials, research papers, and tools to combat this emerging threat.

How It Works

Prompt injection exploits the inability of ML models to differentiate between user-provided data and system instructions. Attackers craft malicious inputs that trick the model into executing unintended commands, potentially leading to data exfiltration, unauthorized actions, or behavioral manipulation. This collection provides insights into various attack vectors, including direct and indirect injection, and highlights techniques for detection and defense.

Quick Start & Requirements

Tools: Garak (Python 3.x) for LLM vulnerability scanning, Token Turbulenz (Python 3.x) for prompt injection fuzzing.
CTFs: Gandalf (requires interaction with a specific LLM setup), Promptalanche (scenario-based).
Resources: Links to articles, tutorials, and research papers are provided within the README.

Highlighted Details

Focuses on both direct and indirect prompt injection techniques.
Includes practical tools like Garak for automated LLM vulnerability scanning.
Features CTF challenges (Gandalf, Promptalanche) for hands-on learning.
Curates research papers detailing real-world attack scenarios and transferable adversarial attacks.

Maintenance & Community

Open to community contributions via contribution guidelines.
Links to the Learn Prompting Discord server for community discussion.

Licensing & Compatibility

The repository itself is not licensed. Individual tools and resources may have their own licenses.

Limitations & Caveats

This is a resource collection, not a software project with installable code. Practical application of tools requires separate setup and understanding of their dependencies.
Some CTF challenges may require specific LLM access (e.g., ChatGPT Plus with Browsing).

Health Check

Last Commit

3 months ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

20 stars in the last 30 days

Explore Similar Projects

Starred by

Andrey Vasnetsov

Andrey Vasnetsov(Cofounder of Qdrant) and

Andre Zayarni

Andre Zayarni(Cofounder of Qdrant).

aegis by automorphic-ai

LLM firewall for adversarial attack protection

Created 2 years ago

Updated 1 year ago

awesome-data-poisoning-and-backdoor-attacks by penghui-yang

Curated list of security research papers

Created 2 years ago

Updated 1 year ago

llm-security by dropbox

LLM security research code for prompt injection

Created 2 years ago

Updated 1 year ago

prompt-injection-defenses by tldrsec

Collection of prompt injection defenses

Created 1 year ago

Updated 10 months ago

Starred by

Elvis Saravia

Elvis Saravia(Founder of DAIR.AI).

PromptInject by agencyenterprise

Framework for LLM adversarial prompt attack analysis

Created 3 years ago

Updated 1 year ago

Open-Prompt-Injection by liu00222

Benchmark for prompt injection attacks and defenses in LLM apps

Created 2 years ago

Updated 2 months ago

arc_pi_taxonomy by Arcanum-Sec

A structured taxonomy for prompt injection attacks

Created 10 months ago

Updated 1 month ago

Starred by

Elie Bursztein

Elie Bursztein(Cybersecurity Lead at Google DeepMind).

awesome-ai-security by ottosulin

AI security resource collection

Created 2 years ago

Updated 3 days ago

advmlthreatmatrix by mitre

Adversarial ML threat matrix for security analysts

Created 5 years ago

Updated 2 years ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"),

Michele Castata

Michele Castata(President of Replit), and

3 more.

rebuff by protectai

SDK for LLM prompt injection detection

Created 2 years ago

Updated 1 year ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"),

Carol Willing

Carol Willing(Core Contributor to CPython, Jupyter), and

3 more.

llm-security by greshake

Research paper on indirect prompt injection attacks targeting app-integrated LLMs

Created 2 years ago

Updated 5 months ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"),

Elie Bursztein

Elie Bursztein(Cybersecurity Lead at Google DeepMind), and

3 more.

llm-guard by protectai

Security toolkit for LLM interactions

Created 2 years ago

Updated 3 weeks ago

Feedback? Help us improve.