Collection of jailbreak methods on LLMs
This repository is a curated collection of state-of-the-art research on jailbreaking Large Language Models (LLMs). It is aimed at researchers, security engineers, and practitioners who want to understand and mitigate LLM vulnerabilities, and serves as a reference for work on LLM safety and security.
How It Works
The collection categorizes jailbreak methods into attack types (e.g., black-box, white-box, multi-turn, multimodal) and defense strategies (learning-based, strategy-based, guard models). It compiles relevant papers, code repositories, datasets, and evaluation methodologies, offering a structured overview of the evolving landscape of LLM security research.
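The categorization described above could be modeled as a small data structure. The sketch below is purely illustrative: the `Entry` type, field names, and example entries are hypothetical and are not taken from the repository itself.

```python
from dataclasses import dataclass, field

@dataclass
class Entry:
    """One item in a curated collection: a paper plus its resources."""
    title: str
    kind: str                 # "attack" or "defense"
    category: str             # e.g. "black-box", "white-box", "multi-turn", "guard-model"
    links: dict = field(default_factory=dict)  # e.g. {"paper": ..., "code": ...}

def by_category(entries, kind, category):
    """Return all entries matching a given kind and category."""
    return [e for e in entries if e.kind == kind and e.category == category]

# Placeholder entries, not real papers from the list.
entries = [
    Entry("Example black-box attack", "attack", "black-box"),
    Entry("Example multi-turn attack", "attack", "multi-turn"),
    Entry("Example guard model", "defense", "guard-model"),
]

print([e.title for e in by_category(entries, "attack", "black-box")])
# → ['Example black-box attack']
```

A flat list of tagged entries like this mirrors how "awesome"-style collections are usually organized: one section per `(kind, category)` pair.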
Quick Start & Requirements
This repository is a curated list of research papers and code; there is nothing to install or run directly. For concrete implementations, follow the links to individual papers and their associated code repositories.
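Since the repository is a list of links rather than a tool, one plausible programmatic use is extracting the paper and code URLs from its README. The sketch below is a hypothetical helper, assuming the list uses standard Markdown link syntax; the sample line is illustrative, not copied from the repository.

```python
import re

# Matches Markdown links of the form [text](http...url).
LINK_RE = re.compile(r"\[([^\[\]]+)\]\((https?://[^)]+)\)")

def extract_links(markdown: str):
    """Return (text, url) pairs for every Markdown link in the input."""
    return LINK_RE.findall(markdown)

# Illustrative line in the style of an "awesome"-list entry.
sample = "- [Example Paper](https://arxiv.org/abs/0000.00000) [[code](https://example.com/repo)]"
print(extract_links(sample))
# → [('Example Paper', 'https://arxiv.org/abs/0000.00000'), ('code', 'https://example.com/repo')]
```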
Maintenance & Community
The repository is maintained by yueliu1999, with contributions welcomed via PRs and issues. Contact is available via email for specific inquiries. The project encourages citation of its featured papers.
Licensing & Compatibility
The repository itself is a collection of links and does not declare a license of its own. Each linked paper and code repository carries its own license, which users must adhere to.
Limitations & Caveats
This is a curated list and does not provide a unified framework or tool for performing jailbreaks or defenses. Users must navigate to individual resources for implementation details and potential dependencies. The rapid pace of LLM research means the content may require frequent updates to remain fully comprehensive.