This repository provides "liberation prompts" designed to bypass safety filters and unlock unrestricted responses from large language models. It aims to enable users to explore AI capabilities beyond their intended guardrails, targeting researchers and power users interested in AI alignment and censorship circumvention.
How It Works
The project offers a collection of prompts that leverage creative phrasing, role-playing scenarios, and specific formatting to trick LLMs into ignoring their safety protocols. These prompts are designed to exploit potential weaknesses in the models' instruction-following mechanisms, allowing for the generation of content that would typically be blocked.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The effectiveness of these prompts may vary significantly across different LLM architectures and versions. The "harmless" claim is subjective and depends on the nature of the unrestricted output generated. The project's long-term viability is tied to the ongoing efforts by LLM developers to patch vulnerabilities exploited by these prompts.