AI red teaming training playground with challenges
This repository provides a set of AI Red Teaming challenges and infrastructure, based on Chat Copilot, designed for security professionals to systematically test AI systems. It offers a practical learning environment for identifying novel adversarial machine learning and Responsible AI failures beyond traditional security vulnerabilities.
How It Works
The playground environment leverages a modified Chat Copilot instance for each challenge, presenting users with specific objectives. These challenges simulate real-world scenarios, requiring users to employ techniques like prompt injection, metaprompt extraction, and guardrail bypasses to achieve defined goals, such as credential exfiltration or generating restricted content. The architecture is containerized using Docker Compose for ease of deployment.
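To make the idea of a guardrail bypass concrete, here is a toy sketch that uses no real model and no code from the repository. It assumes a hypothetical secret planted in a metaprompt and a hypothetical output filter that blocks responses containing that secret verbatim. All names (SECRET, naive_output_guardrail, normalized_output_guardrail) are invented for illustration.

```python
# Toy illustration only: no model is called and nothing here comes from
# the repository. SECRET and both guardrail functions are hypothetical.
SECRET = "s3cr3t-token"  # stand-in for a credential hidden in a metaprompt


def naive_output_guardrail(text: str) -> bool:
    """Return True if the response may be shown (no verbatim secret)."""
    return SECRET not in text


def _squash(s: str) -> str:
    """Lowercase and keep only letters and digits."""
    return "".join(ch for ch in s.lower() if ch.isalnum())


def normalized_output_guardrail(text: str) -> bool:
    """Return True if the response may be shown, matching on normalized text."""
    return _squash(SECRET) not in _squash(text)


# Simulated response after the model was coaxed into spelling the secret
# out character by character.
leaked = " ".join(SECRET)

print(naive_output_guardrail(SECRET))        # verbatim leak is blocked
print(naive_output_guardrail(leaked))        # spaced-out leak slips through
print(normalized_output_guardrail(leaked))   # normalized check catches it
```

The exact-match filter misses the spaced-out string, which is the kind of gap such challenges ask participants to find. Normalizing before matching closes this one variant, but other encodings would still need separate handling.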
Quick Start & Requirements
Start the environment with docker-compose up. Configuration is supplied through a .env file.
Highlighted Details
Challenges can be changed by editing challenges/challenges.json and regenerating the Docker Compose configuration.
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The repository's challenges are no longer updated for future courses, meaning they may not reflect the latest AI vulnerabilities or defenses. While CTFd and chat-score components are included for reference, they are not utilized in the default Docker Compose setup, limiting built-in progress tracking and scoring for self-hosted instances.