agentic_security  by msoedov

Open-source vulnerability scanner for LLMs and agent workflows

Created 1 year ago
1,674 stars

Top 25.3% on SourcePulse

GitHubView on GitHub
Project Summary

Agentic Security is an open-source toolkit for AI red teaming and vulnerability scanning, designed to protect Large Language Models (LLMs) from various attacks. It targets developers, researchers, and security teams seeking to proactively identify and mitigate risks in AI systems, offering robust defenses against jailbreaks, fuzzing, and multimodal threats.

How It Works

The tool employs an agentic approach, simulating sophisticated attack sequences and stress-testing LLMs with diverse inputs. It supports multimodal attacks (text, image, audio), multi-step jailbreaks, comprehensive fuzzing, API integration, and RL-based adaptive attacks. This methodology allows for proactive identification and mitigation of AI system vulnerabilities.

Quick Start & Requirements

  • Install via pip: pip install agentic_security
  • Run using the CLI: agentic_security or python -m agentic_security
  • Supports custom datasets via CSV files.
  • For detailed usage and advanced features, refer to the official documentation.

Highlighted Details

  • Supports multimodal attacks across text, image, and audio inputs.
  • Enables multi-step jailbreak simulations and comprehensive fuzzing.
  • Integrates with LLM APIs for stress testing and features RL-based attack capabilities.
  • Offers a CLI for initialization, listing datasets, and running CI checks.

Maintenance & Community

The project is actively developed, with a roadmap including RL-powered attacks, massive dataset expansion, daily attack updates, and community modules. Contributions are welcome via pull requests on GitHub.

Licensing & Compatibility

Released under the Apache License v2.0, allowing for commercial use and integration with closed-source projects.

Limitations & Caveats

The project is described as "just getting started," with some features like RL-powered attacks and extensive dataset expansion still under development. Custom integration examples for multimodal probes are provided, but specific requirements for these may vary.

Health Check
Last Commit

3 days ago

Responsiveness

Inactive

Pull Requests (30d)
8
Issues (30d)
1
Star History
55 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.