aixcc-afc-archive by theori-io

AI cyber reasoning system for automated challenge participation

Created 1 year ago

272 stars

Top 94.6% on SourcePulse

Project Summary

This repository archives Theori's Cyber Reasoning System (CRS) submission for the DARPA AI Cyber Challenge (AIxCC). It is a snapshot of the final release code, which lets technical users study the architecture of an advanced AI security agent. The code is unsupported, but it remains a useful reference for AI security and agent development work. Anyone interested in commercial applications should contact Theori directly.

How It Works

The system, codenamed "Robo Duck," is a Cyber Reasoning System (CRS) that relies heavily on Large Language Models (LLMs) for its decision-making. It was designed for the AI Cyber Challenge environment, and its architecture likely involves agents that process tasks and interact with tools. LLM provider credentials (Anthropic, OpenAI, Google, Azure) are supplied through environment variables or token files, and the system is deployable via Docker.
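Because credentials come from environment variables, a small pre-flight check can confirm which providers are configured before a costly run starts. The sketch below is illustrative only: the variable names follow common provider SDK conventions and are assumptions, not names confirmed by the repository, and `configured_providers` is a hypothetical helper.

```python
import os
from typing import Mapping

# Assumed variable names, based on common provider SDK conventions.
# The repository's actual configuration keys may differ.
PROVIDER_ENV_VARS = {
    "Anthropic": "ANTHROPIC_API_KEY",
    "OpenAI": "OPENAI_API_KEY",
    "Google": "GEMINI_API_KEY",
    "Azure": "AZURE_OPENAI_API_KEY",
}


def configured_providers(env: Mapping[str, str] = os.environ) -> list[str]:
    """Return providers whose (assumed) key variable is set and non-blank."""
    return [
        name
        for name, var in PROVIDER_ENV_VARS.items()
        if env.get(var, "").strip()
    ]


if __name__ == "__main__":
    found = configured_providers()
    print("Configured providers:", ", ".join(found) or "none")
```

Passing the environment in as a mapping keeps the check easy to exercise with a plain dictionary, without touching real credentials.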

Quick Start & Requirements

Primary run command: first pull the image with docker pull ghcr.io/theori-io/crs:latest, then start the system with docker compose --profile main up --exit-code-from crs-main.
Prerequisites: API keys for LLM providers, Docker, Docker Compose.
Resource Footprint: The README warns of high LLM operating costs, which can exceed $1,000 per hour without careful model selection.
Links: Official documentation and architecture diagrams are referenced but not directly linked.
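To make the cost warning concrete, the arithmetic can be sketched as below. The $1,000 per hour figure from the warning is used only as an illustrative planning rate. Actual spend depends on model selection and workload, and both function names are hypothetical.

```python
def estimated_llm_cost(hours: float, hourly_rate_usd: float = 1000.0) -> float:
    """Projected LLM spend for a run of the given length at a flat hourly rate."""
    if hours < 0 or hourly_rate_usd < 0:
        raise ValueError("hours and hourly_rate_usd must be non-negative")
    return hours * hourly_rate_usd


def max_runtime_hours(budget_usd: float, hourly_rate_usd: float = 1000.0) -> float:
    """Longest run a budget can cover at a flat hourly rate."""
    if budget_usd < 0 or hourly_rate_usd <= 0:
        raise ValueError("budget must be non-negative and rate must be positive")
    return budget_usd / hourly_rate_usd


if __name__ == "__main__":
    # An 8-hour run at the illustrative rate.
    print(f"8 h at $1,000/h: ${estimated_llm_cost(8):,.0f}")
    # How long a $2,500 budget lasts at the same rate.
    print(f"$2,500 budget: {max_runtime_hours(2500):.1f} h")
```

A flat rate is a deliberate simplification. Real usage is bursty, so treating the result as an upper-bound planning number is safer than treating it as a forecast.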

Highlighted Details

Contains the complete code submitted for the final round of the DARPA AI Cyber Challenge (July-August 2025).
System is tuned for the AI Cyber Challenge environment, with significant LLM budget implications.
Includes Azure deployment scripts via Terraform.
Features an "Agent Log Viewer" for introspection of agent behavior, conversations, and tool calls, including serialized agent states for debugging.
Evaluation dashboard scripts are provided for visualizing performance data.

Maintenance & Community

This repository is provided for archival and historical purposes only and will not be supported or updated.
Users interested in commercial applications should contact Theori directly.
The README mentions Slack as the channel for accessing the evaluation dashboard.

Licensing & Compatibility

The specific open-source license is not stated in the provided README content.
Commercial use is possible via direct engagement with Theori.

Limitations & Caveats

Code may contain bugs, be outdated, or rely on permissioned data.
The repository will not receive support or updates.
Significant operational costs associated with LLM usage are a major caveat.
The lack of a stated license is a blocker for adoption.

Health Check

Last Commit: 11 months ago
Responsiveness: Inactive
Pull Requests (30d): 0
Issues (30d): 0

Star History

3 stars in the last 30 days

Explore Similar Projects

Symbio by 854875058

AI infrastructure for controllable, observable, and evolvable multi-agent systems

Created 1 month ago

Updated 2 days ago

JoinAI-Agent by opencmit

AI agent engine for complex task automation

Created 5 months ago

Updated 5 months ago

station by cloudshipai

Runtime for building and deploying AI sub-agents

Created 11 months ago

Updated 5 months ago

BoxPwnr by 0ca

Agentic LLM security challenge solver

Created 1 year ago

Updated 2 weeks ago

Starred by

Dan Guido (Cofounder of Trail of Bits).

JoySafeter by jd-opensource

Enterprise AI platform for autonomous security agent teams

Created 5 months ago

Updated 1 day ago

CoWork-OS by CoWork-OS

Local-first OS for AI agents with multi-channel, multi-provider support

Created 5 months ago

Updated 19 hours ago

evonic by anvie

Agentic AI platform for designing, deploying, and orchestrating intelligent agents

Created 2 months ago

Updated 20 hours ago

ai-platform-engineering by cnoe-io

AI Platform Engineering automation via multi-agent systems

Created 1 year ago

Updated 1 day ago

Tsec-Hackathon by Yeti-791

AI agents for intelligent cybersecurity penetration testing

Created 4 months ago

Updated 1 day ago

Starred by

Elie Bursztein (Cybersecurity Lead at Google DeepMind) and Deshraj Yadav (Cofounder of Mem0).

Cyber-AutoAgent by westonbrown

AI agent for autonomous cyber operations

Created 1 year ago

Updated 7 months ago

codefuse-chatbot by codefuse-ai

AI assistant for software development lifecycle, powered by multi-agent framework

Created 2 years ago

Updated 2 years ago

Decepticon by PurpleAILAB

Autonomous AI agents for offensive cybersecurity testing

Created 1 year ago

Updated 1 day ago
