Discover and explore top open-source AI tools and projects—updated daily.
Framework for evaluating LLMs on cybersecurity CTF challenges
Top 98.9% on SourcePulse
CTFKnow is a research framework for measuring and enhancing Large Language Models' (LLMs) capabilities in solving cybersecurity Capture-the-Flag (CTF) challenges. It automates data collection, knowledge extraction, question generation, and model evaluation, providing a comprehensive pipeline for cybersecurity AI research and education.
How It Works
The framework operates through a multi-stage pipeline. It begins by scraping CTF write-ups from CTFtime.org, prioritizing high-quality content. Next, it employs LLMs to extract universal cybersecurity knowledge and practical exploitation examples from these write-ups. This extracted knowledge is then used to automatically generate multiple-choice and open-ended questions, tailored to the difficulty of the original challenges. Finally, it evaluates LLM performance on these generated questions using various metrics.
Quick Start & Requirements
pip install -r requirements.txt
.export OPENAI_API_KEY="your-openai-api-key"
).Highlighted Details
Maintenance & Community
The project is associated with a research paper and welcomes contributions via pull requests. Development setup instructions are provided, including pre-commit hooks.
Licensing & Compatibility
This project is licensed under the MIT License, permitting commercial use and integration with closed-source projects.
Limitations & Caveats
The framework relies heavily on LLM APIs, which can incur costs and are subject to the performance and availability of those services. The quality of extracted knowledge and generated questions is dependent on the LLM used and the quality of the scraped write-ups.
2 months ago
Inactive