Discover and explore top open-source AI tools and projects—updated daily.
ucsb-mlsecSpecialized LLM for code vulnerability detection
Top 91.4% on SourcePulse
Summary VulnLLM-R addresses automated vulnerability detection in code by developing a specialized Large Language Model (LLM) with enhanced reasoning capabilities. It aims to significantly improve the accuracy and efficiency of identifying security flaws, targeting software engineers and security researchers.
How It Works This project fine-tunes LLMs for vulnerability detection, emphasizing "specialized reasoning." It constructs comprehensive datasets by merging and processing sources like PrimeVul, SecCodePLT, Juliet, Sven, and Arvo. A key innovation is generating and refining reasoning chains from other LLMs (e.g., DeepSeek-r1, QwQ) to deepen analytical understanding. The training employs Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO).
Quick Start & Requirements
Clone the repository, create a Conda environment with Python 3.11, activate it, and install dependencies via pip install -e . -e ./vulscan/train/LLaMA-Factory -e ./vulscan/model_zoo. Reproducing results requires specific dataset paths and model configurations. The paper is at arXiv:2512.07533. Testing commands suggest potential GPU requirements (e.g., --tp 2, --vllm). A web demo is mentioned but lacks a URL.
Highlighted Details
Maintenance & Community The project is associated with authors Yuzhou Nie, Hongwei Li, Chengquan Guo, Ruizhe Jiang, Zhun Wang, Bo Li, Dawn Song, and Wenbo Guo (via arXiv paper). No specific community channels (Discord/Slack), active forums, or roadmap details are provided in this README.
Licensing & Compatibility The license type and compatibility notes for commercial use or closed-source linking are not specified in the README, posing an adoption blocker.
Limitations & Caveats The project's license is unspecified. The URL for the mentioned web demo is missing. Running tests with commercial models requires users to provide their own API keys. Setup involves multi-step dataset processing and environment configuration.
1 month ago
Inactive