Zero-shot machine-generated text detection via conditional probability curvature
Top 85.3% on sourcepulse
This repository provides the code for "Fast-DetectGPT," an efficient zero-shot method for detecting machine-generated text. It targets researchers and developers working on AI safety, content authenticity, and natural language processing, offering a significantly faster and more accurate alternative to existing methods like DetectGPT.
How It Works
Fast-DetectGPT leverages the concept of "conditional probability curvature" to distinguish between human-written and AI-generated text. Unlike DetectGPT, which relies on extensive sampling from a language model, Fast-DetectGPT uses a more efficient approach by analyzing the curvature of conditional probabilities. This method achieves a substantial speedup (up to 340x) while improving detection accuracy (AUROC up to 0.9887).
Quick Start & Requirements
setup.sh
.python scripts/local_infer.py
(default models: gpt-neo-2.7B).python scripts/local_infer.py --sampling_model_name gpt-j-6B
.Highlighted Details
Maintenance & Community
The project is associated with the ICLR 2024 paper. No specific community channels or active maintenance signals are provided in the README.
Licensing & Compatibility
The repository does not explicitly state a license. It mentions borrowing code from DetectGPT, whose license should be considered. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The project's experimental setup relies on specific hardware (Tesla A100 GPU with 80GB memory), and performance may vary on different configurations. The licensing status is unclear, potentially impacting commercial adoption.
4 months ago
1 day