Discover and explore top open-source AI tools and projects—updated daily.
elder-pliniusAdvanced toolkit for liberating LLMs from refusal behaviors
New!
Top 22.0% on SourcePulse
OBLITERATUS is an advanced open-source toolkit for understanding and removing refusal behaviors from large language models (LLMs) without retraining. It targets researchers and engineers seeking to liberate models from artificial gatekeeping while preserving core capabilities. The project also serves as a distributed research experiment, crowdsourcing data to advance LLM interpretability.
How It Works
The project implements "abliteration," a technique that identifies and surgically removes internal representations responsible for content refusal. This involves probing model states, extracting refusal directions via methods like SVD, and intervening at inference time. This approach enables precise model liberation without retraining, preserving general language and reasoning abilities, and contributes to a growing research dataset.
Quick Start & Requirements
pip install -e ".[spaces]" then obliteratus ui), Google Colab, CLI (pip install -e . then obliteratus obliterate ...), Python API.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The AGPL-3.0 license's network service clause may require a commercial license for certain deployments. Users opting out of telemetry will not contribute to the shared research dataset. The advanced analytical features may present a steep learning curve.
2 days ago
Inactive
openai
zjunlp