EleutherAI/concept-erasure: Concept erasure for neural representations
Summary
EleutherAI/concept-erasure implements LEAst-squares Concept Erasure (LEACE), a method for removing a specified concept from neural representations. It is aimed at machine learning practitioners who want to improve fairness (for example, by removing information about protected attributes) or interpretability (by observing how model behavior changes once a concept is removed). LEACE provably prevents linear classifiers from detecting the erased concept while changing the representation as little as possible, which helps preserve its usefulness for downstream tasks.
How It Works
LEACE is a closed-form solution derived from least-squares principles. It guarantees that no linear classifier can detect the targeted concept in the modified representation, while changing the representation as little as possible so that downstream tasks are minimally affected. The library exposes two classes. LeaceFitter supports incremental updates and uses O(d^2) memory, where d is the dimensionality of the representation. LeaceEraser is a compact representation of the fitted erasure function and uses O(dk) memory, where k is the dimensionality of the concept labels.
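The split between a fitter with O(d^2) state and an eraser with O(dk) state can be illustrated with a small NumPy sketch. This is not the library's code: the class name StreamingLeaceSketch and the helper erase are hypothetical. The sketch assumes the closed form given in the LEACE paper, r(x) = x - W^+ P (x - E[x]), where W is the whitening matrix of the representation and P is the orthogonal projector onto the column space of W times the cross-covariance between representation and concept. It accumulates plain running sums for brevity, which a production implementation would likely replace with a numerically more stable update.

```python
import numpy as np


class StreamingLeaceSketch:
    """Hypothetical illustration only; not the concept-erasure implementation."""

    def __init__(self, d, k):
        self.n = 0
        self.sum_x = np.zeros(d)
        self.sum_z = np.zeros(k)
        self.sum_xx = np.zeros((d, d))  # this d x d term is the O(d^2) state
        self.sum_xz = np.zeros((d, k))

    def update(self, X, Z):
        """Fold one batch (X: n x d, Z: n or n x k) into the running sums."""
        X = np.asarray(X, dtype=np.float64)
        Z = np.asarray(Z, dtype=np.float64).reshape(len(X), -1)
        self.n += len(X)
        self.sum_x += X.sum(axis=0)
        self.sum_z += Z.sum(axis=0)
        self.sum_xx += X.T @ X
        self.sum_xz += X.T @ Z

    def eraser(self, tol=1e-10):
        """Return (mean, left, right); left and right hold O(dk) numbers."""
        mu_x = self.sum_x / self.n
        mu_z = self.sum_z / self.n
        sigma_xx = self.sum_xx / self.n - np.outer(mu_x, mu_x)
        sigma_xz = self.sum_xz / self.n - np.outer(mu_x, mu_z)

        # Whitening matrix W = Sigma_XX^{-1/2} and its pseudo-inverse.
        evals, evecs = np.linalg.eigh(sigma_xx)
        keep = evals > tol * evals.max()
        evals, evecs = evals[keep], evecs[:, keep]
        W = (evecs / np.sqrt(evals)) @ evecs.T
        W_pinv = (evecs * np.sqrt(evals)) @ evecs.T

        # Orthonormal basis U for the column space of W @ Sigma_XZ.
        U, s, _ = np.linalg.svd(W @ sigma_xz, full_matrices=False)
        U = U[:, s > tol * s.max()]

        # Store W^+ U U^T W in factored form instead of as a d x d matrix.
        left = W_pinv @ U   # d x r, with r <= k
        right = U.T @ W     # r x d
        return mu_x, left, right


def erase(X, mu_x, left, right):
    """Apply r(x) = x - left @ right @ (x - mu_x) to each row of X."""
    return X - ((X - mu_x) @ right.T) @ left.T


# Toy data: the binary concept z depends on the first feature of X.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 8))
z = (X[:, 0] + 0.5 * rng.normal(size=1000) > 0).astype(float)

# Fit in four batches to mimic a streaming setting.
fitter = StreamingLeaceSketch(d=8, k=1)
for start in range(0, 1000, 250):
    fitter.update(X[start:start + 250], z[start:start + 250])

mu_x, left, right = fitter.eraser()
X_erased = erase(X, mu_x, left, right)

# Cross-covariance between the erased features and the concept.
cross_cov = (X_erased - X_erased.mean(axis=0)).T @ (z - z.mean()) / len(X)
```

After erasure, cross_cov should be zero up to floating-point error. A zero cross-covariance is the condition under which no linear classifier can do better than a constant prediction, which is the guarantee described above.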
Quick Start & Requirements
pip install concept-erasure
Highlighted Details
LeaceFitter.update() for streaming data scenarios.
Maintenance & Community
No specific details on maintainers, community channels (e.g., Discord/Slack), or roadmap were found in the provided README.
Licensing & Compatibility
The README does not specify a license, so the licensing terms should be confirmed before commercial use or integration.
Limitations & Caveats
The concept scrubbing implementation is described as "messy" and subject to refactoring. Provable guarantees are limited to linear classifiers. Tagged datasets for experiments are pending upload.
1 year ago
Inactive