Discover and explore top open-source AI tools and projects—updated daily.
zhipeixuExplainable image forgery detection and localization using MLLMs
Top 82.2% on SourcePulse
FakeShield is a novel framework for explainable image forgery detection and localization (e-IFDL), targeting researchers and practitioners in digital forensics and AI security. It leverages multi-modal large language models (MLLMs) to not only identify manipulated regions but also provide human-understandable explanations for the detected forgeries, addressing the opacity of traditional methods.
How It Works
FakeShield integrates a Domain Tag-guided Explainable Forgery Detection Module (DTE-FDM) and a Multimodal Forgery Localization Module (MFLM). The DTE-FDM analyzes pixel-level artifacts and semantic inconsistencies, guided by domain tags to recognize various manipulation techniques. The MFLM then localizes these manipulations and generates textual explanations, enhancing interpretability. This multi-modal approach aims for improved generalization and robustness across diverse forgery types.
Quick Start & Requirements
zhipeixu/dte-fdm and zhipeixu/mflm.zhipeixu/fakeshield-v1-22b) and SAM pre-trained weights.scripts/cli_demo.sh) is provided.Highlighted Details
Maintenance & Community
The project is associated with Peking University. Links to arXiv, Hugging Face checkpoints and datasets, and project pages are provided. Related projects like AvatarShield and EditGuard are also highlighted.
Licensing & Compatibility
The project is licensed under Apache 2.0, permitting commercial use and closed-source linking.
Limitations & Caveats
The README emphasizes using Docker for environment setup to reproduce paper results, suggesting potential complexities with direct pip installation. Specific versions of PyTorch and CUDA are required.
3 months ago
Inactive
lucidrains