Discover and explore top open-source AI tools and projects—updated daily.
Generalist model for document image restoration
Top 61.9% on SourcePulse
DocRes is a generalist model designed to unify various document image restoration tasks, including dewarping, deshadowing, appearance correction, deblurring, and binarization. It aims to provide a single, versatile solution for improving the quality of scanned or photographed documents, benefiting researchers and practitioners in document analysis and computer vision.
How It Works
DocRes employs a unified architecture that can handle multiple restoration tasks. The model leverages a combination of techniques, likely including diffusion models or similar generative approaches, to reconstruct degraded document images. This unified approach allows for efficient training and inference across different restoration needs, avoiding the need for task-specific models.
Quick Start & Requirements
mbd.pkl
, docres.pkl
) in ./data/MBD/checkpoint/
and ./checkpoints/
respectively. Run python inference.py --im_path <path_to_image> --task <dewarping|deshadowing|appearance|deblurring|binarization|end2end>
.Highlighted Details
realdae
, tdd
, dibco18
, etc.Maintenance & Community
The project is associated with authors from CVPR 2024 and includes recent updates evaluating generative models like GPT-4o for document processing tasks. Links to arXiv papers and IJCV 2025 work (LGGPT) are provided, indicating active research in related areas.
Licensing & Compatibility
The repository does not explicitly state a license. Users should verify licensing terms for commercial use or integration into closed-source projects.
Limitations & Caveats
The README focuses on inference and evaluation setup, with detailed dataset preparation and training instructions requiring further exploration within the repository. Specific hardware requirements (e.g., GPU) are not explicitly mentioned but are implied for efficient operation.
1 month ago
Inactive