On-premises tool for document information extraction and benchmarking
Top 27.2% on sourcepulse
This toolkit provides an on-premises solution for unstructured document information extraction and benchmarking, targeting developers and researchers in Intelligent Document Processing (IDP). It leverages Vision-Language Models (VLMs) to extract structured data, including tabular information, from documents like invoices and passports, offering an OCR-free approach.
How It Works
Docext utilizes VLMs to interpret document content and structure, enabling the extraction of both key fields and complex tables without relying on traditional OCR. This vision-language approach allows for more nuanced understanding of document layouts and content, facilitating accurate data extraction and providing confidence scores for predictions.
Quick Start & Requirements
pip install docext
Highlighted Details
Maintenance & Community
Developed by Nanonets, a company specializing in document AI. Contributions are welcomed via issues and pull requests.
Licensing & Compatibility
Licensed under the Apache License 2.0, permitting commercial use and integration with closed-source applications.
Limitations & Caveats
The toolkit is primarily focused on VLM-based extraction and does not include traditional OCR capabilities. Performance is dependent on the underlying VLM used and the complexity of the document structure.
1 month ago
Inactive