docext  by NanoNets

On-premises tool for document information extraction and benchmarking

Created 5 months ago
1,731 stars

Top 24.7% on SourcePulse

GitHubView on GitHub
Project Summary

This toolkit provides an on-premises solution for unstructured document information extraction and benchmarking, targeting developers and researchers in Intelligent Document Processing (IDP). It leverages Vision-Language Models (VLMs) to extract structured data, including tabular information, from documents like invoices and passports, offering an OCR-free approach.

How It Works

Docext utilizes VLMs to interpret document content and structure, enabling the extraction of both key fields and complex tables without relying on traditional OCR. This vision-language approach allows for more nuanced understanding of document layouts and content, facilitating accurate data extraction and providing confidence scores for predictions.

Quick Start & Requirements

  • Install: pip install docext
  • Prerequisites: Linux or macOS. Specific VLM dependencies are managed by the toolkit.
  • Resources: Requires sufficient hardware to run VLMs locally.
  • Docs: Full feature guide

Highlighted Details

  • On-premises deployment for data privacy and control.
  • Supports custom field definitions and pre-built templates for invoices and passports.
  • Includes a REST API for integration and multi-page document processing.
  • Features an Intelligent Document Processing Leaderboard for evaluating VLM performance across various IDP tasks.

Maintenance & Community

Developed by Nanonets, a company specializing in document AI. Contributions are welcomed via issues and pull requests.

Licensing & Compatibility

Licensed under the Apache License 2.0, permitting commercial use and integration with closed-source applications.

Limitations & Caveats

The toolkit is primarily focused on VLM-based extraction and does not include traditional OCR capabilities. Performance is dependent on the underlying VLM used and the complexity of the document structure.

Health Check
Last Commit

3 weeks ago

Responsiveness

1 day

Pull Requests (30d)
3
Issues (30d)
5
Star History
98 stars in the last 30 days

Explore Similar Projects

Starred by John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy), Sasha Rush Sasha Rush(Research Scientist at Cursor; Professor at Cornell Tech), and
2 more.

llmparser by kyang6

0%
426
LLM tool for structured data extraction and classification
Created 2 years ago
Updated 2 years ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Jerry Liu Jerry Liu(Cofounder of LlamaIndex), and
1 more.

sparrow by katanaml

0.1%
5k
Data processing & instruction calling tool using ML, LLM, and Vision LLM
Created 3 years ago
Updated 1 day ago
Feedback? Help us improve.