easydoc by easydoc-ai

Multimodal document processing API for LLM pipelines

Created 8 months ago
1,114 stars

Top 34.4% on SourcePulse

Project Summary

EasyDoc provides a multimodal document processing API that converts unstructured documents into structured, hierarchical JSON. It targets developers building AI and LLM applications: by identifying content blocks and extracting semantic information, it supplies enriched data for model inference and fine-tuning and helps make better use of limited context windows.

How It Works

EasyDoc employs AI to parse documents, moving beyond simple text extraction to identify meaningful content blocks and reconstruct document hierarchies. It supports multimodal content, converting tables, figures, and visual data into machine-readable JSON, enabling LLMs to process complex information more effectively.
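To make the idea of hierarchical, block-level JSON concrete, here is a minimal sketch of how a downstream pipeline might flatten such output into LLM-ready chunks. The JSON shape (`type`, `text`, `rows`, `children`) and the helper names `flatten` and `render` are assumptions made for illustration only; the source does not document EasyDoc's actual response schema.

```python
import json

# Hypothetical parser output. The field names here are illustrative
# assumptions, not EasyDoc's documented schema.
SAMPLE = json.loads("""
{
  "type": "document",
  "children": [
    {"type": "heading", "text": "1. Results", "children": [
      {"type": "paragraph", "text": "Revenue grew in Q2."},
      {"type": "table",
       "rows": [["Quarter", "Revenue"], ["Q1", "10"], ["Q2", "14"]]}
    ]}
  ]
}
""")


def flatten(node, path=()):
    """Yield (heading_path, block) pairs in document order."""
    kind = node.get("type")
    if kind == "heading":
        # Headings extend the section path for everything nested below them.
        path = path + (node["text"],)
    elif kind != "document":
        # Any other non-root node is a content block worth emitting.
        yield path, node
    for child in node.get("children", []):
        yield from flatten(child, path)


def render(block):
    """Turn one content block into plain text for an LLM prompt."""
    if block["type"] == "table":
        return "\n".join(" | ".join(row) for row in block["rows"])
    return block.get("text", "")


chunks = [
    {"section": " > ".join(path), "kind": block["type"], "text": render(block)}
    for path, block in flatten(SAMPLE)
]

for chunk in chunks:
    print(chunk)
```

Keeping the heading path alongside each block is one way to preserve the reconstructed hierarchy after chunking, so a retrieved table or paragraph still carries the section it came from.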

Quick Start & Requirements

Highlighted Details

  • Content block identification for meaningful data grouping.
  • Semantic extraction to reconstruct document hierarchies.
  • Multimodal parsing of tables, figures, and visual data into JSON.
  • Offers "Lite" and "Pro" modes, with a "Premium" beta.

Maintenance & Community

Licensing & Compatibility

  • Licensing: Not specified in the README.
  • Compatibility: REST API is language-agnostic.

Limitations & Caveats

The "Premium" tier is in beta, and API keys for it require joining a waitlist. Pricing is charged per 1,000 pages, at different rates for the Lite and Pro modes.
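As a rough illustration of per-1,000-page pricing, the sketch below estimates the cost of a job. The function name `estimate_cost`, the rate used in the example, and the assumption that partial blocks of 1,000 pages are prorated rather than rounded up are all hypothetical; the source does not state the actual rates or the rounding policy.

```python
from decimal import Decimal


def estimate_cost(pages: int, rate_per_1000: Decimal) -> Decimal:
    """Estimate cost, assuming (hypothetically) prorated per-1,000-page billing."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    cost = Decimal(pages) / Decimal(1000) * rate_per_1000
    # Round to cents for display.
    return cost.quantize(Decimal("0.01"))


# Placeholder rate for illustration only, not an actual EasyDoc price.
example_rate = Decimal("4.00")
print(estimate_cost(2500, example_rate))
```

If the provider bills whole blocks of 1,000 pages instead, the division would need to round up before multiplying.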

Health Check

  • Last commit: 2 months ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 9 stars in the last 30 days
