rowfill by harishdeivanayagam

Open-source platform for unstructured data processing

Created 8 months ago
363 stars

Top 77.3% on SourcePulse

Project Summary

Rowfill is an open-source platform designed to help knowledge workers extract, analyze, and process data from unstructured documents like PDFs and images using AI. It offers advanced OCR, auto-schema generation, and custom workflow capabilities, with a strong emphasis on privacy through local LLM support and secure data handling.

How It Works

Rowfill uses OCR and AI models to extract text, tables, and handwriting from various document formats. It infers each document's structure to generate a schema automatically, and users can then build custom workflows for their own data processing tasks. The platform supports local LLMs (e.g., Llama, Mistral), which keep processing on the user's own machine for privacy, as well as OpenAI vision models.
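To make the idea of auto-schema generation concrete, here is a minimal sketch in Python. It is not Rowfill's implementation: the function names `infer_type` and `infer_schema`, the sample field names, and the majority-vote rule are all assumptions chosen for illustration. It takes field/value pairs that an OCR step might have produced and guesses one coarse type per field.

```python
from collections import Counter
from datetime import datetime


def infer_type(value: str) -> str:
    """Guess a coarse type for one extracted string value (hypothetical helper)."""
    text = value.strip()
    if not text:
        return "null"
    try:
        int(text)
        return "integer"
    except ValueError:
        pass
    try:
        float(text)
        return "number"
    except ValueError:
        pass
    try:
        # Only ISO dates are recognised here; real documents need more formats.
        datetime.strptime(text, "%Y-%m-%d")
        return "date"
    except ValueError:
        return "string"


def infer_schema(records: list[dict[str, str]]) -> dict[str, str]:
    """Pick the most common non-null type per field across all records."""
    votes: dict[str, Counter] = {}
    for record in records:
        for field, value in record.items():
            votes.setdefault(field, Counter())[infer_type(value)] += 1
    schema = {}
    for field, counter in votes.items():
        non_null = {t: n for t, n in counter.items() if t != "null"}
        # Fall back to "string" when every value for the field was empty.
        schema[field] = max(non_null, key=non_null.get) if non_null else "string"
    return schema


# Illustrative records, as if extracted from two invoices.
records = [
    {"invoice_no": "1042", "total": "199.90", "issued": "2024-03-01", "vendor": "Acme"},
    {"invoice_no": "1043", "total": "75.00", "issued": "2024-03-04", "vendor": ""},
]
print(infer_schema(records))
```

A production system would likely rely on a model rather than fixed parsing rules, but the shape of the result, a mapping from field name to type, is what a downstream workflow would consume.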

Quick Start & Requirements

  • Install via Docker Compose.
  • Configure environment variables using the provided mockenv file.
  • Requires Docker.
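
Before starting the containers, it can help to confirm that every variable copied from the mockenv file has a value. The sketch below assumes the file uses plain KEY=VALUE lines; the variable names `EXAMPLE_DB_URL` and `EXAMPLE_API_KEY` and the helper names are hypothetical and do not come from the Rowfill repository.

```python
def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    values: dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        # Split on the first "=" only, so values may contain "=" themselves.
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def unset_keys(values: dict[str, str]) -> list[str]:
    """Return the names of variables that are present but empty."""
    return sorted(key for key, value in values.items() if not value)


# Hypothetical contents; the real mockenv file defines its own variables.
sample = """
# example settings
EXAMPLE_DB_URL=postgres://localhost:5432/app
EXAMPLE_API_KEY=
"""

print(unset_keys(parse_env(sample)))
```

To check a real file, read its contents into a string and pass that to `parse_env` in place of `sample`.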

Highlighted Details

  • Advanced OCR for text, tables, and handwriting extraction.
  • Auto-schema generation for document structure adaptation.
  • Support for local LLMs (Llama, Mistral) and OpenAI vision models.
  • Local data processing for enhanced privacy.

Maintenance & Community

  • Described as under active development and a work in progress, though the last commit was 6 months ago.
  • Community support via Discord and GitHub Issues.
  • Email support available.
  • A cloud version (Alpha) is live; contact for access.

Licensing & Compatibility

  • Licensed under AGPLv3.
  • AGPLv3 is a strong copyleft license: derivative works, including modified versions offered to users over a network, must be released with their source under AGPLv3. Commercial use is permitted, but integration into closed-source products requires careful consideration.

Limitations & Caveats

The project is explicitly described as a work in progress and is not yet ready for production use.

Health Check

  • Last commit: 6 months ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0

Star History

  • 7 stars in the last 30 days
