No-code platform for structured document extraction via LLMs
Top 9.3% on sourcepulse
Unstract is a no-code platform designed for efficiently structuring unstructured documents using Large Language Models (LLMs). It empowers users to build and deploy APIs or ETL pipelines for data extraction, targeting developers and business users seeking to automate document processing workflows.
How It Works
Unstract utilizes a three-step "nirvana" process: users first engineer prompts in a dedicated "Prompt Studio" to extract desired fields from documents. This studio provides an integrated environment for testing prompts with various document samples, LLM outputs, and schema development tools. Subsequently, the configured Prompt Studio project can be deployed as a standalone API or integrated into an ETL pipeline with specified input and output sources. Finally, these workflows are deployed, enabling automated data structuring.
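The prompt-per-field idea behind the first step can be illustrated with a small conceptual sketch. This is not Unstract's API: the names `FIELD_PROMPTS`, `extract_fields`, and `fake_llm` are hypothetical, and the model call is replaced by a stub so the example runs on its own.

```python
from typing import Callable, Dict

# Hypothetical field names and prompts, of the kind one might draft
# while engineering prompts against sample documents.
FIELD_PROMPTS: Dict[str, str] = {
    "invoice_number": "What is the invoice number?",
    "total_amount": "What is the total amount due?",
}


def extract_fields(
    document_text: str,
    prompts: Dict[str, str],
    ask_llm: Callable[[str, str], str],
) -> Dict[str, str]:
    """Ask one prompt per field and collect the answers into a flat record."""
    return {
        field: ask_llm(prompt, document_text).strip()
        for field, prompt in prompts.items()
    }


def fake_llm(prompt: str, document_text: str) -> str:
    """Stand-in for a real model call: returns canned answers by keyword."""
    if "invoice number" in prompt:
        return " INV-001 "
    return "42.00"


record = extract_fields(
    "Invoice INV-001. Total due: 42.00", FIELD_PROMPTS, fake_llm
)
print(record)
```

Swapping `fake_llm` for a real model client would leave `extract_fields` unchanged, which is roughly why prompts can be iterated on separately from deployment.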
Quick Start & Requirements
Run ./run-platform.sh.
Visit http://frontend.unstract.localhost and log in with username unstract and password unstract.
Highlighted Details
Maintenance & Community
See CONTRIBUTING.md.
Licensing & Compatibility
Limitations & Caveats
ENCRYPTION_KEY is critical; its loss or change will render existing adapters inaccessible.
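One way to guard against an accidental key change is to record a fingerprint of the key and compare it before each start. The sketch below assumes the key is stored as an `ENCRYPTION_KEY=...` line in a dotenv-style file; that layout and the helper names `read_env_value`, `key_fingerprint`, and `key_unchanged` are assumptions for illustration, not part of Unstract.

```python
import hashlib
from typing import Optional


def read_env_value(env_text: str, name: str) -> Optional[str]:
    """Return the value of NAME from dotenv-style text, or None if absent."""
    for raw_line in env_text.splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        if key.strip() == name:
            return value.strip().strip('"').strip("'")
    return None


def key_fingerprint(key: str) -> str:
    """Short SHA-256 fingerprint, safe to store next to a backup note."""
    return hashlib.sha256(key.encode("utf-8")).hexdigest()[:16]


def key_unchanged(env_text: str, recorded_fingerprint: str) -> bool:
    """True only if the key is present and matches the recorded fingerprint."""
    key = read_env_value(env_text, "ENCRYPTION_KEY")
    return key is not None and key_fingerprint(key) == recorded_fingerprint
```

The fingerprint only detects a change; it cannot recover a lost key, so the key itself still needs a secure backup.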