FastAPI web server for LLM-powered data extraction
Top 34.3% on sourcepulse
This project provides a FastAPI web server for extracting structured information from text and files using Large Language Models (LLMs). It's designed as a reference implementation and starting point for developers building custom data extraction applications, offering a REST API, JSON schema definition for extraction targets, and support for few-shot examples to improve accuracy.
How It Works
The server leverages LangChain for LLM orchestration and FastAPI for its web framework. Extraction logic is defined via JSON schemas, allowing users to specify the desired output structure. The system supports incorporating few-shot examples, provided via a separate API endpoint, to guide the LLM and enhance the quality of extracted results. It stores extractors and examples in a PostgreSQL database.
Quick Start & Requirements
docker compose build
and docker compose up
..local.env
.Highlighted Details
Maintenance & Community
This project is under active development by LangChain AI. While pull requests are not currently accepted, feedback via issues and discussions is encouraged.
Licensing & Compatibility
The repository does not explicitly state a license in the provided README. This requires further investigation for commercial use or closed-source linking.
Limitations & Caveats
The project is under active development, with breaking changes expected between releases. The main
branch should not be used directly; checkout releases instead. User authentication is not implemented, with access controlled by a user ID generated via uuidgen
.
1 month ago
1 day