API engine for LLM-powered data extraction
Top 48.6% on sourcepulse
This project provides a framework for building and deploying AI-powered APIs that extract structured data from websites using natural language prompts. It targets developers and power users who need to quickly create custom data extraction endpoints without extensive coding, leveraging LLMs for schema generation and web scraping for data retrieval.
How It Works
The LLM API Engine utilizes a decoupled architecture. A Next.js frontend serves as the API builder interface, allowing users to describe data extraction needs in natural language. OpenAI's models automatically generate JSON schemas, and Firecrawl handles intelligent web scraping. Configurations are stored in Redis, enabling flexible deployment of consumable API endpoints on platforms like Cloudflare Workers, Vercel Edge Functions, or AWS Lambda.
Quick Start & Requirements
npm install
npm run dev
http://localhost:3000
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
CRON functionality for scheduled scraping is listed as "In Progress" and not yet implemented. Detailed implementation guides for various CRON options are also pending.
6 months ago
Inactive