llmparser  by kyang6

LLM tool for structured data extraction and classification

created 2 years ago
425 stars

Top 70.5% on sourcepulse

GitHubView on GitHub
Project Summary

LLMParser is a tool for classifying text and extracting structured data using Large Language Models (LLMs), specifically addressing the challenge of reliably generating JSON output from LLMs. It is designed for developers and researchers who need to process unstructured text into a predictable format, offering a flexible solution for tasks like resume parsing, contract analysis, and sentiment classification.

How It Works

LLMParser enforces a consistent JSON input and output format for LLM interactions. Users define categories and fields with descriptions, which are then used to prompt the LLM. The library handles the LLM API calls and parses the response, aiming to ensure structured and reliable JSON output, even for complex extraction tasks.

Quick Start & Requirements

Highlighted Details

  • Enforces consistent JSON input/output for LLMs.
  • Supports classification and extraction of multiple fields.
  • Provides confidence scores for extracted data.
  • Example output demonstrates detailed field extraction with source attribution.

Maintenance & Community

  • Primarily maintained by kyang6.
  • No explicit community channels or roadmap links provided in the README.

Licensing & Compatibility

  • License: Not specified in the README.
  • Compatibility: Designed for server-side Node.js environments; client-side usage is discouraged due to API key exposure.

Limitations & Caveats

The library relies on external LLM providers (e.g., OpenAI) and requires an API key, incurring associated costs. The effectiveness and reliability of the extraction are dependent on the underlying LLM's capabilities and the quality of the provided category/field descriptions. The license is not specified, which may impact commercial use.

Health Check
Last commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
0 stars in the last 90 days

Explore Similar Projects

Starred by John Resig John Resig(Author of jQuery; Chief Software Architect at Khan Academy), Travis Fischer Travis Fischer(Founder of Agentic), and
1 more.

instructor-js by 567-labs

0%
738
Typescript tool for structured extraction from LLMs
created 1 year ago
updated 6 months ago
Feedback? Help us improve.