Python package for LLM-based structured information extraction
Top 49.4% on sourcepulse
OntoGPT is a Python package designed for extracting structured information from text using Large Language Models (LLMs) and ontology-based grounding. It targets researchers and developers working with knowledge bases, aiming to streamline the process of populating them with data extracted from unstructured text.
How It Works
OntoGPT leverages LLMs through the litellm
package, supporting a wide array of APIs including OpenAI, Azure, Anthropic, and Mistral. Its core methodology, SPIRES (Structured Prompt Interrogation and Recursive Extraction of Semantics), focuses on grounding LLM outputs with ontologies. This approach allows for more accurate and structured information extraction, enabling the population of knowledge bases with semantically rich data.
Quick Start & Requirements
pip install ontogpt
.runoak set-apikey -e openai <your openai api key>
.litellm
, requiring API key configuration for each.pip install ontogpt[web]
.ollama pull llama3
).Highlighted Details
litellm
, including local models via Ollama.Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The web application is described as "bare bones" and not recommended for public hosting without authentication. The licensing status is unclear, which may impact commercial adoption.
5 days ago
1 day