KG construction pipeline using LLMs
Top 96.2% on sourcepulse
This repository provides a pipeline for the semi-automatic construction of Ontologies and Knowledge Graphs (KGs) using Large Language Models (LLMs). It targets researchers and practitioners in knowledge representation and AI, offering a reproducible framework to generate KGs from scholarly publications with minimal human expert intervention.
How It Works
The approach leverages LLMs (Mixtral 8x22B, GPT-4o, GPT-3.5, Gemini) to automate KG creation. The pipeline begins with formulating competency questions (CQs), then uses these CQs to guide the development of an ontology (TBox). This ontology is subsequently employed to construct the KG from source documents, with evaluation metrics provided for assessing the output.
Quick Start & Requirements
pip install -r requirements.txt
helper_functions.py
.main.py
, configuring with config.ini
.Highlighted Details
Maintenance & Community
The project is associated with research publications, with citations provided for different versions. No specific community channels (e.g., Discord, Slack) are mentioned in the README.
Licensing & Compatibility
Licensed under Apache License 2.0. This license is permissive and generally compatible with commercial use and closed-source linking.
Limitations & Caveats
The README indicates the code was tested on Python 3.10.16, suggesting potential compatibility issues with other Python versions. The "semi-automatic" nature implies some level of human input or oversight may still be required.
3 months ago
1 week