itext2kg by AuvaLab

Python package for incremental knowledge graph construction using LLMs

created 1 year ago
779 stars

Top 45.7% on sourcepulse

Project Summary

iText2KG is a Python package for incrementally constructing knowledge graphs from text documents using large language models. It targets researchers and developers needing to extract and structure information, offering zero-shot entity and relation extraction, entity disambiguation, and integration with Neo4j for visualization. The primary benefit is automated, consistent knowledge graph creation from diverse text sources.

How It Works

iText2KG employs a modular architecture: a Document Distiller reformulates raw text into semantic blocks based on user-defined schemas, improving the signal-to-noise ratio. An Incremental Entity Extractor identifies unique entities and resolves duplicates using cosine similarity for disambiguation. An Incremental Relation Extractor then identifies relationships between those entities. Finally, a Graph Integrator populates a Neo4j database, enabling visualization. The pipeline leverages LLMs for extraction and LangChain for model compatibility; recent updates focus on mitigating LLM hallucinations and on entity embeddings that combine name and label vectors with configurable weights.
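The incremental entity-resolution step described above can be illustrated with a small, self-contained sketch. Toy 2-D vectors stand in for real embeddings, and the function names, the 0.6/0.4 name-vs-label weights, and the 0.95 threshold are illustrative assumptions, not the package's actual API:

```python
import math

def cosine(u, v):
    # Standard cosine similarity between two vectors
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def entity_similarity(e1, e2, name_weight=0.6, label_weight=0.4):
    # Weighted combination of name-embedding and label-embedding similarity
    return (name_weight * cosine(e1["name_emb"], e2["name_emb"])
            + label_weight * cosine(e1["label_emb"], e2["label_emb"]))

def merge_incrementally(existing, new_entities, threshold=0.95):
    # Incremental construction: compare each new entity against the
    # already-resolved set; below-threshold entities are added as new,
    # above-threshold ones are treated as duplicates of a known entity.
    resolved = list(existing)
    for ent in new_entities:
        best_sim = max((entity_similarity(ent, known) for known in resolved),
                       default=0.0)
        if best_sim < threshold:
            resolved.append(ent)  # genuinely new entity
    return resolved

existing = [{"name": "Paris", "name_emb": [1, 0], "label_emb": [1, 0]}]
new = [
    {"name": "paris", "name_emb": [0.99, 0.01], "label_emb": [1, 0]},  # duplicate
    {"name": "Lyon", "name_emb": [0, 1], "label_emb": [1, 0]},         # new entity
]
merged = merge_incrementally(existing, new)
# "paris" resolves to the existing "Paris"; "Lyon" is added as new
```

In the real package the embeddings come from a LangChain embedding model, but the merge logic follows the same compare-then-add pattern.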

Quick Start & Requirements

  • Install via pip: pip install itext2kg
  • Requires Python 3.9+.
  • Compatible with all LangChain-supported chat and embedding models (e.g., Mistral, OpenAI).
  • Requires Neo4j for graph visualization.
  • Example usage with Mistral or OpenAI models is provided.

Highlighted Details

  • Zero-shot entity and relation extraction across domains.
  • Incremental KG construction and updates.
  • Entity disambiguation using embedded names and labels (configurable weights).
  • Mitigation strategies for LLM hallucination (entity replacement, re-prompting).
  • Integration with Neo4j for visualization.
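The entity-replacement strategy for hallucination mitigation can be sketched as follows. This is a simplified stand-in: string similarity via `difflib` replaces the package's embedding-based matching, and the 0.8 cutoff is an illustrative assumption. Triples whose endpoints cannot be grounded are dropped here; re-prompting the LLM is the alternative the project mentions:

```python
import difflib

def repair_relations(relations, known_entities, cutoff=0.8):
    """Replace hallucinated entity mentions in (head, relation, tail)
    triples with the closest known entity name; drop triples that
    cannot be grounded in the resolved entity set."""
    repaired = []
    for head, rel, tail in relations:
        fixed = []
        for name in (head, tail):
            if name in known_entities:
                fixed.append(name)
                continue
            # Entity replacement: snap the mention to its nearest known entity
            match = difflib.get_close_matches(name, known_entities,
                                              n=1, cutoff=cutoff)
            if not match:
                fixed = None  # ungroundable endpoint: discard the triple
                break
            fixed.append(match[0])
        if fixed:
            repaired.append((fixed[0], rel, fixed[1]))
    return repaired

known = ["Marie Curie", "Radium", "Sorbonne"]
relations = [
    ("Marie Curei", "discovered", "Radium"),   # typo: snapped to "Marie Curie"
    ("Marie Curie", "worked_at", "Sorbone"),   # typo: snapped to "Sorbonne"
    ("Einstein", "discovered", "Radium"),      # hallucinated entity: dropped
]
repaired = repair_relations(relations, known)
```

The key invariant is that every relation endpoint in the final graph references a resolved entity, which keeps incremental updates consistent.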

Maintenance & Community

  • Accepted at WISE 2024.
  • Open to community contributions.
  • Citation provided for the associated arXiv preprint.

Licensing & Compatibility

  • No explicit license mentioned in the README.
  • Compatible with commercial LLM APIs (OpenAI, Mistral) and open-source models via LangChain.

Limitations & Caveats

  • The README does not specify a license, which may impact commercial use.
  • While designed to mitigate hallucinations, LLM-generated content inherently carries a risk of inaccuracies.
  • Performance and accuracy are dependent on the chosen LLM and embedding models.

Health Check

  • Last commit: 3 days ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 10
  • Star History: 83 stars in the last 90 days
