Python package for incremental knowledge graph construction using LLMs
Top 45.7% on sourcepulse
iText2KG is a Python package for incrementally constructing knowledge graphs from text documents using large language models. It targets researchers and developers needing to extract and structure information, offering zero-shot entity and relation extraction, entity disambiguation, and integration with Neo4j for visualization. The primary benefit is automated, consistent knowledge graph creation from diverse text sources.
How It Works
iText2KG employs a modular architecture: a Document Distiller reformulates text into semantic blocks based on user-defined schemas, improving signal-to-noise. An Incremental Entity Extractor identifies and resolves unique entities using cosine similarity for disambiguation. An Incremental Relation Extractor identifies relationships between entities. Finally, a Graph Integrator populates a Neo4j database, enabling visualization. This approach leverages LLMs for extraction and LangChain for model compatibility, with recent updates focusing on mitigating LLM hallucinations and enhancing entity embedding with configurable weights for name and label.
Quick Start & Requirements
pip install itext2kg
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
3 days ago
1 day