AutoSchemaKG by HKUST-KnowComp

Framework for autonomous knowledge graph construction

Created 7 months ago

657 stars

Top 51.1% on SourcePulse

Project Summary

AutoSchemaKG is a framework for automated knowledge graph (KG) construction from unstructured text, designed for researchers and developers needing to build KGs without predefined schemas. It addresses the challenges of KG creation by combining LLM-based triple extraction with schema induction, enabling zero-shot inferencing and achieving state-of-the-art performance on benchmarks.

How It Works

AutoSchemaKG employs a two-stage approach: first, it extracts entities and events as triples from text using Large Language Models (LLMs). Second, it induces a schema through conceptualization, creating semantic links between disparate information. This method allows for autonomous KG construction and generalization across domains.

Quick Start & Requirements

Install via pip: pip install atlas-rag
For NV-embed-v2 support: pip install atlas-rag[nvembed]
Requires Python and potentially specific versions of transformers (>=4.42.4, <=4.47.1).
PDF processing requires a separate environment with marker-pdf and google-genai.
See example notebooks for detailed usage: atlas_billion_kg_usage.ipynb, atlas_full_pipeline.ipynb, atlas_multihopqa.ipynb.

Highlighted Details

Implements the ATLAS family of KGs (ATLAS-Wiki, ATLAS-Pes2o, ATLAS-CC) with over 900M nodes and 5.9B edges.
Supports Retrieval Augmented Generation (RAG) over constructed KGs.
Includes modules for KG quality, factual consistency, and general task performance evaluation.
Offers PDF-to-Markdown conversion for KG construction.

Maintenance & Community

Project is actively updated, with recent changes including batch generation and PDF support.
Contact information for key contributors is provided.

Licensing & Compatibility

The repository does not explicitly state a license in the provided README.

Limitations & Caveats

PDF processing requires setting up a separate Conda environment due to dependency versioning.
The framework relies heavily on LLMs, which may introduce costs and potential biases.

Health Check

Last Commit

6 days ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

4

Star History

30 stars in the last 30 days

Explore Similar Projects

Starred by

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera).

zincbase by tomgrek

Deprecated knowledge base for graph-based reasoning

Created 6 years ago

Updated 4 years ago

automatic-KG-creation-with-LLM by fusion-jena

KG construction pipeline using LLMs

Created 1 year ago

Updated 8 months ago

Docs2KG by AI4WA

CLI tool for knowledge graph construction from documents

Created 1 year ago

Updated 7 months ago

ToG by DataArcTech

Research paper code for knowledge graph reasoning with LLMs

Created 2 years ago

Updated 1 year ago

KG-Pipeline by FareedKhan-dev

LLM-powered pipeline for text-to-knowledge graph conversion

Created 9 months ago

Updated 9 months ago

itext2kg by AuvaLab

Python package for incremental knowledge graph construction using LLMs

Created 1 year ago

Updated 2 months ago

goingmeta by jbarrasa

Code examples for knowledge graph and LLM integration

Created 4 years ago

Updated 2 days ago

Starred by

Ari Holtzman

Ari Holtzman(Coauthor of QLoRA; Professor at UChicago),

Julien Chaumond

Julien Chaumond(Cofounder of Hugging Face), and

1 more.

comet-commonsense by atcbosselut

Code for a commonsense knowledge graph construction research paper

Created 6 years ago

Updated 3 years ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems").

kg-gen by stair-lab

Knowledge graph generator for text analysis and RAG

Created 1 year ago

Updated 2 days ago

ai-knowledge-graph by robert-mcdermott

Knowledge graph generator from unstructured text

Created 9 months ago

Updated 2 weeks ago

openspg by OpenSPG

Knowledge graph engine for domain-specific applications

Created 2 years ago

Updated 6 months ago

Yuxi-Know by xerrors

RAG knowledge base and knowledge graph QA system

Created 1 year ago

Updated 2 days ago

Feedback? Help us improve.