Schema-based-Knowledge-Extraction  by yuanxiaosc

SPO triples extraction for knowledge graphs

created 6 years ago
287 stars

Top 92.3% on sourcepulse

GitHubView on GitHub
Project Summary

This repository provides an end-to-end joint model for schema-based information extraction, specifically addressing the task of extracting Subject-Predicate-Object (SPO) triples from Chinese text under given schema constraints. It is designed for researchers and practitioners in Natural Language Processing (NLP) and Artificial Intelligence (AI) aiming to advance Chinese information extraction capabilities.

How It Works

The project utilizes a BERT-based approach for joint entity and relation extraction. This end-to-end model processes sentences and schema constraints to directly output SPO triples that conform to the specified schema types for subjects and objects. This integrated approach aims for greater efficiency and accuracy compared to pipeline methods that handle entity recognition and relation extraction separately.

Quick Start & Requirements

Highlighted Details

  • Implements an end-to-end joint model for schema-based SPO extraction.
  • Leverages BERT for enhanced contextual understanding.
  • Addresses a large-scale Chinese information extraction dataset (SKE).
  • Focuses on a specific task defined by schema constraints for SPO triples.

Maintenance & Community

The project appears to be associated with the CCF LIC 2019 competition. Further community or maintenance activity is not explicitly detailed in the README.

Licensing & Compatibility

The repository's license is not specified in the provided README.

Limitations & Caveats

The README focuses on the competition task and dataset, with no explicit mention of model performance benchmarks, limitations, or potential issues with the implementation. The primary focus is on Chinese text.

Health Check
Last commit

6 years ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
1 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.