SPO triples extraction for knowledge graphs
Top 92.3% on sourcepulse
This repository provides an end-to-end joint model for schema-based information extraction, specifically addressing the task of extracting Subject-Predicate-Object (SPO) triples from Chinese text under given schema constraints. It is designed for researchers and practitioners in Natural Language Processing (NLP) and Artificial Intelligence (AI) aiming to advance Chinese information extraction capabilities.
How It Works
The project utilizes a BERT-based approach for joint entity and relation extraction. This end-to-end model processes sentences and schema constraints to directly output SPO triples that conform to the specified schema types for subjects and objects. This integrated approach aims for greater efficiency and accuracy compared to pipeline methods that handle entity recognition and relation extraction separately.
Quick Start & Requirements
pip install -r requirements.txt
Highlighted Details
Maintenance & Community
The project appears to be associated with the CCF LIC 2019 competition. Further community or maintenance activity is not explicitly detailed in the README.
Licensing & Compatibility
The repository's license is not specified in the provided README.
Limitations & Caveats
The README focuses on the competition task and dataset, with no explicit mention of model performance benchmarks, limitations, or potential issues with the implementation. The primary focus is on Chinese text.
6 years ago
1 week