Attribute extraction using BERT for knowledge graphs
Top 97.5% on sourcepulse
This repository provides a solution for attribute extraction in knowledge graphs, specifically targeting character entries from Baidu Encyclopedia. It offers two primary methods, fine-tuning and feature extraction, leveraging BERT models for improved accuracy. The target audience includes researchers and developers working on knowledge graph construction and information extraction.
How It Works
The project utilizes BERT (Bidirectional Encoder Representations from Transformers) for attribute extraction. It offers two distinct approaches: fine-tuning, where a pre-trained BERT model is further trained on a specific dataset for the extraction task, and feature extraction, where BERT generates vector representations of text, which are then used with traditional machine learning classifiers (like MLP) for attribute prediction. This dual approach allows for flexibility depending on computational resources and desired performance.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The dataset labeling is noted as potentially imperfect due to manual annotation. The project relies on specific pre-trained BERT model checkpoints and requires manual download. There are no explicit test execution instructions or automated testing frameworks mentioned.
6 years ago
Inactive