Relation extraction system using distant supervision
Top 71.1% on sourcepulse
This repository provides a system for sentence-level relation extraction using distant supervision, targeting researchers and practitioners in Natural Language Processing. It offers implementations of recent models and processed datasets, enabling the identification of relationships between entity pairs within text.
How It Works
The system leverages distant supervision, automatically labeling entity pairs in a corpus based on existing knowledge bases. It processes raw text, identifies entity mentions, maps them to knowledge base entities, and aligns facts to sentences. The core approach involves learning representations that capture contextual information and relation types, with specific models like CoType (joint extraction of typed entities and relations) and various LSTM/GRU-based architectures implemented.
Quick Start & Requirements
pip install pexpect ujson tqdm
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project specifies Python 2.7, which is end-of-life. Stanford CoreNLP 3.7.0 is also an older version. The lack of an explicit license may pose issues for commercial adoption.
5 years ago
1 week