Research paper code for named entity recognition via unified MRC framework
Top 51.1% on sourcepulse
This repository provides the implementation for a unified MRC (Machine Reading Comprehension) framework for Named Entity Recognition (NER), addressing both flat and nested entity extraction. It is targeted at NLP researchers and practitioners seeking to leverage MRC for improved NER performance. The framework offers a novel approach to NER by reformulating it as a question-answering task.
How It Works
The framework treats NER as a reading comprehension problem, where the model answers questions about text to identify entities. For flat NER, it uses a span-extraction approach. For nested NER, it employs a more complex MRC formulation to handle entities that contain other entities. This MRC-based approach allows for a more flexible and potentially more accurate extraction of entities compared to traditional sequence labeling methods.
Quick Start & Requirements
pip install -r requirements.txt
(after installing PyTorch with CUDA 10.1 support if applicable: pip install torch==1.7.1+cu101 torchvision==0.8.2+cu101 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html
).DATA_DIR
, BERT_DIR
, and OUTPUT_DIR
to be configured.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
2 years ago
Inactive