GPT-NER by ShuheWang1998

NER research paper using GPT models

Created 2 years ago

273 stars

Top 94.7% on SourcePulse

Project Summary

This repository provides code and results for GPT-NER, a method for Named Entity Recognition (NER) leveraging Large Language Models (LLMs). It targets researchers and practitioners in NLP seeking to apply LLMs to NER tasks, offering a framework for few-shot and zero-shot NER with GPT-3, demonstrating competitive performance against supervised baselines on standard datasets.

How It Works

GPT-NER utilizes GPT-3 for NER by framing the task as a generation problem. It explores different retrieval strategies for providing context to the LLM: random retrieval, sentence-level embeddings (using SimCSE), and entity-level embeddings. The approach aims to enhance NER performance by effectively guiding the LLM with relevant examples, particularly in few-shot scenarios.

Quick Start & Requirements

Install: pip install openai==0.27.2 simcse==0.4
Prerequisites: Python >= 3.7.3, OpenAI API key (set as environment variable OPENAI_API_KEY).
Data: MRC-NER dataset (full) or sampled 100-dataset (Google Drive link provided).
SimCSE Model: sup-simcse-roberta-large (link provided).
Usage: Scripts are located in openai_access/scripts/. Refer to openai_access/get_results_mrc_knn.py and openai_access/verify_results.py for argument details.
Evaluation: Use openai_access/scripts/compute_f1.sh.

Highlighted Details

Demonstrates GPT-3 performance on Flat NER (CoNLL2003, OntoNotes5.0) and Nested NER (ACE2004, ACE2005, GENIA) datasets.
Achieves competitive results, particularly with entity-level embedding retrieval, outperforming some supervised methods on sampled data.
Includes self-verification scripts for zero-shot and few-shot evaluations.
Code is structured around OpenAI API access and SimCSE for embeddings.

Maintenance & Community

Primary contact: wangshuhe@stu.pku.edu.cn.
The project is associated with a research paper published on arXiv.

Licensing & Compatibility

The repository itself does not explicitly state a license.
Usage of OpenAI's GPT-3 requires adherence to OpenAI's terms of service and API usage policies.

Limitations & Caveats

Reliance on the OpenAI API means costs are associated with usage, and access is subject to OpenAI's availability and policies.
Performance is heavily dependent on the quality of retrieved examples and the chosen embedding strategy.
The README mentions that accessing GPT-3 can be expensive, advising users to start with the sampled dataset.

Health Check

Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

2 stars in the last 30 days

Explore Similar Projects

Starred by

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera).

prodigy-openai-recipes by explosion

Prodigy recipes for zero/few-shot learning via OpenAI GPT-3

Created 3 years ago

Updated 2 years ago

universal-ner by universal-ner

NER research paper using LLMs for targeted distillation

Created 2 years ago

Updated 2 years ago

fancy-nlp by boat-group

NLP toolkit for rapid prototyping and deployment

Created 6 years ago

Updated 3 years ago

gpqa by idavidrein

Benchmark for graduate-level, Google-proof question answering

Created 3 years ago

Updated 1 year ago

Starred by

Wing Lian

Wing Lian(Founder of Axolotl AI) and

Shizhe Diao

Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA).

zero_shot_cot by kojima-takeshi188

Reasoning framework for LLMs, based on a NeurIPS 2022 paper

Created 3 years ago

Updated 2 years ago

NLPGNN by kyzhouhzau

NLP/GNN toolbox for TensorFlow 2.0 implementing various models

Created 5 years ago

Updated 1 year ago

NER-BERT-pytorch by lemonhu

PyTorch solution for named entity recognition

Created 7 years ago

Updated 2 years ago

Starred by

Shizhe Diao

Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA).

Yuan-1.0 by Shawn-IEITSystems

Large language model for NLP tasks

Created 4 years ago

Updated 1 year ago

Starred by

Omar Sanseviero

Omar Sanseviero(DevRel at Google DeepMind),

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera), and

1 more.

allennlp-models by allenai

NLP models library for various NLP tasks

Created 5 years ago

Updated 3 years ago

bert_seq2seq by 920232796

PyTorch toolkit for sequence-to-sequence and other NLP tasks

Created 5 years ago

Updated 3 years ago

Starred by

Kaichao You

Kaichao You(Core Maintainer of vLLM).

BERT-NER by kyzhouhzau

BERT fine-tuning for named entity recognition

Created 7 years ago

Updated 3 years ago

Starred by

Aravind Srinivas

Aravind Srinivas(Cofounder of Perplexity),

Jasper Zhang

Jasper Zhang(Cofounder of Hyperbolic), and

21 more.

lm-evaluation-harness by EleutherAI

Framework for few-shot language model evaluation

Created 5 years ago

Updated 4 days ago

Feedback? Help us improve.