ChatIE  by cocacola-lab

Chatbot for zero-shot information extraction research

Created 2 years ago
825 stars

Top 43.0% on SourcePulse

GitHubView on GitHub
Project Summary

ChatIE is an open-source tool for zero-shot information extraction (IE) using large language models, specifically ChatGPT. It targets researchers and developers needing to extract structured data like entities, relations, and events from unannotated text with minimal human intervention, offering a cost-effective alternative to traditional supervised IE methods.

How It Works

ChatIE transforms zero-shot IE into a multi-turn question-answering problem within a two-stage framework. By leveraging ChatGPT's capabilities and carefully crafted prompts, it directly extracts structured information from raw sentences. This approach bypasses the need for extensive data labeling, enabling rapid development and deployment of IE systems, even for complex tasks like entity-relation triple extraction, named entity recognition, and event extraction.

Quick Start & Requirements

  • Installation: Clone the repository. Navigate to front-end and run npm install then npm run start. Navigate to back-end and run python run.py.
  • Prerequisites: Node.js v14.17.4, npm v9.6.0. Requires an OpenAI API key (default key is provided but may be overloaded).
  • Setup: Local setup involves frontend and backend service startup.
  • Docs: arXiv preprint

Highlighted Details

  • Supports Entity-Relation (RE), Named Entity Recognition (NER), and Event Extraction (EE) tasks.
  • Handles both Chinese and English languages.
  • Claims to surpass some full-shot models on specific datasets (e.g., NYT11-HRL).
  • Provides default relation/entity/event type lists, with options for customization.

Maintenance & Community

The project is associated with the paper "Zero-Shot Information Extraction via Chatting with ChatGPT" (arXiv:2302.10205). The README indicates active updates are planned. No specific community channels (Discord/Slack) are listed.

Licensing & Compatibility

The repository is presented as open-source, but the specific license is not explicitly stated in the README. The use of OpenAI's API implies adherence to their terms of service.

Limitations & Caveats

The online demo is temporarily unavailable due to cost. The response speed and reliability are dependent on the official OpenAI ChatGPT API, which can be slow or overloaded. Using the default OpenAI key is discouraged due to potential overload from multiple users.

Health Check
Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
1 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.