GenRead by wyu97

Research paper code for context generation using LLMs

Created 3 years ago

289 stars

Top 91.2% on SourcePulse

View on GitHub

1 Expert Loves This Project

Jeff Hammerbacher

Cofounder of Cloudera

Project Summary

This repository provides the official implementation for the "Generate rather than Retrieve" (GenRead) paper, which explores using large language models (LLMs) as strong context generators for question answering. It targets researchers and practitioners in NLP and LLM applications, offering a novel approach to knowledge retrieval by generating relevant context instead of traditional retrieval methods.

How It Works

GenRead frames question answering as a context generation task. Instead of retrieving existing documents, it leverages LLMs to generate relevant background documents that contain the answer. The process involves two main steps: generating candidate documents using an LLM (either zero-shot or supervised with sampling/clustering) and then inferring the answer from these generated documents. This approach aims to overcome limitations of traditional retrieval systems by creating context tailored to the query.

Quick Start & Requirements

Install: pip install openai
Prerequisites: OpenAI API key (set in inference.py), Python 3.x. Datasets (NQ, TriviaQA, WebQ, FM2, FEVER, Wizard) need to be downloaded and placed in the indataset folder.
Official Docs/Demo: OpenReview, arXiv, FiD GitHub

Highlighted Details

Zero-shot generation using text-davinci-002 with greedy search for reproducibility.
Supervised generation supports sampling (multiple documents) and clustering (diverse documents).
Offers pre-trained Fusion-in-Decoder (FiD) reader models (GenRead-3B-NQ, GenRead-3B-TQA) with reported performance metrics.

Maintenance & Community

The project is associated with ICLR 2023. Contact information for checkpoint requests is provided (wyu1@nd.edu).

Licensing & Compatibility

The repository does not explicitly state a license. The use of OpenAI's API implies adherence to their terms of service. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The project relies heavily on the OpenAI API, which incurs costs and is subject to API availability. Supervised generation methods may produce non-deterministic outputs. The Fusion-in-Decoder models are provided as separate checkpoints requiring separate download and integration.

Health Check

Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)

Issues (30d)

Star History

1 stars in the last 30 days