Hallucination detection research paper for generative LLMs using black-box methods
SelfCheckGPT provides zero-resource, black-box hallucination detection for generative LLMs. It's designed for researchers and developers evaluating LLM outputs, offering sentence-level consistency scores without needing access to the LLM's internal workings or training data.
How It Works
The library implements several variants of the self-check approach: BERTScore, Question-Answering (MQAG), n-gram, NLI, and LLM-Prompting. These methods compare a generated passage against multiple sampled variations of the same passage. For instance, BERTScore measures semantic similarity, MQAG generates and answers questions about the text, n-gram checks for distributional shifts, NLI assesses entailment/contradiction between sentences and samples, and LLM-Prompting uses another LLM to judge consistency. This ensemble of techniques allows for robust hallucination detection by leveraging different linguistic and semantic signals.
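The core idea behind all variants can be illustrated with a toy example (this is not the library's code): a sentence that is supported by independently sampled passages scores low, while an unsupported sentence scores high. Here simple unigram overlap stands in for the real signals (BERTScore, QA, n-gram LMs, NLI, or LLM prompting); the function name and scoring scheme are illustrative only.

```python
import re

def inconsistency_score(sentence: str, sampled_passages: list[str]) -> float:
    """Toy sketch of the self-check idea: return a score in [0, 1],
    where higher means the sentence is less supported by the samples.
    Uses unigram overlap as a stand-in for SelfCheckGPT's real signals."""
    words = set(re.findall(r"\w+", sentence.lower()))
    if not words or not sampled_passages:
        return 0.0
    overlaps = []
    for passage in sampled_passages:
        passage_words = set(re.findall(r"\w+", passage.lower()))
        # Fraction of the sentence's words supported by this sample.
        overlaps.append(len(words & passage_words) / len(words))
    # Average support across samples, inverted into an inconsistency score.
    return 1.0 - sum(overlaps) / len(overlaps)

samples = [
    "Paris is the capital of France.",
    "The capital city of France is Paris.",
]
print(inconsistency_score("Paris is the capital of France.", samples))   # 0.0
print(round(inconsistency_score("Paris was founded on the moon.", samples), 2))  # 0.67
```

In the library itself, each variant replaces the overlap measure with a stronger consistency signal but keeps the same shape: score each sentence of the main passage against N sampled passages.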
Quick Start & Requirements
Install with pip install selfcheckgpt. The library requires torch and spacy. Download a spaCy model (e.g., python -m spacy download en_core_web_sm).
Highlighted Details
gpt-3.5-turbo achieved the highest performance (AUC-PR 93.42 for NonFact) on the wiki_bio_gpt3_hallucination dataset. The wiki_bio_gpt3_hallucination dataset is available via Hugging Face Datasets or direct download.
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats