awesome-hallucination-detection by EdinburghNLP

Hallucination detection resources for large language models

created 1 year ago
928 stars

Top 40.2% on sourcepulse

View on GitHub
Project Summary

This repository is a curated list of papers focused on detecting and mitigating hallucinations in Large Language Models (LLMs). It serves researchers and practitioners aiming to improve the factual accuracy and trustworthiness of LLM outputs across various domains, including question answering, summarization, and vision-language tasks.

How It Works

The collection highlights diverse approaches to hallucination detection and mitigation. Methods range from analyzing semantic similarities and embedding spaces to leveraging internal model states, external knowledge bases, and even fine-grained AI feedback. Some papers focus on preemptive detection before generation, while others propose post-generation correction or uncertainty quantification techniques.
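
As a concrete illustration of the sampling-based family, the sketch below implements a consistency check in the spirit of approaches such as SelfCheckGPT: sample several answers to the same prompt and treat low mutual agreement in embedding space as a hallucination signal. This is a minimal sketch under stated assumptions, not any paper's reference implementation; `sample_answers` is a hypothetical stand-in for your LLM client, and the embedding model and threshold are illustrative choices.

```python
# Minimal sketch: sampling-based consistency as a hallucination signal.
# Assumes `sentence-transformers` and `numpy` are installed; `sample_answers`
# is a hypothetical placeholder for whatever LLM API you use.
import itertools

import numpy as np
from sentence_transformers import SentenceTransformer

embedder = SentenceTransformer("all-MiniLM-L6-v2")

def sample_answers(prompt: str, n: int = 5) -> list[str]:
    # Hypothetical: call your LLM n times with temperature > 0.
    raise NotImplementedError

def consistency_score(answers: list[str]) -> float:
    """Mean pairwise cosine similarity of the sampled answers."""
    embs = embedder.encode(answers, normalize_embeddings=True)
    sims = [float(np.dot(a, b)) for a, b in itertools.combinations(embs, 2)]
    return sum(sims) / len(sims)

def looks_hallucinated(prompt: str, threshold: float = 0.8) -> bool:
    # Low agreement across samples suggests the model is guessing.
    return consistency_score(sample_answers(prompt)) < threshold
```

The fixed threshold is a simplification; the uncertainty-quantification papers in the list refine this basic agreement signal with calibrated and token-level variants.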

Quick Start & Requirements

  • This is a curated list of research papers, not a runnable software package.
  • Links to papers, datasets, and code repositories are provided within the README.

Highlighted Details

  • Covers a broad spectrum of hallucination types, including factuality and faithfulness.
  • Includes benchmarks and datasets specifically designed for evaluating hallucination detection and mitigation.
  • Features methods applicable to both text-only and multimodal (vision-language) LLMs.
  • Discusses various evaluation metrics, from traditional statistical measures to model-based and human-centric assessments; a minimal sketch of one model-based metric follows this list.
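
To make the model-based end of that spectrum concrete, here is a hedged sketch of a common pattern: scoring a generated claim against its source with an off-the-shelf NLI model, where a low entailment probability flags a possible faithfulness hallucination. The model name `roberta-large-mnli` and its label ordering are assumptions taken from that model's card, not a method prescribed by the list.

```python
# Minimal sketch: NLI-based faithfulness scoring of a generated claim
# against its source text. Requires `transformers` and `torch`.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Assumed label order for this model: 0=contradiction, 1=neutral, 2=entailment.
MODEL = "roberta-large-mnli"
tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForSequenceClassification.from_pretrained(MODEL)
model.eval()

def entailment_prob(source: str, claim: str) -> float:
    """P(source entails claim); low values suggest an unfaithful claim."""
    inputs = tokenizer(source, claim, return_tensors="pt", truncation=True)
    with torch.no_grad():
        logits = model(**inputs).logits
    return logits.softmax(dim=-1)[0, 2].item()  # index 2 = entailment

source = "The report was published in March 2021 by the WHO."
print(entailment_prob(source, "The WHO published the report in 2021."))  # high
print(entailment_prob(source, "The report was published by the UN."))    # low
```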

Maintenance & Community

  • The repository is maintained by EdinburghNLP.
  • The breadth and recency of the cited papers indicate active, ongoing curation.
  • Links to related surveys and shared tasks are provided.

Licensing & Compatibility

  • The repository itself is licensed under the MIT License.
  • Individual papers and linked code repositories will have their own licenses.

Limitations & Caveats

  • This resource is a collection of research papers and does not provide a unified tool or framework for hallucination detection.
  • The effectiveness and applicability of the discussed methods vary depending on the specific LLM and task.

Health Check

  • Last commit: 1 month ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 0

Star History

  • 81 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen (author of AI Engineering and Designing Machine Learning Systems) and Travis Fischer (founder of Agentic):

HaluEval by RUCAIBox

  • Top 0.2% on sourcepulse, 497 stars
  • Benchmark dataset for LLM hallucination evaluation
  • Created 2 years ago, updated 1 year ago

Starred by Jerry Liu (cofounder of LlamaIndex), Chip Huyen (author of AI Engineering and Designing Machine Learning Systems), and 1 more:

hallucination-leaderboard by vectara

  • Top 0.9% on sourcepulse, 3k stars
  • LLM leaderboard for hallucination detection in summarization
  • Created 1 year ago, updated 1 day ago