LettuceDetect by KRLabsOrg

Hallucination detection framework for RAG applications

Created 7 months ago
494 stars

Top 62.6% on SourcePulse

View on GitHub
Project Summary

LettuceDetect is a hallucination detection framework for Retrieval-Augmented Generation (RAG) systems, designed to identify unsupported parts of an answer by comparing it against provided context. It targets developers and researchers working with RAG, offering a lightweight, efficient, and precise solution to improve the factual accuracy of AI-generated responses.

How It Works

LettuceDetect uses a token-level classification approach, inspired by encoder-based models such as Luna and built on ModernBERT for extended context processing. Classifying each answer token against the retrieved context allows precise identification of hallucinated spans. This design sidesteps the short context windows of traditional encoder models and is more computationally efficient than LLM-based detection methods.
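
To make the approach concrete, the sketch below loads one of the released checkpoints as a plain token-classification model via Hugging Face Transformers and flags tokens predicted as unsupported. This is only an illustration: whether the checkpoint loads this way, how context, question, and answer are concatenated, and which label index marks a hallucination are assumptions, not details confirmed by this summary.

  # Illustrative sketch only: load a released checkpoint as a token-
  # classification model and flag tokens predicted as unsupported. The
  # input concatenation scheme and the meaning of label 1 are assumptions.
  import torch
  from transformers import AutoTokenizer, AutoModelForTokenClassification

  model_id = "KRLabsOrg/lettucedect-base-modernbert-en-v1"
  tokenizer = AutoTokenizer.from_pretrained(model_id)
  model = AutoModelForTokenClassification.from_pretrained(model_id)

  context = "France is a country in Europe. The capital of France is Paris."
  question = "What is the capital of France?"
  answer = "The capital of France is Lyon."

  # Concatenate passage, question, and answer into one sequence (separator
  # scheme assumed, not taken from the project docs).
  text = f"{context} {question} {answer}"
  inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=4096)

  with torch.no_grad():
      logits = model(**inputs).logits  # shape: (1, seq_len, num_labels)

  predicted = logits.argmax(dim=-1)[0].tolist()
  tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
  flagged = [tok for tok, label in zip(tokens, predicted) if label == 1]
  print("Tokens flagged as unsupported:", flagged)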

Quick Start & Requirements

  • Install via pip: pip install lettucedetect or pip install -e . for development.
  • Requires Python; models are loaded through Hugging Face Transformers.
  • Models are available on Huggingface: KRLabsOrg/lettucedect-base-modernbert-en-v1 and KRLabsOrg/lettucedect-large-modernbert-en-v1.
  • Official quick-start example and demo available in the README.
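
The README carries the official quick start; the sketch below follows that pattern through the package's Python API. The class and argument names (HallucinationDetector, method, model_path, predict, output_format) are assumptions to verify against the README.

  # Sketch of the high-level Python API, following the README's quick-start
  # pattern. Class and argument names are assumptions; check the README.
  from lettucedetect.models.inference import HallucinationDetector

  detector = HallucinationDetector(
      method="transformer",
      model_path="KRLabsOrg/lettucedect-base-modernbert-en-v1",
  )

  contexts = ["France is a country in Europe. The capital of France is Paris."]
  question = "What is the capital of France?"
  answer = "The capital of France is Lyon."

  # Expected to return character-level spans of the answer that the context
  # does not support.
  spans = detector.predict(
      context=contexts,
      question=question,
      answer=answer,
      output_format="spans",
  )
  print(spans)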

Highlighted Details

  • Achieves a 79.22% F1 score on the RAGTruth dataset with its large model, outperforming GPT-4 and Luna and remaining competitive with a fine-tuned LLAMA-3-8B.
  • Provides token-level precision for identifying exact hallucinated spans (a generic span-merging sketch follows this list).
  • Optimized for inference with smaller model sizes and faster processing.
  • Supports a 4K-token context window via ModernBERT.
  • Integrates with Hugging Face Transformers for easy model loading.
  • Includes a Python API and an optional Web API.
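
To illustrate how token-level predictions become exact character spans (as referenced in the token-level precision bullet above), the generic helper below merges consecutive flagged tokens into character ranges using a fast tokenizer's offset mapping. It is a sketch of the general technique, not LettuceDetect's internal implementation.

  # Generic helper (not LettuceDetect internals): merge consecutive tokens
  # labelled 1 ("hallucinated") into character-level spans, using the offset
  # mapping a fast Hugging Face tokenizer can return.
  def merge_token_labels_to_spans(offsets, labels):
      """offsets: list of (start, end) character positions per token;
      labels: 0/1 per token. Returns merged (start, end) spans."""
      spans, current = [], None
      for (start, end), label in zip(offsets, labels):
          if label == 1:
              if current is None:
                  current = [start, end]       # open a new span
              else:
                  current[1] = end             # extend the open span
          elif current is not None:
              spans.append(tuple(current))     # close the span
              current = None
      if current is not None:
          spans.append(tuple(current))
      return spans

  # Tokens 3 and 4 are flagged, so they merge into one span over chars 10-22.
  print(merge_token_labels_to_spans(
      [(0, 4), (5, 9), (10, 16), (17, 22), (23, 30)],
      [0, 0, 1, 1, 0],
  ))  # -> [(10, 22)]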

Maintenance & Community

  • Developed by KRLabsOrg.
  • MIT-licensed code and models.
  • Citation details provided for academic use.

Licensing & Compatibility

  • Licensed under the MIT License, permitting commercial use and integration with closed-source applications.

Limitations & Caveats

  • While competitive, the large model is noted as "coming up just short" of the SOTA fine-tuned LLAMA-3-8B from the RAG-HAT paper.
  • Training requires downloading the RAGTruth dataset separately.

Health Check

  • Last commit: 1 week ago
  • Responsiveness: 1 day
  • Pull requests (30d): 3
  • Issues (30d): 1

Star History

  • 17 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen (author of "AI Engineering" and "Designing Machine Learning Systems"), Travis Fischer (founder of Agentic), and 1 more.

HaluEval by RUCAIBox

0.8%
510
Benchmark dataset for LLM hallucination evaluation
Created 2 years ago
Updated 1 year ago
Starred by Chip Huyen (author of "AI Engineering" and "Designing Machine Learning Systems"), Pawel Garbacki (cofounder of Fireworks AI), and 4 more.

LongLoRA by dvlab-research

0.1%
3k
LongLoRA: Efficient fine-tuning for long-context LLMs
Created 2 years ago
Updated 1 year ago