Discover and explore top open-source AI tools and projects—updated daily.
ParticleMediaHallucination corpus and evaluation tools for trustworthy RAG
Top 99.6% on SourcePulse
<2-3 sentences summarising what the project addresses and solves, the target audience, and the benefit.> RAGTruth addresses the critical issue of hallucinations in Retrieval-Augmented Generation (RAG) systems. It offers a comprehensive, word-level hallucination corpus derived from diverse LLM responses across various RAG tasks (QA, Data2txt, Summary). This dataset empowers researchers and engineers to train and rigorously evaluate RAG models, fostering the development of more trustworthy and reliable AI.
How It Works
The project provides nearly 18,000 manually annotated responses generated by multiple LLMs under RAG conditions. Annotations are granular, identifying specific hallucination spans, their types (e.g., Evident Baseless Info, implicit_true), and intensity. This detailed labeling facilitates precise measurement and targeted mitigation of factual inaccuracies and unsupported claims within LLM outputs.
Quick Start & Requirements
Training and evaluation code were released in June 2024. Model weights are also available. Specific installation commands, dependencies (e.g., Python version, CUDA), or setup resource estimates are not detailed in the provided README excerpt.
Highlighted Details
response.jsonl and source_info.jsonl with comprehensive fields for analysis.Maintenance & Community
The project has seen recent updates in January, February, and June 2024, indicating active maintenance. No specific community channels (e.g., Discord, Slack) or detailed contributor information are present in the excerpt.
Licensing & Compatibility
The provided README excerpt does not specify a software license. This lack of clarity may impact commercial use or integration into closed-source projects.
Limitations & Caveats
The README does not detail any specific limitations, known bugs, or alpha status. The focus is on the dataset's utility for hallucination research.
1 year ago
Inactive
RUCAIBox
HillZhang1999
potsawee