Awesome-LLM-Uncertainty-Reliability-Robustness  by jxzhangjhu

Curated list of LLM uncertainty, reliability, and robustness resources

created 2 years ago
774 stars

Top 46.0% on sourcepulse

GitHubView on GitHub
Project Summary

This repository is a curated list of academic papers and resources focused on Uncertainty, Reliability, and Robustness (UR2) in Large Language Models (LLMs). It serves as a comprehensive reference for researchers and practitioners aiming to understand and improve the trustworthiness and dependability of LLM outputs.

How It Works

The repository categorizes resources into key areas such as Uncertainty Estimation, Calibration, Reliability, Hallucination, Reasoning, Prompt Engineering, and Robustness (including Invariance, Distribution Shift, and Adversarial attacks). It provides links to papers, technical reports, tutorials, and relevant blog posts, offering a structured overview of the current research landscape.

Highlighted Details

  • Extensive collection of papers covering diverse UR2 aspects of LLMs.
  • Includes links to official reports (e.g., GPT-4 Technical Report), benchmarks (e.g., HallusionBench), and toolkits (e.g., TextFlint, Robustness Gym).
  • Covers foundational concepts and cutting-edge research in LLM evaluation and safety.
  • Features resources on prompt engineering techniques for improving reliability.

Maintenance & Community

This is a community-driven "awesome list" project, with contributions welcomed from the research community.

Licensing & Compatibility

The repository itself is typically licensed under permissive terms (e.g., MIT License), but the linked academic papers are subject to their respective copyright and licensing agreements.

Limitations & Caveats

As a curated list, it does not provide code or direct tools for implementing UR2 techniques. The content is a snapshot of research and may not include the very latest publications.

Health Check
Last commit

2 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
31 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.