CVPR'24 research on aligning MLLMs via fine-grained human feedback
RLHF-V provides a framework for aligning Multimodal Large Language Models (MLLMs) using fine-grained human feedback to reduce hallucinations. It targets researchers and developers aiming to improve MLLM trustworthiness, offering a data-efficient method to enhance model behavior.
How It Works
The framework leverages fine-grained correctional human feedback: annotators directly correct the hallucinated segments of MLLM responses, and the model is then aligned on these targeted corrections. Because the feedback pinpoints exactly which spans are wrong, the approach is data-efficient, reducing hallucination rates substantially with minimal training time while improving overall reliability.
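As an illustration of how such segment-level corrections can feed alignment training, the sketch below packages a corrected response and its original as a preference pair for DPO-style optimization. The CorrectionRecord fields and the pairing logic are assumptions made for this example, not the repository's actual data schema or training objective.

    # Illustrative sketch only: one way to turn segment-level corrections into
    # preference pairs for DPO-style alignment. Field names are assumptions.
    from dataclasses import dataclass


    @dataclass
    class CorrectionRecord:
        image_id: str    # image the response describes
        response: str    # original MLLM response containing hallucinated segments
        corrected: str   # annotator-corrected response with those segments fixed


    def to_preference_pair(record: CorrectionRecord) -> dict:
        # The corrected response is treated as "chosen" and the original as
        # "rejected". Because only the hallucinated segments differ, the pair
        # carries a dense, span-level signal about exactly what was wrong.
        return {
            "image_id": record.image_id,
            "chosen": record.corrected,
            "rejected": record.response,
        }


    if __name__ == "__main__":
        rec = CorrectionRecord(
            image_id="coco_000000123456",
            response="A man in a red shirt is holding an umbrella.",
            corrected="A man in a red shirt is holding a baseball bat.",
        )
        print(to_preference_pair(rec))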
Quick Start & Requirements
Create the conda environment, activate it, and install the package in editable mode:

    conda create -n muffin python=3.10
    conda activate muffin
    pip install -e .

Specific versions of transformers and flash-attention are recommended for reproducibility. Evaluating on Object HalBench additionally requires the COCO2014 dataset annotations.
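After installation, a short sanity check like the one below (illustrative, not part of the repository) can confirm that the recommended packages are installed and that a CUDA-capable GPU, which flash-attention requires, is visible; consult the repository's requirements for the exact version pins.

    # Illustrative environment check, not part of the RLHF-V repository.
    import importlib.metadata as md

    for pkg in ("transformers", "flash-attn", "torch"):
        try:
            print(f"{pkg}: {md.version(pkg)}")
        except md.PackageNotFoundError:
            print(f"{pkg}: not installed")

    try:
        import torch
        print("CUDA available:", torch.cuda.is_available())
    except ImportError:
        print("torch not importable")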
Maintenance & Community
The project is associated with THU NLP, and its methods have been adopted in related models such as MiniCPM-V 2.0 and OmniLMM-12B. Updates are posted regularly on Hugging Face and arXiv.
Licensing & Compatibility
Limitations & Caveats
The dataset and models are strictly for research purposes and non-commercial use. Compatibility is tied to the licenses of the base models used.