verbalized-sampling by CHATS-lab

Enhance LLM diversity with training-free verbalized sampling

Created 1 year ago

774 stars

Top 44.4% on SourcePulse

Project Summary

Verbalized Sampling is a training-free prompting strategy designed to mitigate mode collapse and significantly enhance Large Language Model (LLM) response diversity by 2-3x, while preserving output quality. This model-agnostic framework is ideal for users engaged in creative writing, synthetic data generation, and dialogue simulation, offering a straightforward method to unlock richer, more varied LLM outputs.

How It Works

The core approach involves prompting LLMs to generate multiple candidate responses, each accompanied by its estimated probability. The system then samples from this distribution, specifically targeting lower-probability responses (below 0.10), to encourage diversity. This method is training-free, meaning it can be applied to any LLM via prompting without requiring model fine-tuning. It is also orthogonal to the temperature parameter and effective across a wide range of tasks.

Quick Start & Requirements

Primary install / run command: pip install verbalized-sampling
Non-default prerequisites: API keys (e.g., OPENAI_API_KEY, OPENROUTER_API_KEY) are required for the Python package. For optimal results, advanced models like GPT-5, Claude 4 Opus, and Gemini 2.5 Pro are recommended.
Relevant links:
- Homepage: https://www.verbalized-sampling.com/
- Paper: https://arxiv.org/abs/2510.01171
- Blog: https://simonucl.notion.site/verbalized-sampling
- Colab Notebooks and examples are available via GitHub links in the README.

Highlighted Details

Achieves 2-3x improvement in LLM response diversity.
Training-free and model-agnostic, compatible with GPT, Claude, Gemini, Llama, and others.
Orthogonal to temperature settings, offering an alternative control for output variation.
Effective for creative writing, social simulation, synthetic data generation, and open-ended QA.
Includes a Python package with CLI/API functionality and LangChain integration.

Maintenance & Community

The project provides links to its paper and blog, indicating active development and research. Specific community channels (e.g., Discord, Slack) or notable contributors are not detailed in the provided README.

Licensing & Compatibility

This project is licensed under the Apache License 2.0, which is permissive for commercial use and integration into closed-source projects.

Limitations & Caveats

The README suggests optimal performance with advanced LLMs such as GPT-5, Claude 4 Opus, and Gemini 2.5 Pro, implying that results may vary with less capable models. The prompt examples require specific response formatting (e.g., <response>, <text>, <probability>), which may necessitate careful handling to ensure correct interpretation by the target LLM.

verbalized-sampling by CHATS-lab

Explore Similar Projects

MR-Models by mtkresearch

lost_in_conversation by microsoft

InternBootcamp by InternLM

LMaaS-Papers by txsun1997

MemoryLLM by wangyu-ustc

Awesome-LLMs-as-Judges by CSHaitao

llm-consortium by irthomasthomas

Shubhamsaboo-awesome-llm-apps by joypaul162

OpenELM by CarperAI

locomo by snap-research

LLMZoo by FreedomIntelligence

lm-evaluation-harness by EleutherAI