deepconf by facebookresearch

Parallel thinking framework for enhanced LLM reasoning

Created 2 months ago
293 stars

Top 90.0% on SourcePulse

Project Summary

DeepConf is an efficient parallel thinking framework designed to enhance Large Language Model (LLM) reasoning. Built on advanced LLM serving backends, it targets users who need more robust, higher-confidence outputs on tasks like math, science, and coding, and improves both accuracy and efficiency through novel voting and early-stopping mechanisms.

How It Works

The project builds on popular LLM serving backends, notably vLLM, by adding a parallel thinking pipeline: multiple reasoning traces are generated in parallel and then consolidated into a final answer via a set of voting and aggregation strategies. A key innovation is confidence-based early stopping, which terminates generation once a sufficient level of confidence is reached, saving compute and reducing latency.
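The two ideas in that pipeline can be sketched in a few lines. The function below is an illustrative toy, not DeepConf's actual API: its name, the (answer, confidence) trace format, and the threshold value are all assumptions made for the example.

```python
from collections import defaultdict

def aggregate_traces(traces, stop_confidence=0.95):
    """Consume a stream of (answer, confidence) traces, doing
    confidence-weighted voting with early stopping.

    Returns (voted_answer, consumed), where `consumed` is how many
    traces were read before the early-stopping criterion fired.
    Hypothetical sketch; the real DeepConf logic differs in detail.
    """
    weight = defaultdict(float)  # answer -> accumulated confidence
    total = 0.0
    consumed = 0
    for answer, conf in traces:
        consumed += 1
        weight[answer] += conf
        total += conf
        leader = max(weight, key=weight.get)
        # Early stop once the leading answer holds enough of the
        # total confidence mass (require at least two traces first).
        if consumed >= 2 and weight[leader] / total >= stop_confidence:
            return leader, consumed
    return max(weight, key=weight.get), consumed
```

With two strongly agreeing traces, the third is never needed; with split votes, all traces are consumed before the leader is picked.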

Quick Start & Requirements

Installation is straightforward via pip: pip install deepconf. A requirements.txt file is also provided for pinned dependency management (uv pip install -r requirements.txt). The framework is built on vLLM and requires compatible LLM models, such as deepseek-ai/DeepSeek-R1-0528-Qwen3-8B. The README references a website, paper, Twitter/X, and quick-start guide, but does not include the links.
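The install commands above, collected for copy-paste (the requirements.txt path assumes you are working from a clone of the repository root):

```shell
# Install the package from PyPI
pip install deepconf

# Or, inside a repository clone, install the pinned dependencies with uv
uv pip install -r requirements.txt
```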

Highlighted Details

  • The DeepThinkLLM class wraps vLLM, offering both standard generation (generate()) and enhanced reasoning (deepthink()).
  • deepthink() supports two modes: "online" (confidence-based early stopping with warmup traces) and "offline" (batch generation with multiple voting strategies).
  • The output is structured as a DeepThinkOutput dataclass, providing final/voted answers, detailed voting results with confidences, all generated traces, confidence thresholds, and performance statistics.
  • Full compatibility with vLLM is maintained, allowing standard vLLM initialization parameters (vllm_kwargs).
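The output structure described above can be approximated as a plain dataclass. The field names and types below are illustrative guesses based on the README's description, not DeepConf's actual DeepThinkOutput definition:

```python
from dataclasses import dataclass, field

@dataclass
class DeepThinkOutput:
    """Hypothetical approximation of the result object returned by
    deepthink(); real field names and types may differ."""
    final_answer: str                              # answer selected after voting
    voted_answers: dict                            # candidate answer -> aggregate confidence
    traces: list = field(default_factory=list)     # all generated reasoning traces
    confidence_threshold: float = 0.0              # early-stopping threshold used
    stats: dict = field(default_factory=dict)      # timing / token-count statistics

# Example of inspecting such a result
out = DeepThinkOutput(
    final_answer="42",
    voted_answers={"42": 1.8, "7": 0.3},
    confidence_threshold=0.95,
)
best = max(out.voted_answers, key=out.voted_answers.get)
```

A dataclass keeps the voting details and raw traces alongside the final answer, so callers can audit why an answer won rather than trusting a bare string.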

Maintenance & Community

The project references links for a website, research paper, and Twitter/X, but these URLs are not included in the provided README. No specific community channels (e.g., Discord, Slack) or details on contributors/sponsorships are mentioned.

Licensing & Compatibility

The README does not specify a software license. This omission requires further investigation to determine usage restrictions, particularly for commercial applications or integration into closed-source projects.

Limitations & Caveats

No explicit limitations, known bugs, or alpha status are declared. The specific LLM models used (e.g., deepseek-ai/DeepSeek-R1-0528-Qwen3-8B) may imply significant hardware requirements (GPU, VRAM) not detailed in the README. The absence of direct links for documentation and community resources hinders immediate deeper evaluation.

Health Check

Last Commit: 1 month ago
Responsiveness: Inactive
Pull Requests (30d): 0
Issues (30d): 1
Star History: 70 stars in the last 30 days
