r1-overthinker  by qunash

Gradio app for extending DeepSeek R1 reasoning

created 6 months ago
368 stars

Top 77.8% on sourcepulse

GitHubView on GitHub
Project Summary

This project enables users to extend the reasoning capabilities of DeepSeek R1 models, allowing them to "overthink" and produce more thorough responses. It targets researchers and power users seeking deeper insights from LLMs by providing fine-grained control over the generation process and supporting unlimited context length, limited only by VRAM.

How It Works

The core mechanism involves intercepting early model conclusions and replacing them with prompts that encourage further deliberation. This "budget forcing" technique, validated by the independent "s1: Simple test-time scaling" paper, allows for controlled extension of the model's thinking process until a user-defined threshold is met. It leverages unsloth-optimized models for enhanced performance and VRAM efficiency.

Quick Start & Requirements

  • Install via pip install -e .
  • Requires Python 3.10+ and PyTorch.
  • Supports various DeepSeek R1 models (1.5B to 70B parameters), including Qwen and LLaMA architectures.
  • Models up to 14B parameters can run on a free Google Colab T4 GPU.
  • See unsloth for optimization details.

Highlighted Details

  • Forces models to think longer and more thoroughly.
  • Customizable reasoning extensions and thinking thresholds.
  • Fine-grained control over model parameters (temperature, top-p).
  • Visible thinking process with token count tracking.

Maintenance & Community

  • Developed by anzorq.
  • Credits original idea to vgel's gist.
  • Utilizes unsloth for optimization and Gradio for the app interface.

Licensing & Compatibility

  • MIT License.
  • Permissive license suitable for commercial use and integration into closed-source projects.

Limitations & Caveats

The effectiveness of "overthinking" may vary depending on the specific model and task. The project relies on unsloth optimizations, which might introduce specific dependencies or behaviors.

Health Check
Last commit

5 months ago

Responsiveness

1+ week

Pull Requests (30d)
0
Issues (30d)
0
Star History
7 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems) and Jiayi Pan Jiayi Pan(Author of SWE-Gym; AI Researcher at UC Berkeley).

DeepSeek-Coder-V2 by deepseek-ai

0.4%
6k
Open-source code language model comparable to GPT4-Turbo
created 1 year ago
updated 10 months ago
Starred by George Hotz George Hotz(Author of tinygrad; Founder of the tiny corp, comma.ai), Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), and
5 more.

TinyZero by Jiayi-Pan

0.2%
12k
Minimal reproduction of DeepSeek R1 Zero for countdown/multiplication tasks
created 6 months ago
updated 3 months ago
Feedback? Help us improve.