DeepSeek-Math  by deepseek-ai

Math reasoning model for competition-level problems

created 1 year ago
2,833 stars

Top 17.2% on sourcepulse

GitHubView on GitHub
Project Summary

DeepSeekMath provides open-source 7B parameter language models specifically trained for advanced mathematical reasoning. Targeting researchers and developers, these models offer strong performance on benchmarks like MATH, approaching proprietary model capabilities without external toolkits, and also demonstrate robust tool-use and coding abilities.

How It Works

DeepSeekMath models are initialized from DeepSeek-Coder-v1.5 7B and further pre-trained on a massive 500B token dataset comprising mathematical web texts, natural language, and code. This extensive training, particularly on curated mathematical content from Common Crawl, imbues the models with superior mathematical reasoning capabilities. The instruct and RL variants are fine-tuned for better instruction following and optimized using a Group Relative Policy Optimization (GRPO) algorithm, respectively.

Quick Start & Requirements

  • Install/Run: Use Hugging Face's Transformers library.
  • Prerequisites: Python, PyTorch. torch_dtype=torch.bfloat16 is recommended for inference.
  • Resources: Requires sufficient VRAM for a 7B model (e.g., ~14GB for bfloat16).
  • Docs: Quick Start

Highlighted Details

  • Achieves 51.7% on the MATH benchmark (few-shot CoT), outperforming other open-source models by >10%.
  • Demonstrates strong tool-use capabilities for solving and proving mathematical problems.
  • Offers comparable reasoning and coding performance to its base model, DeepSeekCoder-7B.
  • The RL version approaches 60% accuracy on MATH with tool use.

Maintenance & Community

Licensing & Compatibility

  • Code licensed under MIT.
  • Model use subject to a separate Model License.
  • Commercial use is permitted.

Limitations & Caveats

  • The README advises against using system prompts for instruct/RL models and recommends specific chain-of-thought prompting formats for optimal results.
Health Check
Last commit

1 year ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
1
Star History
176 stars in the last 90 days

Explore Similar Projects

Starred by Georgios Konstantopoulos Georgios Konstantopoulos(CTO, General Partner at Paradigm) and Jiayi Pan Jiayi Pan(Author of SWE-Gym; AI Researcher at UC Berkeley).

DeepSeek-V2 by deepseek-ai

0.1%
5k
MoE language model for research/API use
created 1 year ago
updated 10 months ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems) and Jiayi Pan Jiayi Pan(Author of SWE-Gym; AI Researcher at UC Berkeley).

DeepSeek-Coder-V2 by deepseek-ai

0.4%
6k
Open-source code language model comparable to GPT4-Turbo
created 1 year ago
updated 10 months ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
10 more.

open-r1 by huggingface

0.2%
25k
SDK for reproducing DeepSeek-R1
created 6 months ago
updated 3 days ago
Starred by Michael Han Michael Han(Cofounder of Unsloth), Sebastian Raschka Sebastian Raschka(Author of Build a Large Language Model From Scratch), and
6 more.

DeepSeek-R1 by deepseek-ai

0.1%
91k
Reasoning models research paper
created 6 months ago
updated 1 month ago
Feedback? Help us improve.