Dataset for grade school math word problems
This repository provides the GSM8K dataset, a collection of 8,500 grade-school level math word problems designed to evaluate and improve the multi-step reasoning capabilities of large language models. It targets AI researchers and developers working on natural language understanding and mathematical reasoning in AI.
How It Works
GSM8K addresses LLM failures in multi-step mathematical reasoning by offering a curated dataset of linguistically diverse problems. Solutions are formatted with calculation annotations (e.g., <<50*3=150>>) that can be parsed by an external calculator, mitigating arithmetic errors common in LLMs. This approach allows models to offload calculations, improving accuracy on complex problems.
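As a rough illustration of offloading a calculation, the sketch below completes an open annotation such as `<<50*3=` with the computed result. It is not the repository's calculator code: `safe_eval` and `complete_annotation` are hypothetical names introduced here, and the sketch assumes annotations contain only plain arithmetic (`+`, `-`, `*`, `/`).

```python
import ast
import operator

# Binary operators this sketch is willing to evaluate (an assumption,
# not a statement about what the dataset's annotations contain).
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def safe_eval(expr):
    """Evaluate a plain arithmetic expression without calling eval()."""

    def walk(node):
        if isinstance(node, ast.Expression):
            return walk(node.body)
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](walk(node.left), walk(node.right))
        if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
            return -walk(node.operand)
        raise ValueError(f"unsupported expression: {expr!r}")

    return walk(ast.parse(expr, mode="eval"))


def complete_annotation(partial_text):
    """If the text ends inside an open annotation like '<<50*3=',
    append the computed result and the closing '>>'."""
    start = partial_text.rfind("<<")
    if start == -1 or not partial_text.endswith("=") or ">>" in partial_text[start:]:
        return partial_text  # nothing to complete
    expr = partial_text[start + 2:-1]
    value = safe_eval(expr)
    # Render whole numbers without a trailing ".0".
    shown = str(int(value)) if float(value).is_integer() else str(value)
    return partial_text + shown + ">>"
```

In a sampling loop, a hook like this would run whenever the model emits `=` inside an annotation, so the model never has to produce the arithmetic result itself.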
Quick Start & Requirements
Install dependencies: pip install -r requirements.txt
Data: train.jsonl and test.jsonl from grade_school_math/data/
View model solutions: python view_model_solutions.py
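The data files are JSON Lines, so they can be read with the standard library alone. The sketch below is a minimal loader; the helper names are hypothetical, and the `question`/`answer` field names and the `####` final-answer marker are assumptions about the file layout that this summary does not state, so check them against the files before relying on them.

```python
import json
from pathlib import Path


def load_jsonl(path):
    """Read one JSON object per non-empty line of a .jsonl file."""
    records = []
    with Path(path).open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                records.append(json.loads(line))
    return records


def final_answer(answer_text, marker="####"):
    """Return the text after the last marker, or None if it is absent.

    The '####' marker is an assumption about how solutions end.
    """
    if marker not in answer_text:
        return None
    return answer_text.rsplit(marker, 1)[1].strip()
```

For example, `load_jsonl("grade_school_math/data/train.jsonl")` would return a list of dictionaries, and `final_answer(record["answer"])` would pull out the final numeric answer if the assumed layout holds.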
Highlighted Details
Includes calculator.py for integrating a calculator during sampling.
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The provided model-generated samples may contain occasional calculation errors due to previous implementation bugs in the calculator, which have since been fixed in the codebase but not reflected in the samples. The example training and sampling code is for illustrative purposes and is noted as inefficient (no batching, no activation caching).
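One way to spot such calculation errors is to recompute every annotation in a sample and flag those whose stated result disagrees. The sketch below is a hypothetical checker, not part of the repository: `find_bad_annotations` is a name introduced here, and it assumes annotations have the form `<<expression=result>>` with plain arithmetic on the left.

```python
import math
import re

# Matches <<expression=result>> and captures both sides.
ANNOTATION = re.compile(r"<<([^<>=]+)=([^<>=]+)>>")
# Whitelist of characters allowed in an expression before it is evaluated.
ARITHMETIC = re.compile(r"[0-9+\-*/(). ]+")


def find_bad_annotations(solution, tol=1e-6):
    """Return (expression, stated, recomputed) for every annotation whose
    stated result does not match the recomputed value. Annotations that
    cannot be evaluated are reported with recomputed=None."""
    bad = []
    for expr, stated in ANNOTATION.findall(solution):
        # Reject anything outside plain arithmetic (and the power operator).
        if not ARITHMETIC.fullmatch(expr) or "**" in expr:
            bad.append((expr, stated, None))
            continue
        try:
            # Safe only because of the character whitelist above.
            actual = eval(expr, {"__builtins__": {}}, {})
            matches = math.isclose(actual, float(stated), abs_tol=tol)
        except (SyntaxError, ValueError, ZeroDivisionError, TypeError):
            bad.append((expr, stated, None))
            continue
        if not matches:
            bad.append((expr, stated, actual))
    return bad
```

An empty list means every annotation in the sample is internally consistent; it says nothing about whether the reasoning around the annotations is correct.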