LLM skill-improvement pipelines for synthetic data generation, training, and evaluation
Top 63.4% on sourcepulse
NeMo-Skills provides pipelines for enhancing large language models' reasoning and problem-solving capabilities, particularly in mathematics. It targets researchers and developers working with LLMs, offering tools for data generation, model training, and evaluation, with a focus on reproducible results and state-of-the-art performance on mathematical benchmarks.
How It Works
The project leverages NeMo-Aligner for efficient model training and supports flexible inference across various backends like NeMo, vLLM, sglang, and TensorRT-LLM. It facilitates checkpoint format conversion and offers a suite of evaluation pipelines for diverse tasks including math problem-solving, formal proofs, coding, chat, and general knowledge. A key contribution is the OpenMathReasoning dataset, featuring synthetic data for mathematical reasoning.
Quick Start & Requirements
ns --help
command.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
1 day ago
1 day