Skills  by NVIDIA-NeMo

LLM skill-improvement pipelines for synthetic data generation, training, and evaluation

Created 1 year ago
756 stars

Top 46.1% on SourcePulse

GitHubView on GitHub
Project Summary

NeMo-Skills provides pipelines for enhancing large language models' reasoning and problem-solving capabilities, particularly in mathematics. It targets researchers and developers working with LLMs, offering tools for data generation, model training, and evaluation, with a focus on reproducible results and state-of-the-art performance on mathematical benchmarks.

How It Works

The project leverages NeMo-Aligner for efficient model training and supports flexible inference across various backends like NeMo, vLLM, sglang, and TensorRT-LLM. It facilitates checkpoint format conversion and offers a suite of evaluation pipelines for diverse tasks including math problem-solving, formal proofs, coding, chat, and general knowledge. A key contribution is the OpenMathReasoning dataset, featuring synthetic data for mathematical reasoning.

Quick Start & Requirements

  • Installation and usage details are available via a tutorial and the ns --help command.
  • Requires NVIDIA NeMo framework and potentially specific hardware for training and inference.
  • Links: Tutorial, Pipelines

Highlighted Details

  • Achieved state-of-the-art results on mathematical benchmarks like AIME24 and HMMT-24-25 with OpenMath-Nemotron models.
  • Released the OpenMathReasoning dataset (540K math problems, 3.2M CoT solutions, 1.7M TIR solutions).
  • Supports seamless switching between inference servers (NeMo, vLLM, sglang, TensorRT-LLM) and checkpoint conversion.
  • Includes Nemo Inspector for visualizing inference and data analysis.

Maintenance & Community

  • Developed by NVIDIA, with contributions from researchers like Ivan Moshkov, Shubham Toshniwal, and Igor Gitman.
  • Citations provided for related papers on arXiv.

Licensing & Compatibility

  • The project is strictly for research purposes.
  • No explicit license is mentioned in the README, implying potential restrictions on commercial use or closed-source integration.

Limitations & Caveats

  • The project is designated for research purposes only, which may impose usage restrictions.
  • Specific hardware requirements for optimal performance are not detailed but are implied by the NVIDIA ecosystem.
Health Check
Last Commit

23 hours ago

Responsiveness

1 day

Pull Requests (30d)
55
Issues (30d)
7
Star History
100 stars in the last 30 days

Explore Similar Projects

Starred by Théophile Gervet Théophile Gervet(Cofounder of Genesis AI), Jason Knight Jason Knight(Director AI Compilers at NVIDIA; Cofounder of OctoML), and
7 more.

lingua by facebookresearch

0.0%
5k
LLM research codebase for training and inference
Created 1 year ago
Updated 5 months ago
Feedback? Help us improve.