Merlin  by NVIDIA-Merlin

Open-source library for GPU-accelerated recommender systems

created 4 years ago
843 stars

Top 43.1% on sourcepulse

GitHubView on GitHub
Project Summary

NVIDIA Merlin is an open-source library designed to accelerate the entire lifecycle of recommender systems, from data preprocessing to model training and production inference, specifically leveraging NVIDIA GPUs. It targets data scientists, ML engineers, and researchers building high-performance recommenders at scale, offering end-to-end capabilities for handling terabyte-sized datasets.

How It Works

Merlin is a modular ecosystem built on RAPIDS cuDF and Dask for GPU-accelerated data manipulation and distributed computing. Its core components include NVTabular for feature engineering, HugeCTR for scalable deep learning model training with distributed embeddings, Merlin Models for standardized model architectures, Transformers4Rec for sequential recommendations, and Merlin Systems for production deployment via Triton Inference Server. This layered approach allows for seamless integration and optimization across the recommendation pipeline.

Quick Start & Requirements

Highlighted Details

  • End-to-end GPU acceleration for recommender systems.
  • Scales embedding tables beyond GPU/CPU memory limits.
  • Integrates with TensorFlow, PyTorch, FastAI, and Triton Inference Server.
  • Supports sequential and session-based recommendation models.

Maintenance & Community

  • Developed and maintained by NVIDIA.
  • Bug reporting and support via GitHub Issues.

Licensing & Compatibility

  • Apache 2.0 License.
  • Compatible with commercial use and closed-source linking.

Limitations & Caveats

The library is heavily reliant on NVIDIA hardware and CUDA. While modular, integrating custom components or non-standard workflows may require deeper understanding of the underlying libraries.

Health Check
Last commit

8 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
1
Star History
22 stars in the last 90 days

Explore Similar Projects

Starred by Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera) and Stas Bekman Stas Bekman(Author of Machine Learning Engineering Open Book; Research Engineer at Snowflake).

InternEvo by InternLM

1.0%
402
Lightweight training framework for model pre-training
created 1 year ago
updated 1 week ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Omar Sanseviero Omar Sanseviero(DevRel at Google DeepMind), and
5 more.

TensorRT-LLM by NVIDIA

0.6%
11k
LLM inference optimization SDK for NVIDIA GPUs
created 1 year ago
updated 17 hours ago
Feedback? Help us improve.