Merlin  by NVIDIA-Merlin

Open-source library for GPU-accelerated recommender systems

Created 4 years ago
849 stars

Top 42.1% on SourcePulse

GitHubView on GitHub
Project Summary

NVIDIA Merlin is an open-source library designed to accelerate the entire lifecycle of recommender systems, from data preprocessing to model training and production inference, specifically leveraging NVIDIA GPUs. It targets data scientists, ML engineers, and researchers building high-performance recommenders at scale, offering end-to-end capabilities for handling terabyte-sized datasets.

How It Works

Merlin is a modular ecosystem built on RAPIDS cuDF and Dask for GPU-accelerated data manipulation and distributed computing. Its core components include NVTabular for feature engineering, HugeCTR for scalable deep learning model training with distributed embeddings, Merlin Models for standardized model architectures, Transformers4Rec for sequential recommendations, and Merlin Systems for production deployment via Triton Inference Server. This layered approach allows for seamless integration and optimization across the recommendation pipeline.

Quick Start & Requirements

Highlighted Details

  • End-to-end GPU acceleration for recommender systems.
  • Scales embedding tables beyond GPU/CPU memory limits.
  • Integrates with TensorFlow, PyTorch, FastAI, and Triton Inference Server.
  • Supports sequential and session-based recommendation models.

Maintenance & Community

  • Developed and maintained by NVIDIA.
  • Bug reporting and support via GitHub Issues.

Licensing & Compatibility

  • Apache 2.0 License.
  • Compatible with commercial use and closed-source linking.

Limitations & Caveats

The library is heavily reliant on NVIDIA hardware and CUDA. While modular, integrating custom components or non-standard workflows may require deeper understanding of the underlying libraries.

Health Check
Last Commit

9 months ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
7 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
2 more.

towhee by towhee-io

0.0%
3k
Framework for neural data processing pipelines
Created 4 years ago
Updated 11 months ago
Starred by George Hotz George Hotz(Author of tinygrad; Founder of the tiny corp, comma.ai), Casper Hansen Casper Hansen(Author of AutoAWQ), and
1 more.

GPT2 by ConnorJL

0%
1k
GPT2 training implementation, supporting TPUs and GPUs
Created 6 years ago
Updated 2 years ago
Feedback? Help us improve.