Merlin by NVIDIA-Merlin

Open-source library for GPU-accelerated recommender systems

Created 4 years ago

880 stars

Top 40.7% on SourcePulse

2 Experts Love This Project

Project Summary

NVIDIA Merlin is an open-source library designed to accelerate the entire lifecycle of recommender systems, from data preprocessing to model training and production inference, specifically leveraging NVIDIA GPUs. It targets data scientists, ML engineers, and researchers building high-performance recommenders at scale, offering end-to-end capabilities for handling terabyte-sized datasets.

How It Works

Merlin is a modular ecosystem built on RAPIDS cuDF and Dask for GPU-accelerated data manipulation and distributed computing. Its core components are NVTabular for feature engineering, HugeCTR for scalable deep learning model training with distributed embeddings, Merlin Models for standardized model architectures, Transformers4Rec for sequential recommendations, and Merlin Systems for production deployment via Triton Inference Server. Because each component covers one stage of the recommendation pipeline, they can be adopted individually or chained together from raw data through to serving.
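
As a rough illustration of the feature-engineering stage, the sketch below shows how an NVTabular workflow is typically assembled: columns are piped into operators with `>>`, the workflow is fitted to collect statistics, and the transformed data is written to Parquet. The column names and file paths are hypothetical, and the code assumes NVTabular is installed in a GPU environment such as one of the NGC containers. Treat it as a starting point, not as the project's reference pipeline.

```python
# Hypothetical column names, chosen only for illustration.
CATEGORICAL_COLUMNS = ["user_id", "item_id"]
CONTINUOUS_COLUMNS = ["price"]


def build_workflow():
    """Assemble an NVTabular workflow for the hypothetical columns above."""
    # Imported lazily so this module can be loaded on machines without a GPU.
    import nvtabular as nvt
    from nvtabular import ops

    # Categorify maps each categorical value to a contiguous integer id.
    cat_features = CATEGORICAL_COLUMNS >> ops.Categorify()
    # FillMissing replaces nulls; Normalize standardizes by mean and std.
    cont_features = CONTINUOUS_COLUMNS >> ops.FillMissing() >> ops.Normalize()

    return nvt.Workflow(cat_features + cont_features)


def preprocess(input_path, output_path):
    """Fit the workflow on a Parquet dataset and write the transformed data."""
    import nvtabular as nvt

    workflow = build_workflow()
    dataset = nvt.Dataset(input_path, engine="parquet")

    workflow.fit(dataset)  # collects category mappings and column statistics
    workflow.transform(dataset).to_parquet(output_path)
    return workflow
```

Calling `preprocess("train.parquet", "processed/")` (both paths are placeholders) would fit the workflow and write the transformed dataset, which the training components can then consume.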

Quick Start & Requirements

Installation: The simplest method is via NVIDIA GPU Cloud (NGC) containers. Component-specific installation via conda or pip is also supported.
Prerequisites: NVIDIA GPUs are required. The CUDA version is determined by whichever NGC container you use.
Resources: Handling terabyte-scale datasets implies significant storage and GPU memory requirements.
Links:
- NGC Containers: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/merlin
- Example Notebooks: https://github.com/NVIDIA-Merlin/Merlin/tree/main/examples

Highlighted Details

End-to-end GPU acceleration for recommender systems.
Scales embedding tables beyond GPU/CPU memory limits.
Integrates with TensorFlow, PyTorch, FastAI, and Triton Inference Server.
Supports sequential and session-based recommendation models.
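
To see why embedding tables can outgrow a single device, a back-of-the-envelope size estimate helps. The sketch below is plain arithmetic, not Merlin code: it assumes 32-bit floats (4 bytes per value), and the table shape and the 80 GB device capacity are hypothetical figures chosen for illustration.

```python
def embedding_table_bytes(num_rows: int, embedding_dim: int,
                          bytes_per_value: int = 4) -> int:
    """Memory needed to store one dense embedding table (fp32 by default)."""
    return num_rows * embedding_dim * bytes_per_value


def fits_in_memory(table_bytes: int, memory_bytes: int) -> bool:
    """True if the table alone fits in the given memory budget."""
    return table_bytes <= memory_bytes


if __name__ == "__main__":
    # Hypothetical table: one billion ids, 128-dimensional embeddings.
    size = embedding_table_bytes(1_000_000_000, 128)
    print(f"{size / 1e9:.0f} GB")              # prints "512 GB"
    # Hypothetical device with 80 GB of memory.
    print(fits_in_memory(size, 80 * 10**9))    # prints "False"
```

A table of this size exceeds the assumed device budget several times over, which is the situation that distributing embeddings across multiple devices is meant to address.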

Maintenance & Community

Developed and maintained by NVIDIA.
Bug reporting and support via GitHub Issues.

Licensing & Compatibility

Apache 2.0 License.
Compatible with commercial use and closed-source linking.

Limitations & Caveats

The library depends heavily on NVIDIA hardware and CUDA. Although it is modular, integrating custom components or non-standard workflows may require a deeper understanding of the underlying libraries.

Health Check

Last Commit

1 year ago

Responsiveness

1 week

Star History

9 stars in the last 30 days