NVIDIA: Optimized recommender system examples for accelerated training and inference
Top 95.1% on SourcePulse
This project provides recommender system examples optimized for NVIDIA accelerated infrastructure, with components for large-scale recommendation tasks that are easy to train and deploy. It targets researchers and engineers who need high-performance solutions for ranking, retrieval, and dynamic embedding management on NVIDIA hardware.
How It Works
This project builds on TorchRec (Meta's PyTorch library for recommender systems) and NVIDIA's Megatron-Core for scalable training of HSTU (Hierarchical Sequential Transduction Unit) ranking and retrieval models, as well as semantic-ID-based retrieval. Inference is optimized with a paged KV cache, Triton Inference Server integration, CUDA graphs, and C++ deployment via AOTInductor. DynamicEmb provides parallelized dynamic embedding tables with zero-collision hashing, eviction policies, admission control, and table fusion for efficient parameter management.
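To make the dynamic embedding ideas above more concrete, here is a minimal pure-Python sketch of a table that grows on demand, admits a key only after it has been seen a few times, and evicts the least recently used row when full. It is an illustration only and does not use or mirror the DynamicEmb API: the class `DynamicEmbeddingTable` and the parameters `capacity` and `admit_after` are hypothetical names, and LRU plus a frequency threshold are assumed stand-ins for whatever eviction and admission policies the project actually implements.

```python
import random
from collections import OrderedDict


class DynamicEmbeddingTable:
    """Toy dynamic embedding table (hypothetical, not the DynamicEmb API).

    Each admitted key owns its own row, so two keys never share a slot.
    Admission: a key gets a row only after `admit_after` lookups.
    Eviction: when the table is full, the least recently used row is dropped.
    """

    def __init__(self, dim, capacity, admit_after=2, seed=0):
        self.dim = dim
        self.capacity = capacity
        self.admit_after = admit_after
        self._rng = random.Random(seed)
        self._rows = OrderedDict()  # key -> vector, oldest use first
        self._seen = {}             # key -> lookups so far (unbounded in this toy)

    def __contains__(self, key):
        return key in self._rows

    def __len__(self):
        return len(self._rows)

    def lookup(self, key):
        # Hit: mark the row as most recently used and return it.
        if key in self._rows:
            self._rows.move_to_end(key)
            return self._rows[key]

        # Miss: count the sighting and decide whether to admit the key.
        count = self._seen.get(key, 0) + 1
        if count < self.admit_after:
            self._seen[key] = count
            return [0.0] * self.dim  # default vector for non-admitted keys

        # Admit: evict the least recently used row if the table is full.
        self._seen.pop(key, None)
        if len(self._rows) >= self.capacity:
            self._rows.popitem(last=False)
        row = [self._rng.uniform(-0.01, 0.01) for _ in range(self.dim)]
        self._rows[key] = row
        return row


table = DynamicEmbeddingTable(dim=4, capacity=2, admit_after=2)
for item_id in ["a", "a", "b", "b", "a", "c", "c"]:
    table.lookup(item_id)
# "b" was the least recently used row when "c" was admitted, so it is gone.
print(sorted(k for k in ["a", "b", "c"] if k in table))
```

In a real system the rows would live in GPU memory and be sharded across devices; the sketch only shows how admission and eviction keep the number of stored rows bounded while the key space stays open-ended.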
Quick Start & Requirements
Highlighted Details
Maintenance & Community
Active development is indicated by frequent releases (e.g., v26.03, v26.01). Community interaction is facilitated via GitHub Issues for bug reports and feature requests, and NVIDIA Developer Forums. Resources include videos and blogs detailing optimization practices.
Licensing & Compatibility
This project is licensed under the Apache License 2.0. This license is generally permissive and compatible with commercial use and closed-source linking.
Limitations & Caveats
The project targets NVIDIA hardware, which suggests a strong dependency on specific GPU architectures and CUDA versions. The overview does not give detailed installation instructions or performance benchmarks for every configuration, so setup may be complex. The collection consists of examples, so users must integrate the components into their own workflows.