Discover and explore top open-source AI tools and projects—updated daily.
Generative recommendation with semantic IDs
Top 88.8% on SourcePulse
GRID (Generative Recommendation with Semantic IDs) is a framework for generative recommendation systems that leverages semantic IDs derived from item text embeddings. It is designed for researchers and engineers working on recommendation systems, offering a novel approach to generating recommendations by converting item text into embeddings, learning hierarchical semantic IDs from these embeddings, and then using transformer models to generate recommendation sequences as these semantic ID tokens.
How It Works
GRID's approach involves three main steps: 1) Embedding Generation: Item text is converted into embeddings using any Huggingface-compatible LLM. 2) Semantic ID Learning: These item embeddings are then transformed into hierarchical semantic IDs using techniques like Residual Quantization (RQ-KMeans, RQ-VAE, RVQ). 3) Generative Recommendations: Transformer architectures are employed to generate recommendation sequences as sequences of these learned semantic ID tokens. This method aims to improve recommendation quality by capturing semantic relationships in a structured, tokenized format.
Quick Start & Requirements
pip install -r requirements.txt
.train/
, validation/
, test/
) and item text (items/
). Pre-processed Amazon data is available for download.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
num_hierarchies
needs to be incremented by one for the generative recommendation model training and inference steps due to an appended digit for de-duplication, which could be a point of confusion.1 week ago
Inactive