OneRec-Think  by wangshy31

Unified framework for LLM-powered dialogue, reasoning, and personalized recommendation

Created 6 months ago
253 stars

Top 99.4% on SourcePulse

GitHubView on GitHub
Project Summary

This framework addresses the limitations of conventional recommendation systems by integrating large language models (LLMs) with explicit reasoning and conversational capabilities. It targets researchers and practitioners seeking to enhance recommendation precision and user engagement through LLM-driven dialogue and reasoning. The primary benefit is improved recommendation accuracy while maintaining native conversational abilities.

How It Works

OneRec-Think employs a three-component architecture. Itemic Alignment projects item tokens into the LLM's textual space for semantic grounding. Reasoning Activation stimulates reasoning capabilities using Chain-of-Thought (CoT) fine-tuning examples tailored for recommendation contexts. Reasoning Enhancement refines the process with a recommendation-specific reward function that accommodates the multi-validity of user preferences.

Quick Start & Requirements

Primary installation involves running environment setup scripts (setup_conda_env.sh). Key requirements include downloading the Qwen3-1.7B base model from Hugging Face and extending its vocabulary. The process entails generating alignment training data, performing LoRA-based alignment fine-tuning, merging model checkpoints, preparing recommendation training corpora, and executing combined recommendation and CoT training pipelines. Evaluation scripts (eval_parallel_8gpu.sh, eval_parallel_8gpu_cot.sh) suggest multi-GPU support is beneficial. Official quick-start, docs, or demo links are not provided.

Highlighted Details

  • Achieved a 0.159% online gain in APP Stay Time on an industrial-scale short-video platform.
  • Extensive case studies provide qualitative evidence for the role of reasoning in recommendation.

Maintenance & Community

No specific details on contributors, sponsorships, community channels (Discord/Slack), or roadmaps are present in the README.

Licensing & Compatibility

The license type and compatibility notes for commercial use are not specified in the README.

Limitations & Caveats

The setup process is multi-stage and requires careful execution of several scripts and model downloads, indicating a complex integration. No explicit limitations or alpha/beta status are mentioned.

Health Check
Last Commit

5 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
1
Star History
14 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.