TALLRec  by SAI990323

LLM tuning for recommendation systems

created 2 years ago
254 stars

Top 99.1% on SourcePulse

GitHubView on GitHub
Project Summary

TALLRec is a framework for adapting Large Language Models (LLMs) to recommendation tasks, offering an efficient and effective approach for instruction tuning. It targets researchers and practitioners seeking to leverage LLMs for personalized recommendations, demonstrating significant performance improvements over existing methods.

How It Works

TALLRec employs instruction tuning to align LLMs with recommendation tasks, enabling them to understand and generate recommendations based on user preferences and item information. This approach allows for efficient fine-tuning of large models, making them more accessible and practical for recommendation systems.

Quick Start & Requirements

  • Install/Run: Execute shell scripts provided in the shell directory (e.g., ./shell/instruct_7B.sh for training, ./shell/evaluate.sh for evaluation).
  • Prerequisites: LLaMA model weights (Huggingface format), CUDA 12.0 recommended, Python environment.
  • Configuration: Modify shell scripts to specify output_dir, base_model, train_data, val_data, and test_data paths.
  • Resources: Requires GPU for training and inference. Specific resource requirements depend on model size and dataset.
  • References: Paper available at https://arxiv.org/abs/2305.00447.

Highlighted Details

  • Achieves AUC scores significantly higher than baseline models (e.g., 71.98 on movie, 64.38 on book datasets with TALLRec vs. ~54 for baselines).
  • Built upon the Alpaca_lora repository, leveraging its efficient fine-tuning techniques.
  • Supports instruction tuning for LLMs like LLaMA 7B.
  • Evaluates performance across different few-shot settings (16, 64, 256).

Maintenance & Community

The project is based on Alpaca_lora. No specific community channels or roadmap are mentioned in the README.

Licensing & Compatibility

The README does not explicitly state the license. Compatibility for commercial or closed-source use is not specified.

Limitations & Caveats

The project is presented as a research framework, and specific details regarding production readiness, broader model compatibility beyond LLaMA, or comprehensive error handling are not detailed. The README mentions a corrected evaluate.py file, suggesting potential for ongoing development and changes.

Health Check
Last commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
6 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.