ExpertLLaMA  by OFA-Sys

ChatBot for expert-like responses via expert identity augmentation

created 2 years ago
300 stars

Top 89.7% on sourcepulse

GitHubView on GitHub
Project Summary

ExpertLLaMA is an open-source chatbot designed to produce high-quality, expert-like responses by augmenting instructions with specialized "Expert Identity" descriptions. It targets researchers and developers seeking to enhance instruction-following capabilities in LLMs, offering a cost-effective alternative to models trained on more expensive data.

How It Works

The core innovation is "ExpertPrompting," which involves generating customized expert identity descriptions for specific instructions. These descriptions are then prepended to the original instructions, creating augmented training data. This approach aims to elicit better performance from LLMs by guiding them to adopt specific expert personas, leading to more elaborate and accurate responses compared to standard instruction tuning.

Quick Start & Requirements

  • Installation: Apply delta weights to official LLaMA weights using python3 apply_delta.py.
  • Prerequisites: Official LLaMA model weights (7B), Python 3.x.
  • Inference: Requires approximately 15GB memory (fp16).
  • Demo: Live demo available at Hugging Face Space.

Highlighted Details

  • Achieves 96% of ChatGPT's capability based on GPT-4 evaluation.
  • Outperforms models like Vicuna and LLaMA-GPT4.
  • Trained on 52k instruction-following expert data generated by GPT-3.5-turbo.
  • Delta weights are released to comply with LLaMA license.

Maintenance & Community

  • Initial release in May 2023.
  • Paper available: "ExpertPrompting: Instructing Large Language Models to be Distinguished Experts."

Licensing & Compatibility

  • Data is licensed under CC BY-NC 4.0 (non-commercial use only).
  • Models trained using the dataset are restricted to research purposes.

Limitations & Caveats

The dataset and models are strictly for research use and cannot be employed for commercial applications due to the CC BY-NC 4.0 license.

Health Check
Last commit

2 years ago

Responsiveness

1+ week

Pull Requests (30d)
0
Issues (30d)
0
Star History
0 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Ying Sheng Ying Sheng(Author of SGLang), and
9 more.

alpaca-lora by tloen

0.0%
19k
LoRA fine-tuning for LLaMA
created 2 years ago
updated 1 year ago
Starred by Andrej Karpathy Andrej Karpathy(Founder of Eureka Labs; Formerly at Tesla, OpenAI; Author of CS 231n), John Yang John Yang(Author of SWE-bench, SWE-agent), and
13 more.

stanford_alpaca by tatsu-lab

0.1%
30k
Instruction-following LLaMA model training and data generation
created 2 years ago
updated 1 year ago
Feedback? Help us improve.