RoleLLM-public  by InteractiveNLP-Team

Framework for benchmarking and enhancing LLM role-playing

created 1 year ago
503 stars

Top 61.7% on SourcePulse

GitHubView on GitHub
Project Summary

RoleLLM is a framework designed to benchmark, elicit, and enhance the role-playing capabilities of Large Language Models (LLMs). It addresses the limitations of closed-source models and general-purpose training for character imitation, offering a systematic approach for researchers and developers focused on conversational AI and character-driven applications.

How It Works

The RoleLLM framework consists of four stages: role profile construction, context-based instruction generation (Context-Instruct) for knowledge extraction, role prompting using GPT (RoleGPT) for speaking style imitation, and role-conditioned instruction tuning (RoCIT) for fine-tuning open-source models. This multi-stage approach allows for both high-quality role-playing with proprietary models (via RoleGPT) and the enhancement of open-source alternatives (via RoCIT on RoleBench data).

Quick Start & Requirements

  • The RoleBench dataset, comprising 168,093 samples across 100 roles, is available.
  • The framework supports both English and Chinese languages.
  • Integration with OpenCompass for evaluation is provided.
  • Specific commands for installation and usage are not detailed in the README.

Highlighted Details

  • Introduces RoleBench, the first systematic, fine-grained character-level benchmark for role-playing.
  • Developed RoleLLaMA (English) and RoleGLM (Chinese) models through RoCIT, achieving enhanced role-playing abilities.
  • Context-Instruct and RoleGPT are key components for data generation and style imitation.
  • The framework supports a wide array of 100 diverse roles, including historical figures, fictional characters, and public personalities.

Maintenance & Community

  • The project is associated with the InteractiveNLP-Team.
  • The paper detailing RoleLLM was released in October 2023.
  • RoleBench was integrated into OpenCompass in December 2023.

Licensing & Compatibility

  • The README does not explicitly state the license type.
  • No specific compatibility notes for commercial or closed-source use are provided.

Limitations & Caveats

The README does not provide specific installation instructions or details on the technical requirements for running the framework or fine-tuning models. The licensing status is also not clearly defined, which may impact commercial adoption.

Health Check
Last commit

10 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
1 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Didier Lopes Didier Lopes(Founder of OpenBB), and
1 more.

leon by leon-ai

0.2%
17k
Open-source personal assistant to self-host
created 6 years ago
updated 1 week ago
Feedback? Help us improve.