RLHF for text generation models (GPT, BLOOM, T5)
This library implements Reinforcement Learning from Human Feedback (RLHF) for text generation models built on Hugging Face Transformers. It lets users fine-tune models such as GPT-2, FLAN-T5, and BLOOM with RL techniques, improving control over and quality of the generated text. The target audience includes researchers and developers looking to take text generation beyond standard supervised fine-tuning.
How It Works
TextRL integrates Hugging Face Transformers, PFRL, and OpenAI Gym to facilitate RL fine-tuning. It frames text generation as a sequential decision-making problem in which the model learns a policy that emits text token by token. A user-defined reward function guides the learning process, allowing optimization to be tailored to criteria such as sentiment, coherence, or adherence to a style; a minimal sketch of such a reward follows.
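For example, a custom reward can be supplied by subclassing TextRLEnv and overriding its get_reward hook, which is the pattern shown in the project README. The sketch below assumes that interface (input_item, predicted_list, finish); the checkpoint name and the keyword-based scoring rule are placeholders, not part of the library.

```python
from transformers import AutoTokenizer
from textrl import TextRLEnv

# Placeholder tokenizer; use the same checkpoint as the model being fine-tuned.
tokenizer = AutoTokenizer.from_pretrained("gpt2")

class MyRLEnv(TextRLEnv):
    def get_reward(self, input_item, predicted_list, finish):
        # Called during generation; predicted_list holds the tokens
        # produced so far for each sampled candidate.
        reward = [0]
        if finish:
            # Hypothetical criterion: reward finished generations that
            # contain the word "good", penalize the rest.
            predicted_text = tokenizer.convert_tokens_to_string(predicted_list[0])
            reward = [1.0 if "good" in predicted_text else -1.0]
        return reward
```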
Quick Start & Requirements
Install the patched PFRL fork, then TextRL:

pip install pfrl@git+https://github.com/voidful/pfrl.git
pip install textrl
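Once both packages are installed, a typical run wires a Hugging Face model into the environment and trains it with PFRL's PPO agent, following the TextRLEnv / TextRLActor pattern from the project's README and Colab examples. The sketch below is illustrative only; the checkpoint, prompt, reward rule, and hyperparameter values are placeholders that may need adjusting to the installed version.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from textrl import TextRLActor, TextRLEnv, train_agent_with_evaluation

checkpoint = "gpt2"  # placeholder; any causal LM from the Hub should work
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)
if torch.cuda.is_available():
    model = model.cuda()

class MyRLEnv(TextRLEnv):
    def get_reward(self, input_item, predicted_list, finish):
        # Hypothetical reward: longer completions score higher once finished.
        return [len(predicted_list[0])] if finish else [0]

observation_list = [{"input": "explain how deep learning works:"}]
env = MyRLEnv(model, tokenizer, observation_input=observation_list, max_length=20)
actor = TextRLActor(env, model, tokenizer)
agent = actor.agent_ppo(update_interval=10, minibatch_size=64, epochs=20)

train_agent_with_evaluation(
    agent,
    env,
    steps=1000,            # total environment steps (placeholder)
    eval_n_steps=None,     # evaluate by episodes rather than steps
    eval_n_episodes=10,
    eval_interval=100,
    outdir="textrl_ppo",   # checkpoints and logs are written here
)
```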
Maintenance & Community
The repository is maintained by voidful. Links to Colab examples are provided for quick experimentation.
Licensing & Compatibility
The library appears to be under a permissive license, but specific details are not explicitly stated in the README. Compatibility with commercial use depends on the licenses of the underlying Hugging Face models and PFRL.
Limitations & Caveats
The README strongly recommends contributing to public swarms (such as Petals) when working with larger models, which points to the resource constraints and distributed-computing requirements of running very large models. The effectiveness of RLHF also depends heavily on the design of the reward function, which can be difficult to get right.