SEAL  by Continual-Intelligence

Framework for self-adapting language models

Created 3 months ago
789 stars

Top 44.5% on SourcePulse

GitHubView on GitHub
Project Summary

SEAL (Self-Adapting LLMs) is a framework for training language models to generate self-edits, such as finetuning data or update directives, in response to new inputs. It targets researchers and practitioners seeking to enable LLMs to continually learn and adapt to new information and tasks without manual intervention, demonstrated in general-knowledge incorporation and few-shot task adaptation.

How It Works

SEAL utilizes Reinforcement Learning (RL) to train language models to produce self-editing actions. This approach allows the model to learn a policy for generating updates based on new data, effectively creating a self-improving loop. The framework is designed to be flexible, supporting adaptation for both factual knowledge integration and few-shot learning scenarios.

Quick Start & Requirements

  • Install dependencies: pip install -r requirements.txt
  • Requires Python 3.12 and an OpenAI API key configured in a .env file.
  • Experiments are designed for 2x A100/H100 GPUs; other configurations may require adjustments.

Highlighted Details

  • Framework for training LLMs to generate self-edits via RL.
  • Explored in general-knowledge incorporation and few-shot adaptation domains.
  • Includes code, data, and documentation for both explored domains.

Maintenance & Community

The project is associated with MIT CSAIL and lists authors Adam Zweiger, Jyothish Pari, Han Guo, Ekin Akyürek, Yoon Kim, and Pulkit Agrawal.

Licensing & Compatibility

The repository does not explicitly state a license.

Limitations & Caveats

The setup and experimental configurations are optimized for specific hardware (2x A100/H100 GPUs) and may require significant refactoring for different setups. An OpenAI API key is a mandatory requirement for operation.

Health Check
Last Commit

1 month ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
2
Star History
33 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Wing Lian Wing Lian(Founder of Axolotl AI), and
10 more.

open_flamingo by mlfoundations

0.1%
4k
Open-source framework for training large multimodal models
Created 2 years ago
Updated 1 year ago
Feedback? Help us improve.