rome by kmeng01

Model editing research paper for GPT-2 and GPT-J

Created 3 years ago

717 stars

Top 48.0% on SourcePulse

4 Experts Love This Project

hiyouga

Author of LLaMA-Factory

shizhediao

Author of LMFlow; Research Scientist at NVIDIA

evhub

Head of Alignment Stress-Testing at Anthropic

stellaathena

Stella Rose Biderman

Executive Director at EleutherAI

Project Summary

This repository provides an implementation of Rank-One Model Editing (ROME) for efficiently locating and editing factual associations within large auto-regressive transformer models. It is targeted at researchers and practitioners in NLP and AI safety who need to modify factual knowledge in pre-trained language models without full retraining.

How It Works

ROME operates by identifying and modifying a low-rank update to the model's weight matrices that specifically targets factual associations. This approach leverages causal tracing to pinpoint the relevant components within the transformer's layers and then applies a rank-one update, making the editing process computationally efficient and precise.

Quick Start & Requirements

Install via bash ./scripts/setup_conda.sh.
Requires Conda for dependency management, PyTorch, and CUDA.
Supports OpenAI's GPT-2 XL (1.5B) and EleutherAI's GPT-J (6B).

Highlighted Details

Implements Rank-One Model Editing (ROME) for factual association modification.
Includes notebooks for demonstrating Causal Tracing and ROME.
Provides scripts for running the full evaluation suite against baselines.
Offers a framework for integrating and benchmarking new editing methods.

Maintenance & Community

Actively developed with close monitoring of issues.
Paper published at NeurIPS 2022.

Licensing & Compatibility

License details are not explicitly stated in the README, but the project is open-source.
Currently supports methods editing autoregressive HuggingFace models using the PyTorch backend.

Limitations & Caveats

Primarily GPU-only.
Cross-platform compatibility is limited to PyTorch/HuggingFace models, with broader support planned.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

12 stars in the last 30 days

Explore Similar Projects

Starred by

Ishaan Jaffer

Ishaan Jaffer(Cofounder of LiteLLM).

GPT-Fathom by GPT-Fathom

LLM evaluation suite for open/closed-source models, reproducible research

Created 2 years ago

Updated 1 year ago

Starred by

Shizhe Diao

Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA).

marc by ekinakyurek

Research paper implementation for abstract reasoning via test-time training

Created 1 year ago

Updated 2 months ago

Starred by

Yaowei Zheng

Yaowei Zheng(Author of LLaMA-Factory) and

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera).

mend by eric-mitchell

Fast model editing for LLMs

Created 4 years ago

Updated 2 years ago

AlphaEdit by jianghoucheng

Knowledge editing via null-space projection

Created 1 year ago

Updated 2 months ago

fmeval by aws

Evaluate foundation models for various NLP tasks

Created 2 years ago

Updated 5 months ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"),

Pawel Garbacki

Pawel Garbacki(Cofounder of Fireworks AI), and

4 more.

alpaca_farm by tatsu-lab

RLHF simulation framework for accessible instruction-following/alignment research

Created 2 years ago

Updated 1 year ago

Starred by

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera),

Stella Rose Biderman

Stella Rose Biderman(Executive Director at EleutherAI), and

2 more.

nnsight by ndif-team

SDK for interpreting/manipulating deep model internals

Created 2 years ago

Updated 2 days ago

mwp_ReFT by lqtrung1998

Research paper code for reasoning with reinforced fine-tuning (ReFT)

Created 2 years ago

Updated 1 year ago

Starred by

Shyamal Anadkat

Shyamal Anadkat(Research Scientist at OpenAI) and

Yaowei Zheng

Yaowei Zheng(Author of LLaMA-Factory).

KnowledgeEditingPapers by zjunlp

Curated list of must-read research papers on knowledge editing for LLMs

Created 3 years ago

Updated 6 months ago

Starred by

Chip Huyen

Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"),

Shizhe Diao

Shizhe Diao(Author of LMFlow; Research Scientist at NVIDIA), and

1 more.

FastEdit by hiyouga

Tool for fast edits to large language models

Created 2 years ago

Updated 2 years ago

Starred by

Jeremy Howard

Jeremy Howard(Cofounder of fast.ai),

Jeff Hammerbacher

Jeff Hammerbacher(Cofounder of Cloudera), and

4 more.

chain-of-thought-hub by FranxYao

LLM benchmark for complex reasoning via chain-of-thought prompting

Created 2 years ago

Updated 1 year ago

Starred by

Eric Ciarla

Eric Ciarla(Cofounder of Firecrawl) and

Maxime Labonne

Maxime Labonne(Head of Post-Training at Liquid AI).

EasyEdit by zjunlp

Framework for LLM knowledge editing (ACL 2024 paper)

Created 2 years ago

Updated 3 weeks ago

Feedback? Help us improve.