FireRed-Image-Edit by FireRedTeam

State-of-the-art image editing model

Created 2 weeks ago

476 stars

Top 64.3% on SourcePulse

Project Summary

FireRed-Image-Edit is a general-purpose image editing model designed for high-fidelity and consistent edits across various scenarios. It targets researchers and developers seeking advanced open-source image manipulation capabilities, offering leading performance in instruction following, image quality, and text style preservation.

How It Works

The model is built upon an open-source text-to-image foundation (currently Qwen-Image) and employs a novel training paradigm involving Pretraining, Supervised Fine-Tuning (SFT), and Reinforcement Learning (RL). This approach allows for native editing capabilities, ensuring accurate instruction following and visual coherence, and is designed to be backbone-agnostic for potential application to other text-to-image models.
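
The staged, backbone-agnostic idea described above can be pictured with a minimal sketch. Everything in it is hypothetical and for illustration only: `ToyBackbone`, `make_stage`, and `train` are invented names, the stages are placeholders that only record their label, and none of this reflects the project's actual training code.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class ToyBackbone:
    """Hypothetical stand-in for any text-to-image backbone."""
    name: str
    history: List[str] = field(default_factory=list)


# A stage takes a model and returns the (updated) model.
Stage = Callable[[ToyBackbone], ToyBackbone]


def make_stage(label: str) -> Stage:
    """Build a placeholder stage that only records that it ran."""
    def run(model: ToyBackbone) -> ToyBackbone:
        # A real stage would update model weights here; this sketch
        # just appends the stage label to the model's history.
        model.history.append(label)
        return model
    return run


# Stage order taken from the summary: Pretraining, then SFT, then RL.
PIPELINE: List[Stage] = [
    make_stage("pretraining"),
    make_stage("sft"),
    make_stage("rl"),
]


def train(model: ToyBackbone, stages: List[Stage] = PIPELINE) -> ToyBackbone:
    """Apply each stage in order; nothing here depends on the backbone type."""
    for stage in stages:
        model = stage(model)
    return model


if __name__ == "__main__":
    edited = train(ToyBackbone("qwen-image"))
    print(edited.name, edited.history)
```

Because `train` only relies on the stage interface, swapping `ToyBackbone("qwen-image")` for another backbone object leaves the pipeline unchanged, which is the sense of "backbone-agnostic" assumed in this sketch.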

Quick Start & Requirements

Highlighted Details

  • Reports state-of-the-art results among open-source models on the ImgEdit, GEdit, and REDEdit benchmarks.
  • Reports prompt following and visual consistency comparable to closed-source solutions.
  • Preserves text style with high fidelity and supports high-quality photo restoration.
  • Supports multi-image editing, such as virtual try-on scenarios.

Maintenance & Community

The project has released model weights and a technical report. Future releases are planned for a distilled version and a text-to-image foundation model. No explicit community channels (e.g., Discord, Slack) or active contributor details are provided in the README.

Licensing & Compatibility

  • License type: Apache 2.0.
  • Compatibility notes: Apache 2.0 permits commercial use, modification, and inclusion in closed-source products, provided the license's notice and attribution requirements are met.

Limitations & Caveats

The project is under active development, with several items listed as "To be released" in the TODO section, including a distilled model, the REDEdit-Bench dataset, and the core FireRed T2I foundation model. The Ethics Statement notes that the model has not been comprehensively evaluated for all downstream applications and prohibits using it to produce illegal, defamatory, pornographic, or otherwise harmful content.

Health Check

  • Last commit: 1 week ago
  • Responsiveness: Inactive
  • Pull requests (30d): 6
  • Issues (30d): 9
  • Star history: 498 stars in the last 14 days
