Discover and explore top open-source AI tools and projects—updated daily.
Dataset for instruction-based image editing
Top 97.7% on SourcePulse
UltraEdit is a large-scale dataset and framework for instruction-based image editing, targeting researchers and developers in generative AI. It addresses limitations in existing datasets by offering a broader range of editing instructions, utilizing real-world images for greater diversity, and supporting region-based editing, thereby enabling state-of-the-art performance on image editing benchmarks.
How It Works
UltraEdit leverages large language models (LLMs) and human-curated examples to generate diverse editing instructions. It incorporates real photographs and artworks as data anchors, reducing bias compared to purely synthetic datasets. The framework supports region-based editing through automatically generated, high-quality masks, enhancing fine-grained control. This approach aims to produce massive, high-quality image editing samples for training diffusion models.
Quick Start & Requirements
pip install -r requirements
and pip install -e .
within the diffusers
directory.diffusers
library.StableDiffusion3InstructPix2PixPipeline
.Highlighted Details
Maintenance & Community
No specific community channels (Discord/Slack) or roadmap are mentioned in the README. The project is associated with authors from various institutions, indicating academic backing.
Licensing & Compatibility
The README does not explicitly state a license. The code is presented as part of an academic research effort, and usage for commercial purposes would require clarification of licensing terms.
Limitations & Caveats
The project appears to be research-oriented, and the dataset generation process relies heavily on LLMs, which may introduce subtle biases or artifacts. Specific hardware requirements for training large models are not detailed beyond the need for GPUs.
1 year ago
Inactive