Discover and explore top open-source AI tools and projects—updated daily.
Image editing framework with multi-image references
Top 73.0% on SourcePulse
This repository provides Omini Kontext, a framework for multi-image reference-based image editing and generation, built upon the Flux.1-Kontext-dev model. It enables users to perform tasks like spatial character insertion into existing scenes by leveraging 3D RoPE embeddings, offering a novel approach to reference-based image manipulation for researchers and AI artists.
How It Works
Omini Kontext modifies the Flux.1-Kontext-dev model by implementing 3D RoPE embeddings instead of the original 2D embeddings. This technique, inspired by the OminiControl project, allows for reference-based image generation and editing. The framework supports LoRA integration for fine-tuning specific tasks, such as character or product insertion, with adjustable reference_delta
parameters to control placement.
Quick Start & Requirements
pip install -r requirements.txt
. For ComfyUI integration, clone the repo into ComfyUI/custom_nodes
.Highlighted Details
reference_delta
.Maintenance & Community
The project welcomes community contributions. Discussions and support are primarily handled through GitHub Issues and Discussions.
Licensing & Compatibility
Licensed under the Apache License 2.0, allowing for commercial use and integration with closed-source projects.
Limitations & Caveats
The current character insertion examples are described as Proofs of Concept (POCs) with plans for more robust models trained on larger datasets. Some results may not be optimal, and users might need to adjust image resolutions for better scaling. The project aims to extend functionality to multiple references and Qwen-Image-Edit support.
2 weeks ago
Inactive