Comfyui-QwenEditUtils  by lrzjason

Advanced Qwen image editing utilities for ComfyUI

Created 1 month ago
331 stars

Top 82.6% on SourcePulse

GitHubView on GitHub
Project Summary

Comfyui-QwenEditUtils provides specialized ComfyUI nodes designed to enhance Qwen-based image editing workflows. It enables users to leverage multiple reference images for more controlled and nuanced image generation, particularly for tasks like outfit transfer. The primary benefit is increased flexibility and precision in conditioning image generation models using visual and textual prompts.

How It Works

The utility comprises two main nodes: TextEncodeQwenImageEditPlus and TextEncodeQwenImageEditPlusAdvance. These nodes encode text prompts using CLIP, integrating up to five reference images. The core approach involves separate processing and resizing of reference images for VAE encoding (typically 1024x1024) and VL (Vision-Language) encoding (typically 384x384), allowing for optimized latent space representation and detailed visual conditioning. The advanced node offers finer control over these resizing behaviors, target dimensions, and image processing methods like upscaling and cropping.

Quick Start & Requirements

Installation involves cloning the repository into the ComfyUI custom_nodes directory. After restarting ComfyUI, the nodes will be available under the "advanced/conditioning" category. No specific hardware prerequisites beyond a standard ComfyUI setup are mentioned, though Qwen models and ComfyUI itself imply a capable GPU is necessary for practical use.

Highlighted Details

  • Supports multi-image conditioning (up to 5 for the basic node, 3 for the advanced).
  • Offers distinct resizing strategies for VAE and VL encoding pipelines.
  • Provides individual image outputs for processed reference images, enabling flexible workflow connections.
  • Integrates with Qwen models and supports custom Llama templates for detailed instructions.

Maintenance & Community

The project is maintained by 小志Jason (xiaozhijason), with contact provided via Twitter (@Lrzjason), Email (lrzjason@gmail.com), QQ Group (866612947), Wechatid (fkdeai), and Civitai. Sponsorship is encouraged via "Buy me a coffee."

Licensing & Compatibility

The provided README does not specify a software license. Users should exercise caution regarding usage rights, particularly for commercial applications, until a license is clarified. Compatibility is tied to ComfyUI and Qwen model ecosystems.

Limitations & Caveats

The absence of a declared license is a significant adoption blocker. While the nodes are designed for Qwen models, their compatibility with other architectures is not specified. The README focuses on functionality, with limited details on performance benchmarks or potential edge-case bugs.

Health Check
Last Commit

1 week ago

Responsiveness

Inactive

Pull Requests (30d)
1
Issues (30d)
10
Star History
314 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.