Discover and explore top open-source AI tools and projects—updated daily.
CY-CHENYUEComfyUI node for Google Gemini API image generation
Top 99.6% on SourcePulse
This ComfyUI custom node integrates Google's Gemini API, enabling users to leverage advanced AI image generation and manipulation directly within the ComfyUI workflow. It targets ComfyUI users seeking to incorporate powerful, multimodal AI capabilities for creative image synthesis, style transfer, and concept art generation without leaving their preferred environment. The primary benefit is seamless access to Gemini's generative power through a familiar node-based interface.
How It Works
The project provides a dedicated "Gemini 2.0 image" node for ComfyUI. This node acts as an interface to the Google Gemini API, allowing users to send text prompts and optional reference images to generate new visuals. It supports various parameters like model selection, aspect ratio, temperature for controlling creativity, and a random seed for reproducibility. A key feature is its ability to accept multiple reference images, which are used collectively by the API to inform style and content, with the node automatically adjusting the prompt to reflect this multi-image input.
Quick Start & Requirements
ComfyUI/custom_nodes directory and install dependencies via requirements.txt using the appropriate Python interpreter (python.exe -m pip install -r requirements.txt).Highlighted Details
temperature parameter (0.0-2.0) to balance determinism and creativity.aspect_ratio selection (free, landscape, portrait, square).Maintenance & Community
The primary contact and developer is reachable via X (Twitter) at @cychenyue, TikTok at @cychenyue, YouTube at @CY-CHENYUE, BiliBili at @CY-CHENYUE, and 小红书 at @CY-CHENYUE.
Licensing & Compatibility
The license is not explicitly stated in the provided README. Users should verify licensing terms before integrating into commercial or closed-source projects.
Limitations & Caveats
API usage may incur costs or be subject to Google's usage limits. Image generation quality and speed are dependent on Google's server status and network connectivity. Sending reference images to the API has privacy implications. The Gemini API may not always strictly adhere to the requested aspect ratio.
1 year ago
Inactive
afiaka87
markfulton
nerdyrodent
QwenLM