Docker image for prompt generation
Top 33.9% on sourcepulse
This project provides a Dockerized tool for generating prompts for AI image generation models like MidJourney and Stable Diffusion. It targets users who want to leverage AI to create better prompts, supporting both direct Chinese input and image-to-prompt translation, ultimately aiming to simplify and enhance the AI art creation workflow.
How It Works
The tool utilizes a pipeline of pre-trained models: a text-to-image prompt generator (succinctly/text2image-prompt-generator), a Chinese-to-English translator (Helsinki-NLP/opus-mt-zh-en), and a CLIP model (laion/CLIP-ViT-H-14-laion2B-s32B-b79K). Users can input prompts in Chinese, which are then translated and processed by the prompt generator to produce optimized prompts for image models. It also supports parsing prompts from existing images.
Quick Start & Requirements
Dockerfile.base
, Dockerfile.cpu
, and Dockerfile.gpu
.docker run
commands for CPU or GPU variants, mapping port 7860.Highlighted Details
Maintenance & Community
The project is maintained by soulteary. No specific community channels or roadmap are detailed in the README.
Licensing & Compatibility
The README does not explicitly state a license. The project uses models from Hugging Face, which have their own licenses (e.g., MIT for Helsinki-NLP, potentially others for the prompt and CLIP models). Compatibility for commercial use would depend on the licenses of the underlying models.
Limitations & Caveats
The project relies on external models whose performance and availability are not guaranteed. The README does not detail specific performance benchmarks or known limitations of the prompt generation itself.
2 years ago
1 week