Discover and explore top open-source AI tools and projects—updated daily.
Hunyuan-PromptEnhancerAI prompt enhancer for text-to-image generation
Top 21.8% on SourcePulse
Hunyuan-PromptEnhancer refines text prompts for text-to-image models, transforming user inputs into clearer, structured, and logically consistent versions. It targets users of image generation AI, researchers, and developers seeking to improve prompt quality and control. The tool enhances prompt clarity and adherence to desired image attributes, potentially leading to more accurate and aesthetically pleasing image outputs.
How It Works
The core approach involves rewriting prompts using a chain-of-thought methodology, structuring them with a "global–details–summary" narrative. This process preserves the original intent across key elements like subject, action, style, and layout. The system employs robust output parsing, prioritizing structured tags like <answer>...</answer>, falling back to cleaner text extraction, or using the original input if parsing fails. Inference parameters such as temperature, top_p, and max_new_tokens are configurable to balance deterministic output with creative diversity.
Quick Start & Requirements
Installation can be done via pip install -r requirements.txt or by running script/install_gguf.sh for GGUF model support. Prerequisites include Python 3.8+, CUDA 11.8+ (recommended for GPU acceleration), at least 20GB of free storage for models, and 8GB+ RAM (16GB+ recommended for 32B models). Users can download the PromptEnhancer-7B model (13GB) via huggingface-cli download tencent/HunyuanImage-2.1/reprompt --local-dir ./models/promptenhancer-7b.
Highlighted Details
Maintenance & Community
The project appears actively developed with recent updates noted in September 2025. Contact is available via email at hunyuan_opensource@tencent.com. The project acknowledges contributions from the Transformers and HuggingFace communities.
Licensing & Compatibility
The provided README does not explicitly state the project's license. Users should verify licensing terms before integration, especially for commercial or closed-source applications.
Limitations & Caveats
No specific limitations, known bugs, or alpha status are mentioned in the provided documentation. Users should be aware of the significant hardware requirements for larger models, particularly the 32B variants and their GGUF counterparts.
2 weeks ago
Inactive
microsoft
bigscience-workshop