PosterCraft by Ephemeral182

Unified framework for aesthetic poster generation

Created 9 months ago

530 stars

Top 59.9% on SourcePulse

Project Summary

PosterCraft is a unified framework for generating high-quality aesthetic posters, targeting users who need precise text rendering, seamless integration of artistic elements, and striking visual layouts. It offers a comprehensive solution for creating visually appealing posters with stylistic harmony.

How It Works

PosterCraft employs a four-stage training workflow to achieve its poster generation capabilities. It begins with Text Rendering Optimization for accurate text placement on backgrounds, followed by High-quality Poster Fine-tuning using Region-aware Calibration for style and text-background harmony. Aesthetic-Text RL is then applied for higher-order aesthetic trade-offs and defect mitigation, culminating in Vision-Language Feedback for iterative refinement and multi-modal corrections. This layered approach ensures both fidelity and aesthetic appeal.

Quick Start & Requirements

Installation: Clone the repository, create a conda environment (conda create -n postercraft python=3.11), activate it (conda activate postercraft), and install dependencies (pip install -r requirements.txt).
Prerequisites: Python 3.11, CUDA (implied for GPU inference).
Inference: Run python inference.py or python inference_offload.py for memory-limited GPUs. Requires specifying prompt, pipeline path (e.g., "black-forest-labs/FLUX.1-dev"), and custom transformer path (e.g., "PosterCraft/PosterCraft-v1_RL").
Demo: A Gradio web UI is available via python demo_gradio.py.
Resources: Model weights and datasets are available on HuggingFace.

Highlighted Details

Achieves state-of-the-art text rendering accuracy, outperforming several open and closed-source models in quantitative benchmarks (Text Recall, F-score, Accuracy).
Utilizes specialized datasets: Text-Render-2M (2M text rendering examples), HQ-Poster-100K (100K curated posters), Poster-Preference-100K (100K preference pairs for RL), and Poster-Reflect-120K (120K vision-language feedback pairs).
Offers two fine-tuned model weights: PosterCraft-v1_RL (Stage 3) and PosterCraft-v1_Reflect (Stage 4).
Integrates with ComfyUI via community contributions.

Maintenance & Community

The project is associated with The Hong Kong University of Science and Technology (Guangzhou) and Meituan. Updates include community integrations and Chinese article releases. Contact information for authors is provided.

Licensing & Compatibility

The repository does not explicitly state a license in the README. Model weights are available on HuggingFace, implying their usage is governed by HuggingFace's terms.

PosterCraft by Ephemeral182

Explore Similar Projects

X-Omni by X-Omni-Team

WithAnyone by Doby-Xu

StyleKeeper by naver-ai

qiaomu-mondo-poster-design by joeseesun

PosterCraft by MeiGen-AI

GLM-Image by zai-org

TediGAN by IIGROUP

rich-text-to-image by songweige

guizang-s-prompt by op7418

InstantStyle by instantX-research

Qwen-Image by QwenLM

awesome-gpt4o-images by jamez-bondos