Survey on text-to-image generation/synthesis
Top 19.8% on sourcepulse
This repository is a survey and curated collection of resources for text-to-image synthesis and manipulation. It gives researchers, engineers, and practitioners in generative AI a single index of papers, code, datasets, and project pages.
How It Works
The repository categorizes text-to-image research by year, by specific sub-task (e.g., Text to Face, Prompt Engineering), and by broader application area (e.g., Text+Image/Video → Image/Video, Text → 3D/Motion/Shape/Mesh/Object). Each entry follows a structured markdown format with links to the paper, code, and project page, so readers can navigate the list and reach the source material directly.
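Because entries follow a regular markdown pattern, their links can be pulled out programmatically. The sketch below is a minimal illustration, not the repository's own tooling. It assumes each entry is one line containing inline markdown links such as `[Paper](url)`. The sample entry, its title, and the `example.org` URLs are hypothetical placeholders, and `extract_links` is a helper name introduced here for illustration.

```python
import re

# Matches inline markdown links of the form [label](url).
# Assumption: entries use plain inline links, possibly wrapped in extra brackets.
LINK_RE = re.compile(r"\[([^\[\]]+)\]\((https?://[^\s)]+)\)")


def extract_links(entry: str) -> dict[str, str]:
    """Return a mapping from lowercased link label to URL for one list entry."""
    return {label.strip().lower(): url for label, url in LINK_RE.findall(entry)}


# Hypothetical entry; the title and URLs are placeholders, not real list items.
sample = (
    "* (2023) **Example Paper Title** "
    "[[Paper](https://example.org/paper)] [[Code](https://example.org/code)]"
)

links = extract_links(sample)
print(links)
```

A helper like this could be run over each line of the list to build an index of papers that ship code, though the real entry format may differ and should be checked against the repository first.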
Quick Start & Requirements
This is a curated list of research papers and code, not a runnable software package. No installation or execution is required.
Highlighted Details
Maintenance & Community
The repository is actively maintained, with recent updates including Version 2.0 and the addition of new survey papers. It is part of the "Awesome" list on GitHub, indicating community recognition.
Licensing & Compatibility
The repository itself does not specify a license. It links to various open-source projects, each with its own license, so users should verify the license of each linked project for compatibility and usage restrictions.
Limitations & Caveats
As a curated list, the repository does not provide direct functionality. The quality and availability of linked code and datasets depend on the original authors and projects. Some older links or projects may be outdated or unmaintained.