parti  by google-research

Autoregressive text-to-image generation model

Created 3 years ago
1,582 stars

Top 26.4% on SourcePulse

GitHubView on GitHub
Project Summary

Parti is an autoregressive text-to-image generation model designed for high-fidelity photorealistic image synthesis, supporting complex compositions and world knowledge. It targets researchers and engineers exploring generative AI, offering an alternative to diffusion models by leveraging advances in large language models through scaling. Parti enables sophisticated image generation by treating the process as a sequence-to-sequence problem, similar to machine translation.

How It Works

Parti models text-to-image generation as a sequence-to-sequence task, analogous to machine translation, allowing it to benefit from large language model advancements. It encodes images into sequences of discrete tokens using the ViT-VQGAN tokenizer, which are then decoded back into high-quality, visually diverse images. This autoregressive approach, particularly when scaled in model size and data, unlocks significant capabilities in generating complex and nuanced visual content.

Quick Start & Requirements

No installation, setup, or specific requirements are detailed in the provided text.

Highlighted Details

  • Achieves consistent quality improvements by scaling up to 20 billion parameters.
  • Reports state-of-the-art zero-shot FID score of 7.23 and finetuned FID score of 3.22 on the MS-COCO benchmark.
  • Demonstrates effectiveness across diverse categories and challenges, evaluated using the newly released PartiPrompts benchmark, a set of over 1600 English prompts.

Maintenance & Community

Parti is a collaborative effort involving multiple Google Research teams, with numerous authors and acknowledgments indicating substantial internal development. No external community channels or specific maintenance details are provided.

Licensing & Compatibility

The provided text does not specify a license or any compatibility notes for commercial or closed-source use.

Limitations & Caveats

Parti is explicitly stated as "not an officially supported Google product." The significant parameter count (up to 20 billion) implies substantial computational resources are necessary for training and potentially inference.

Health Check
Last Commit

3 years ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
4 stars in the last 30 days

Explore Similar Projects

Starred by Shengjia Zhao Shengjia Zhao(Chief Scientist at Meta Superintelligence Lab), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
14 more.

BIG-bench by google

0.1%
3k
Collaborative benchmark for probing and extrapolating LLM capabilities
Created 4 years ago
Updated 1 year ago
Starred by Aravind Srinivas Aravind Srinivas(Cofounder of Perplexity), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
16 more.

text-to-text-transfer-transformer by google-research

0.1%
6k
Unified text-to-text transformer for NLP research
Created 6 years ago
Updated 5 months ago
Feedback? Help us improve.