f-lite  by fal-ai

Diffusion model for image generation, trained on copyright-safe content

Created 6 months ago
415 stars

Top 70.5% on SourcePulse

GitHubView on GitHub
Project Summary

F Lite is a 10 billion parameter diffusion model designed for high-quality, copyright-safe image generation. It is trained exclusively on SFW content from Freepik's dataset, making it suitable for users prioritizing legal compliance and ethical AI development.

How It Works

F Lite is built upon a Diffusion Transformer (DiT) architecture, leveraging a large-scale dataset of 80 million curated images. This approach allows for detailed image synthesis with a focus on rich textures and enhanced visual fidelity, distinguishing it from models trained on broader, less controlled datasets.

Quick Start & Requirements

  • Installation: pip install -r requirements.txt
  • Prerequisites: Requires a GPU with at least 24GB of VRAM.
  • Usage: Command-line generation via python -m f_lite.generate or integration with the diffusers library.
  • ComfyUI: Supports ComfyUI workflows, including an advanced setup with prompt expansion via ComfyUI-KJNodes and ComfyUI-Custom-Scripts.
  • Resources: Model weights are available on Hugging Face.

Highlighted Details

  • 10B parameter diffusion model.
  • Trained exclusively on 80 million copyright-safe, SFW images.
  • Offers a standard and a texture-specialized model version.
  • Supports prompt expansion for improved results.

Maintenance & Community

  • Developed by Freepik and Fal.
  • Technical report available for detailed architecture and training insights.
  • Project updates are announced, with demos provided on Hugging Face and Fal.ai.

Licensing & Compatibility

  • F Lite weights are licensed under the CreativeML Open RAIL-M license.
  • T5 XXL and Flux Schnell VAE components are licensed under Apache 2.0.
  • Permissive licensing allows for commercial use and integration into closed-source projects.

Limitations & Caveats

The texture model may require more detailed prompts and can be less effective for vector-style imagery. The model requires significant VRAM (24GB+), though quantization has not been explored to reduce this footprint.

Health Check
Last Commit

2 months ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
1
Star History
4 stars in the last 30 days

Explore Similar Projects

Starred by Vincent Weisser Vincent Weisser(Cofounder of Prime Intellect), Patrick von Platen Patrick von Platen(Author of Hugging Face Diffusers; Research Engineer at Mistral), and
2 more.

IP-Adapter by tencent-ailab

0.2%
6k
Adapter for image prompt in text-to-image diffusion models
Created 2 years ago
Updated 1 year ago
Feedback? Help us improve.