fashion-clip by patrickjohncyh

Fashion-domain CLIP model fine-tuned for industry applications

created 3 years ago
424 stars

Top 70.8% on sourcepulse

Project Summary

FashionCLIP is a specialized CLIP-like model fine-tuned for the fashion domain, offering enhanced zero-shot performance on fashion-related tasks like retrieval and classification. It targets researchers and practitioners in the fashion industry seeking more accurate and domain-specific multimodal understanding.

How It Works

FashionCLIP builds upon the CLIP architecture, fine-tuning it on a large dataset of over 700K fashion image-text pairs from the Farfetch dataset. This fine-tuning process adapts the model to better capture domain-specific fashion concepts, leading to improved generalization and performance in zero-shot scenarios compared to general-purpose CLIP models. The latest version, FashionCLIP 2.0, leverages the laion/CLIP-ViT-B-32-laion2B-s34B-b79K checkpoint, further boosting performance due to its larger training data.
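The zero-shot mechanism described above can be illustrated with a minimal sketch: images and candidate captions are embedded into a shared vector space, and classification reduces to a cosine-similarity argmax over the captions. The embeddings below are dummy 4-dimensional stand-ins for CLIP's real outputs, not values produced by the model.

```python
import numpy as np

def zero_shot_classify(image_emb: np.ndarray, text_embs: np.ndarray,
                       labels: list[str]) -> str:
    """Pick the label whose text embedding is most similar to the image embedding."""
    # L2-normalize so the dot product equals cosine similarity
    img = image_emb / np.linalg.norm(image_emb)
    txt = text_embs / np.linalg.norm(text_embs, axis=1, keepdims=True)
    scores = txt @ img  # one cosine score per candidate caption
    return labels[int(np.argmax(scores))]

# Dummy embeddings standing in for the model's 512-d outputs
labels = ["a red dress", "blue denim jeans", "white sneakers"]
text_embs = np.array([[1.0, 0.1, 0.0, 0.0],
                      [0.0, 1.0, 0.2, 0.0],
                      [0.0, 0.0, 1.0, 0.3]])
image_emb = np.array([0.1, 0.9, 0.3, 0.0])  # closest to the second caption

print(zero_shot_classify(image_emb, text_embs, labels))  # → blue denim jeans
```

Fine-tuning on fashion image-text pairs shifts these embeddings so that domain-specific captions score more reliably than they do under general-purpose CLIP.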

Quick Start & Requirements

  • Install via pip: $ pip install fashion-clip
  • For local development: $ pip install -e . from the project root.
  • Requires Python and standard ML libraries; exact version requirements are not documented, but the package builds on Hugging Face Transformers.
  • A Colab notebook is available that demonstrates additional functionality.
  • Hugging Face model available at patrickjohncyh/fashion-clip.
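Because the checkpoint is published on the Hugging Face Hub, it can also be loaded with the standard `transformers` CLIP classes. This is a minimal sketch, assuming `transformers`, `torch`, and `Pillow` are installed; the image path and captions are hypothetical, and the weights are downloaded on first use.

```python
MODEL_ID = "patrickjohncyh/fashion-clip"

def rank_captions(image_path: str, captions: list[str]) -> list[tuple[str, float]]:
    """Score candidate captions against a single image with FashionCLIP."""
    # Heavy dependencies imported lazily so the sketch reads without them installed
    from PIL import Image
    from transformers import CLIPModel, CLIPProcessor

    model = CLIPModel.from_pretrained(MODEL_ID)         # downloads weights on first use
    processor = CLIPProcessor.from_pretrained(MODEL_ID)
    inputs = processor(text=captions, images=Image.open(image_path),
                       return_tensors="pt", padding=True)
    probs = model(**inputs).logits_per_image.softmax(dim=1)[0]
    return sorted(zip(captions, probs.tolist()), key=lambda p: p[1], reverse=True)

# hypothetical usage (the image file is an assumption):
# rank_captions("dress.jpg", ["a red dress", "blue denim jeans", "white sneakers"])
```

This route skips the `fashion-clip` package entirely, which can be convenient when the rest of a pipeline already depends on `transformers`.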

Highlighted Details

  • FashionCLIP 2.0 achieves a weighted macro F1 score of 0.83 on FMNIST, 0.71 on KAGL, and 0.58 on DEEP datasets, outperforming OpenAI CLIP and Laion CLIP.
  • The project is associated with a publication in Nature Scientific Reports.
  • Provides an API for feature extraction, classification, and retrieval, with FCLIPDataset and FashionCLIP classes for data handling and model interaction.
  • Supports local and S3 image sources and private Hugging Face repositories.

Maintenance & Community

  • The project is associated with Patrick John Chia and Giuseppe Attanasio.
  • Model weights are available on Hugging Face.
  • A related project, RustEmbed, integrates FashionCLIP via gRPC.

Licensing & Compatibility

  • The README does not explicitly state a license. Although the code is released as open source, suitability for commercial use or linking from closed-source software is not specified.

Limitations & Caveats

  • The official Farfetch dataset used for training is pending release, so the pre-processed training data is not publicly available and the fine-tuning results may not be fully reproducible without it.
Health Check

  • Last commit: 6 months ago
  • Responsiveness: 1 day
  • Pull Requests (30d): 0
  • Issues (30d): 1

Star History

  • 34 stars in the last 90 days
