PromptSRC: vision-language prompt learning
This repository provides the official implementation for PromptSRC, a self-regulating framework for adapting foundational vision-language models like CLIP to downstream tasks without sacrificing their generalizability. It targets researchers and practitioners in computer vision and natural language processing seeking to improve prompt learning efficiency and performance.
How It Works
PromptSRC addresses the common issue of prompt learning methods overfitting to downstream tasks, leading to a loss of CLIP's inherent generalization capabilities. It employs a three-pronged self-regularization approach: maximizing mutual agreement between prompted and frozen model features, using Gaussian-weighted self-ensembling of prompts over training, and incorporating textual diversity to balance visual and textual branches. This strategy aims to jointly optimize for task-specific performance and task-agnostic representations.
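A minimal sketch of those three regularizers in PyTorch is shown below. All function and variable names (mutual_agreement_loss, gaussian_weight, ensemble_prompts, diverse_text_features) and the hyperparameter defaults are illustrative assumptions, not the repository's actual API; see the paper and code for the exact formulation.

```python
# Hypothetical sketch of PromptSRC's three self-regularization terms.
import math
import torch
import torch.nn.functional as F

def mutual_agreement_loss(prompted_feats, frozen_feats,
                          prompted_logits, frozen_logits):
    """Keep prompted features and predictions close to frozen CLIP's."""
    feat_loss = F.l1_loss(prompted_feats, frozen_feats)      # feature-level agreement
    logit_loss = F.kl_div(prompted_logits.log_softmax(dim=-1),
                          frozen_logits.softmax(dim=-1),
                          reduction="batchmean")             # logit-level agreement
    return feat_loss + logit_loss

def gaussian_weight(epoch, total_epochs, mean_frac=0.6, std=3.0):
    """Gaussian weighting of prompt checkpoints along the training run
    (mean_frac and std are made-up defaults for illustration)."""
    mu = mean_frac * total_epochs
    return math.exp(-((epoch - mu) ** 2) / (2 * std ** 2))

def ensemble_prompts(per_epoch_prompts):
    """Self-ensembling: Gaussian-weighted average of per-epoch prompt vectors,
    so the final prompt aggregates the whole training trajectory."""
    n = len(per_epoch_prompts)
    weights = torch.tensor([gaussian_weight(e, n) for e in range(n)])
    weights = weights / weights.sum()
    stacked = torch.stack(per_epoch_prompts)                 # (n, n_ctx, dim)
    return (weights.view(-1, *([1] * (stacked.dim() - 1))) * stacked).sum(0)

def diverse_text_features(clip_model, classnames, templates, tokenizer):
    """Textual diversity: average frozen text features over several prompt
    templates (e.g. "a photo of a {}.") per class."""
    feats = []
    for template in templates:
        tokens = tokenizer([template.format(c) for c in classnames])
        feats.append(F.normalize(clip_model.encode_text(tokens), dim=-1))
    return torch.stack(feats).mean(0)
```

Under these assumptions, the agreement loss would be added to the task loss during training, while the Gaussian-weighted ensemble would yield the final prompts once training completes.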
Quick Start & Requirements
See INSTALL.md for environment setup, DATASETS.md for dataset preparation, EVAL.md for reproducing results with pre-trained models, and TRAIN.md for training from scratch.
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats