Curated list of prompt/adapter learning methods for VLMs
This repository is a curated list of prompt and adapter learning methods for Vision-Language Models (VLMs) like CLIP. It serves researchers and practitioners by cataloging papers, providing comparative benchmarks, and highlighting open-source implementations, aiming to facilitate efficient adaptation of VLMs for various downstream tasks.
How It Works
The list categorizes methods into prompt learning, test-time prompt learning, and adapter learning, with specific sections for video understanding and continual learning. It includes a comparative table of prompt learning methods, showcasing performance metrics across multiple datasets and indicating the availability of code. The repository emphasizes papers with open-source code, making it a practical resource for empirical evaluation and implementation.
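For readers new to these families, below is a minimal, self-contained sketch (not taken from any specific listed paper) of the two core ideas: prompt learning optimizes a handful of context vectors fed to a frozen text encoder, while adapter learning trains a small residual bottleneck on top of frozen features. The encoder, dimensions, and hyperparameters are illustrative placeholders, not a reference implementation.

```python
import torch
import torch.nn as nn

# Illustrative sizes only; real methods plug into CLIP's actual towers.
embed_dim, feat_dim, n_ctx = 512, 512, 4

class PromptLearner(nn.Module):
    """Prompt learning: optimize a few context vectors, keep the VLM frozen."""
    def __init__(self, frozen_text_encoder, class_token_embeds):
        super().__init__()
        self.ctx = nn.Parameter(torch.randn(n_ctx, embed_dim) * 0.02)  # learnable context
        self.register_buffer("cls", class_token_embeds)                # frozen class-name embeddings (C, L, D)
        self.encoder = frozen_text_encoder                             # frozen text tower (stand-in)

    def forward(self):
        # Prepend the shared learnable context to every class-name embedding.
        ctx = self.ctx.unsqueeze(0).expand(self.cls.size(0), -1, -1)   # (C, n_ctx, D)
        prompts = torch.cat([ctx, self.cls], dim=1)                    # (C, n_ctx + L, D)
        return self.encoder(prompts)                                   # assumed to return (C, feat_dim) class weights

class Adapter(nn.Module):
    """Adapter learning: a small residual bottleneck over frozen features."""
    def __init__(self, dim=feat_dim, reduction=4, ratio=0.2):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(dim, dim // reduction), nn.ReLU(inplace=True),
            nn.Linear(dim // reduction, dim), nn.ReLU(inplace=True),
        )
        self.ratio = ratio  # blend between adapted and original frozen features

    def forward(self, frozen_features):
        return self.ratio * self.net(frozen_features) + (1 - self.ratio) * frozen_features
```

In both cases only a tiny number of parameters is trained while the pretrained VLM stays frozen, which is what makes these methods attractive for few-shot and resource-constrained adaptation.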
Quick Start & Requirements
This is a curated list, not a runnable software package. To use a method, refer to its paper and the linked code repository; these implementations typically require a Python environment with a deep learning framework (PyTorch/TensorFlow) and a specific VLM implementation such as CLIP. Links to papers and code are provided for each entry.
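For orientation, here is a minimal zero-shot example using OpenAI's `clip` package; most listed methods start from such a frozen backbone and add learnable prompts or adapters on top. The image path and class names are placeholders.

```python
import torch
import clip  # pip install git+https://github.com/openai/CLIP.git
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)  # frozen CLIP backbone

# "example.jpg" and the class names are placeholders; use your own data.
image = preprocess(Image.open("example.jpg")).unsqueeze(0).to(device)
text = clip.tokenize([f"a photo of a {c}" for c in ["cat", "dog", "car"]]).to(device)

with torch.no_grad():
    logits_per_image, _ = model(image, text)   # image-text similarity logits
    probs = logits_per_image.softmax(dim=-1)   # zero-shot class probabilities
print(probs)
```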
Maintenance & Community
The list is actively maintained by zhengli97, with an invitation for community contributions via email or GitHub issues. The focus is on including papers from top-tier conferences and journals.
Licensing & Compatibility
The repository itself is a list and does not have a specific license. Individual papers and their associated codebases will have their own licenses, which users must adhere to. Compatibility for commercial use depends entirely on the licenses of the referenced code.
Limitations & Caveats
The repository is a collection of links and information; it does not provide a unified API or framework. Users must individually retrieve, set up, and integrate each method. Some listed papers may not have publicly available code or may be in early stages (e.g., arXiv preprints).