awesome-attention-mechanism-in-cv by pprp

Curated list of attention modules for CV

Created 4 years ago
1,220 stars

Top 32.2% on SourcePulse

Project Summary

This repository is an "Awesome List" curating attention mechanisms and plug-and-play modules for computer vision tasks. It serves researchers and practitioners by cataloging papers, their publication venues, and associated code repositories, facilitating the exploration and adoption of state-of-the-art attention techniques.

How It Works

The list is organized into categories: Attention Mechanism, Dynamic Networks, Plug and Play Modules, and Vision Transformers. Each entry links to the relevant research paper and, where available, its implementation. This structured approach allows users to quickly find and compare different attention architectures and their applications in computer vision.

Quick Start & Requirements

This is a curated list, not a runnable library. To use any of the included modules, users must refer to the individual project repositories linked within the list.

Highlighted Details

  • Extensive catalog of attention mechanisms, including Squeeze-and-Excitation (SE), Convolutional Block Attention Module (CBAM), Non-local Networks, and various Transformer-based approaches.
  • Covers "plug-and-play" modules and dynamic network concepts that can be integrated into existing CNN architectures.
  • Features a dedicated section on Vision Transformers, highlighting key models like ViT, Swin Transformer, and MobileViT.
  • Includes links to papers, GitHub repositories, and sometimes blog posts or specific implementations for each listed item.
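As an illustration of the kind of module the list catalogs, here is a minimal sketch of the Squeeze-and-Excitation idea in plain Python. It is not taken from the repository or from any linked implementation: the function names (`se_block`, `linear`) and parameter names (`w_reduce`, `w_expand`) are hypothetical, biases are omitted for brevity, and real implementations use a deep learning framework with learned weights rather than nested lists.

```python
import math


def relu(x):
    return max(0.0, x)


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def linear(weights, vec):
    """Bias-free fully connected layer: one output per weight row."""
    return [sum(w * v for w, v in zip(row, vec)) for row in weights]


def se_block(feature_map, w_reduce, w_expand):
    """Apply channel attention to a feature map.

    feature_map: list of C channels, each an H x W list of lists.
    w_reduce:    (C // r) x C weight matrix (hypothetical name).
    w_expand:    C x (C // r) weight matrix (hypothetical name).
    Returns the rescaled feature map and the per-channel gates.
    """
    # Squeeze: global average pooling gives one scalar per channel.
    squeezed = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]
    # Excitation: bottleneck FC -> ReLU -> FC -> sigmoid.
    hidden = [relu(h) for h in linear(w_reduce, squeezed)]
    gates = [sigmoid(g) for g in linear(w_expand, hidden)]
    # Scale: multiply every value in a channel by that channel's gate.
    scaled = [
        [[gate * value for value in row] for row in channel]
        for channel, gate in zip(feature_map, gates)
    ]
    return scaled, gates


# Toy input: 4 channels of 2x2 values, reduction ratio r = 2.
features = [[[float(c + 1)] * 2 for _ in range(2)] for c in range(4)]
zero_reduce = [[0.0] * 4 for _ in range(2)]
zero_expand = [[0.0] * 2 for _ in range(4)]
scaled, gates = se_block(features, zero_reduce, zero_expand)
print(gates)
```

With all-zero weights every gate is sigmoid(0) = 0.5, so each channel is simply halved; with trained weights the gates differ per channel, which is what lets the block emphasize some channels over others. The output has the same shape as the input, which is why modules of this kind can be dropped into an existing CNN.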

Maintenance & Community

The list is maintained by pprp and welcomes community contributions via issues and pull requests for additions or corrections.

Licensing & Compatibility

The repository itself does not specify a license. Licensing of the individual code repositories linked from the list varies and must be checked per project.

Limitations & Caveats

The list is not exhaustive due to the maintainer's "limited ability and energy." Some entries may lack direct code links or comprehensive descriptions. The "Main Idea" column for Vision Transformers is often empty.

Health Check

  • Last Commit: 2 years ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 7 stars in the last 30 days
