awesome-transformers  by abacaj

Curated list of transformer models

created 2 years ago
659 stars

Top 51.7% on sourcepulse

GitHubView on GitHub
Project Summary

This repository is a curated list of transformer models, categorized by architecture and modality, aimed at researchers and practitioners in NLP, computer vision, and speech processing. It provides model names, descriptions, links to Hugging Face or GitHub repositories, original papers, sources, and licenses, facilitating the discovery and selection of suitable pre-trained models for various tasks.

How It Works

The list is organized into distinct categories such as Encoder, Decoder, Encoder+Decoder, Multimodal, Vision, Audio, Recommendation, and Grounded Situation Recognition. Each entry includes essential metadata, allowing users to quickly assess model capabilities, origins, and licensing terms. The curation aims to cover a broad spectrum of transformer applications.

Quick Start & Requirements

This is a curated list, not a runnable codebase. To use the models, refer to the individual model links provided for installation and usage instructions.

Highlighted Details

  • Comprehensive categorization of transformer models across diverse domains.
  • Includes links to papers, Hugging Face/GitHub repos, and source organizations.
  • Explicitly lists model licenses, highlighting non-commercial or restrictive ones.
  • Covers a wide range of architectures and modalities, from text to vision and audio.

Maintenance & Community

The list is maintained by abacaj, with an invitation for community contributions via pull requests or Twitter outreach.

Licensing & Compatibility

Licenses vary significantly, including Apache 2.0, MIT, BSD 3-Clause, CC BY 4.0, CC BY-NC-SA 4.0, and custom licenses with use-based restrictions. Some models, like LLaMa and OPT, require approval and are non-commercial. VALL-E has a dependency on a CC-BY-NC library.

Limitations & Caveats

The list is a directory and does not provide direct access to the models themselves. Users must consult individual model repositories for specific usage, dependencies, and potential compatibility issues, especially concerning non-commercial licenses or restricted use cases.

Health Check
Last commit

2 years ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
14 stars in the last 90 days

Explore Similar Projects

Starred by Dan Guido Dan Guido(Cofounder of Trail of Bits), Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), and
6 more.

open-llms by eugeneyan

0.2%
12k
Curated list of commercially-usable open LLMs
created 2 years ago
updated 5 months ago
Feedback? Help us improve.