awesome-transformers by abacaj

Curated list of transformer models

Created 2 years ago
661 stars

Top 50.7% on SourcePulse

Project Summary

This repository is a curated list of transformer models, categorized by architecture and modality, for researchers and practitioners in NLP, computer vision, and speech processing. Each entry gives the model name, a description, links to its Hugging Face or GitHub repository and original paper, its source, and its license, which helps readers find and select a suitable pre-trained model for a given task.

How It Works

The list is organized into distinct categories such as Encoder, Decoder, Encoder+Decoder, Multimodal, Vision, Audio, Recommendation, and Grounded Situation Recognition. Each entry includes essential metadata, allowing users to quickly assess model capabilities, origins, and licensing terms. The curation aims to cover a broad spectrum of transformer applications.
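To make the entry structure concrete, here is a minimal sketch of how one row of the list could be modeled and grouped by category. The class name `ModelEntry`, its field names, and the helper `group_by_category` are assumptions chosen for illustration; the repository itself is a document, not code, and defines none of these. The sample entries are placeholders, not real rows from the list.

```python
from collections import defaultdict
from dataclasses import dataclass

# Category names as given in the section above.
CATEGORIES = (
    "Encoder", "Decoder", "Encoder+Decoder", "Multimodal",
    "Vision", "Audio", "Recommendation", "Grounded Situation Recognition",
)


@dataclass(frozen=True)
class ModelEntry:
    # Hypothetical field names mirroring the metadata each entry reports.
    name: str
    category: str
    description: str
    repo_url: str   # Hugging Face or GitHub link
    paper_url: str  # original paper
    source: str     # originating organization
    license: str

    def __post_init__(self):
        # Reject categories the list does not use.
        if self.category not in CATEGORIES:
            raise ValueError(f"unknown category: {self.category!r}")


def group_by_category(entries):
    """Return {category: [entries]}, keeping input order within each category."""
    grouped = defaultdict(list)
    for entry in entries:
        grouped[entry.category].append(entry)
    return dict(grouped)


# Placeholder entries for illustration only; not real rows from the list.
EXAMPLES = [
    ModelEntry("example-encoder", "Encoder", "placeholder description",
               "https://example.com/encoder", "https://example.com/encoder.pdf",
               "Example Lab", "MIT"),
    ModelEntry("example-vision", "Vision", "placeholder description",
               "https://example.com/vision", "https://example.com/vision.pdf",
               "Example Lab", "Apache 2.0"),
    ModelEntry("example-encoder-2", "Encoder", "placeholder description",
               "https://example.com/encoder2", "https://example.com/encoder2.pdf",
               "Another Lab", "CC BY-NC-SA 4.0"),
]

if __name__ == "__main__":
    for category, items in group_by_category(EXAMPLES).items():
        print(category, [item.name for item in items])
```

Keeping the license as a field on every entry is what lets a reader compare licensing terms across categories without opening each model page.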

Quick Start & Requirements

This is a curated list, not a runnable codebase. To use the models, refer to the individual model links provided for installation and usage instructions.

Highlighted Details

  • Comprehensive categorization of transformer models across diverse domains.
  • Includes links to papers, Hugging Face/GitHub repos, and source organizations.
  • Explicitly lists model licenses, highlighting non-commercial or restrictive ones.
  • Covers a wide range of architectures and modalities, from text to vision and audio.

Maintenance & Community

The list is maintained by abacaj, who invites community contributions via pull requests or Twitter.

Licensing & Compatibility

Licenses vary widely and include Apache 2.0, MIT, BSD 3-Clause, CC BY 4.0, CC BY-NC-SA 4.0, and custom licenses with use-based restrictions. Some models, such as LLaMA and OPT, require access approval and are restricted to non-commercial use. VALL-E depends on a library licensed under CC-BY-NC.
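As a rough first-pass aid for the licensing spread described above, the sketch below sorts a license string into one of three buckets. The bucket names and the function `classify_license` are hypothetical, and the license strings are just the ones named in this section. Anything not recognized, including custom licenses and approval-gated models, falls into "needs-review" rather than being guessed at.

```python
# License names taken from the section above. The grouping reflects the
# general terms of these licenses: the first set permits commercial use,
# the second forbids it.
COMMERCIAL_OK = {"Apache 2.0", "MIT", "BSD 3-Clause", "CC BY 4.0"}
NON_COMMERCIAL = {"CC BY-NC-SA 4.0"}


def classify_license(license_name: str) -> str:
    """Return 'commercial-ok', 'non-commercial', or 'needs-review'.

    Hypothetical helper: matches exact license names only, so custom or
    unfamiliar licenses are always routed to manual review.
    """
    name = license_name.strip()
    if name in COMMERCIAL_OK:
        return "commercial-ok"
    if name in NON_COMMERCIAL:
        return "non-commercial"
    return "needs-review"


if __name__ == "__main__":
    for name in ("MIT", "CC BY-NC-SA 4.0", "Custom (use-based restrictions)"):
        print(f"{name}: {classify_license(name)}")
```

Exact matching is deliberately conservative: a model whose license text differs from these names, or whose dependencies carry their own terms (as with VALL-E), still has to be checked by hand.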

Limitations & Caveats

The list is a directory and does not provide direct access to the models themselves. Users must consult individual model repositories for specific usage, dependencies, and potential compatibility issues, especially concerning non-commercial licenses or restricted use cases.

Health Check

  • Last commit: 2 years ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 2 stars in the last 30 days
