keras-transformer by kpot

Keras library for building Transformer models, enabling BERT and GPT

Created 7 years ago
540 stars

Top 58.9% on SourcePulse

View on GitHub
Project Summary

This library provides Keras layers for building Universal Transformer models, targeting researchers and practitioners in NLP. It offers a flexible, modular approach to constructing Transformer architectures, enabling experimentation with models like BERT and GPT.

How It Works

The library implements core Transformer components as standalone Keras layers, including positional encoding, attention masking, and memory-compressed attention. This modular design lets users assemble custom Transformer architectures by composing these layers, replace or rearrange components directly, and experiment with variations such as Adaptive Computation Time (ACT).
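To make the component-level view concrete, here is a minimal NumPy sketch of two of the named pieces, sinusoidal positional encoding and a causal attention mask. This is illustrative only and does not use the library's actual layer API; the function names are hypothetical.

```python
import numpy as np

def sinusoidal_positions(seq_len, d_model):
    """Sinusoidal positional encoding ("Attention Is All You Need"):
    even channels use sine, odd channels use cosine."""
    pos = np.arange(seq_len)[:, None]       # (seq_len, 1)
    i = np.arange(d_model)[None, :]         # (1, d_model)
    angle = pos / np.power(10000.0, (2 * (i // 2)) / d_model)
    return np.where(i % 2 == 0, np.sin(angle), np.cos(angle))

def causal_mask(seq_len):
    """Lower-triangular mask: position t may attend only to positions <= t,
    as used for GPT-style language modeling."""
    return np.tril(np.ones((seq_len, seq_len), dtype=bool))

pe = sinusoidal_positions(8, 16)   # (8, 16) encoding added to token embeddings
mask = causal_mask(4)              # (4, 4) boolean attention mask
```

In a Keras model, the encoding would be added to the embedding output and the mask passed to the attention layer; the library packages both as reusable layers.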

Quick Start & Requirements

  • Install via pip install . after cloning the repository.
  • Requires Python >= 3.6.
  • Example usage requires pip install -r example/requirements.txt and a Keras backend like TensorFlow (pip install tensorflow).
  • Examples are available for BERT and GPT language modeling on WikiText-2.

Highlighted Details

  • Supports Universal Transformers, BERT, and GPT architectures.
  • Includes memory-compressed attention and Adaptive Computation Time (ACT).
  • Modular Keras layers allow for custom model construction.
  • Demonstrates language modeling on WikiText-2 with perplexity metrics.
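The ACT mechanism mentioned above can be sketched in a few lines. This is a simplified NumPy illustration of the halting rule from Graves's Adaptive Computation Time (as adopted by Universal Transformers), not the library's implementation; the function name is hypothetical.

```python
import numpy as np

def act_halt(halt_probs, threshold=0.99):
    """Accumulate per-step halting probabilities for one position until the
    running sum would reach `threshold`; the final step receives the
    remainder so the weights sum to 1. Returns (steps_used, weights)."""
    weights = []
    acc = 0.0
    for p in halt_probs:
        if acc + p >= threshold:
            weights.append(1.0 - acc)   # remainder assigned to the last step
            break
        weights.append(p)
        acc += p
    return len(weights), np.array(weights)

# Position halts after 3 of 4 possible refinement steps:
steps, w = act_halt([0.3, 0.5, 0.4, 0.2])
```

The final state is then the weighted average of the per-step states under these weights, letting "easy" positions stop refining earlier than "hard" ones.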

Maintenance & Community

No specific information on maintainers, community channels, or roadmap is provided in the README.

Licensing & Compatibility

The README does not explicitly state a license.

Limitations & Caveats

The provided examples are demonstrations, not rigorous evaluations. Training BERT models requires significant time and computational resources.

Health Check

  • Last Commit: 5 years ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 1
  • Issues (30d): 0
  • Star History: 2 stars in the last 30 days

Explore Similar Projects

Starred by Stas Bekman (Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake) and Thomas Wolf (Cofounder of Hugging Face).

transformer by sannykim

546 stars
Resource list for studying Transformers
Created 6 years ago
Updated 1 year ago
Starred by Elvis Saravia (Founder of DAIR.AI) and Stas Bekman (Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake).

awesome-transformer-nlp by cedrickchee

1k stars
Curated list of NLP resources for Transformer networks
Created 6 years ago
Updated 10 months ago