tensorli by joennlae

Minimalist GPT-like transformer implementation for educational purposes

Created 2 years ago · 253 stars · Top 99.4% on SourcePulse

Project Summary

This project provides an absolutely minimal implementation of a GPT-like transformer using only NumPy, targeting developers and researchers who want to understand the core mechanics of transformer models without the complexity of large frameworks. It serves as a learning tool for building and training a transformer architecture from scratch.

How It Works

Tensorli implements a GPT-like transformer using a custom Tensorli object that mimics PyTorch's tensor functionality, built entirely on NumPy. It includes automatic differentiation and essential neural network components like Linearli, Embeddingli, MultiheadAttentionli, and LayerNorm, along with the Adamli optimizer. This approach prioritizes clarity and educational value over performance or scalability.
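
The repository's internals are not reproduced here, but the core idea behind a Tensorli-style object can be sketched in a few dozen lines: a NumPy array wrapper that records each operation's inputs and a closure for its local gradient, then walks the graph in reverse. The class name comes from the project; everything else below (method names, graph bookkeeping) is an illustrative assumption, not the repository's actual API:

```python
import numpy as np

class Tensorli:
    """Illustrative NumPy-backed tensor with reverse-mode autodiff
    (a sketch of the idea, not the project's actual implementation)."""

    def __init__(self, data, _children=()):
        self.data = np.asarray(data, dtype=np.float32)
        self.grad = np.zeros_like(self.data)
        self._backward = lambda: None   # closure filled in by each op
        self._prev = set(_children)     # inputs that produced this tensor

    def __add__(self, other):
        out = Tensorli(self.data + other.data, (self, other))
        def _backward():  # d(a+b)/da = d(a+b)/db = 1
            self.grad += out.grad
            other.grad += out.grad
        out._backward = _backward
        return out

    def __matmul__(self, other):
        out = Tensorli(self.data @ other.data, (self, other))
        def _backward():  # for C = A @ B: dA = dC @ B.T, dB = A.T @ dC
            self.grad += out.grad @ other.data.T
            other.grad += self.data.T @ out.grad
        out._backward = _backward
        return out

    def relu(self):
        out = Tensorli(np.maximum(self.data, 0.0), (self,))
        def _backward():  # gradient passes only where the input was positive
            self.grad += (out.data > 0) * out.grad
        out._backward = _backward
        return out

    def backward(self):
        # Topologically order the graph, then apply the chain rule in reverse.
        topo, visited = [], set()
        def build(t):
            if t not in visited:
                visited.add(t)
                for child in t._prev:
                    build(child)
                topo.append(t)
        build(self)
        self.grad = np.ones_like(self.data)  # seed: gradient of sum(output)
        for t in reversed(topo):
            t._backward()

# Tiny check: gradients of sum(relu(x @ w + b)) with respect to w
x = Tensorli(np.random.randn(2, 3))
w = Tensorli(np.random.randn(3, 4))
b = Tensorli(np.zeros((2, 4)))
y = (x @ w + b).relu()
y.backward()
print(w.grad.shape)  # (3, 4)
```

A real implementation additionally needs broadcasting-aware gradients, softmax, and the attention and normalization layers named above, but the graph-plus-closures pattern stays the same.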

Quick Start & Requirements

  • Install via Conda: conda env create -f environment.yml or mamba env create -f environment.yml.
  • Activate environment: conda activate tensorli.
  • Set Python path: export PYTHONPATH=$PWD.
  • Run tests: pytest.
  • Requires Python and NumPy.
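
Collected into a single shell session, the steps above are (conda shown; mamba works as a drop-in replacement for the first command):

```bash
# Create and activate the environment
conda env create -f environment.yml
conda activate tensorli

# Make the package importable from the repo root, then run the test suite
export PYTHONPATH=$PWD
pytest
```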

Highlighted Details

  • Implements automatic differentiation.
  • Includes core NN layers and the Adam optimizer (see the sketch after this list).
  • Demonstrates a functional GPT-like transformer architecture.
  • Inspired by minGPT and tinygrad.
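
The Adam update itself is standard; below is a minimal NumPy sketch of the step an Adamli-style optimizer performs. The function name, hyperparameter defaults, and stand-in loss are illustrative, not the project's code:

```python
import numpy as np

def adam_step(param, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update on a NumPy parameter array (t is the 1-based step)."""
    m = beta1 * m + (1 - beta1) * grad        # first-moment (mean) EMA
    v = beta2 * v + (1 - beta2) * grad ** 2   # second-moment (variance) EMA
    m_hat = m / (1 - beta1 ** t)              # bias correction
    v_hat = v / (1 - beta2 ** t)
    param = param - lr * m_hat / (np.sqrt(v_hat) + eps)
    return param, m, v

# Usage: keep (m, v) state per parameter and increment t each step.
w = np.random.randn(3, 4)
m, v = np.zeros_like(w), np.zeros_like(w)
for t in range(1, 4):
    g = 2 * w  # gradient of ||w||^2, as a stand-in loss
    w, m, v = adam_step(w, g, m, v, t)
```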

Maintenance & Community

This appears to be a personal learning project; the README makes no mention of contributors, sponsorships, or community channels.

Licensing & Compatibility

The README does not specify a license. Compatibility for commercial or closed-source use is not addressed.

Limitations & Caveats

The library is not optimized and is not intended for production use or large-scale applications; it is purely a learning tool. Dropout and additional experimental architectures are planned but not yet implemented.
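
For reference, the standard inverted-dropout forward pass that such a planned layer would typically compute looks like this in NumPy (an illustrative sketch, not the project's code):

```python
import numpy as np

def dropout(x, p=0.1, training=True, rng=None):
    """Inverted dropout: zero each activation with probability p during
    training and scale survivors by 1/(1-p), so eval needs no rescaling."""
    if not training or p == 0.0:
        return x
    rng = rng if rng is not None else np.random.default_rng()
    mask = rng.random(x.shape) >= p   # keep each entry with probability 1 - p
    return x * mask / (1.0 - p)

h = np.random.randn(4, 8)
print(dropout(h, p=0.1).shape)  # (4, 8); roughly 10% of entries zeroed
```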

Health Check

  • Last Commit: 1 year ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 0 stars in the last 30 days

Explore Similar Projects

DiT by facebookresearch

PyTorch implementation for diffusion models with transformers (DiT)

  • Top 0.3% on SourcePulse · 8k stars
  • Created 2 years ago · Updated 1 year ago
  • Starred by Shizhe Diao (Author of LMFlow; Research Scientist at NVIDIA), Edward Sun (Research Scientist at Meta Superintelligence Lab), and 9 more.

FasterTransformer by NVIDIA

Optimized transformer library for inference

  • Top 0.1% on SourcePulse · 6k stars
  • Created 4 years ago · Updated 1 year ago
  • Starred by Nat Friedman (Former CEO of GitHub), Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems"), and 15 more.