This repository provides an elegant PyTorch implementation of transformer models, aiming to simplify the process of loading, fine-tuning, and deploying large language models (LLMs). It is designed for researchers and developers working with NLP tasks who need a flexible and efficient framework for various transformer architectures.
How It Works
The library offers a unified interface for building and managing transformer models, abstracting away much of the complexity associated with different architectures and pre-trained weights. It supports loading models from Hugging Face or local checkpoints, handling configuration files, and integrating common training tricks like LoRA. The design emphasizes code clarity and reusability, drawing inspiration from the Keras training style.
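A minimal sketch of this flow is shown below, assuming the `build_transformer_model` entry point and `Tokenizer` class that bert4torch inherits from the bert4keras design; the file paths and exact call signatures are illustrative rather than verbatim from the library's documentation:

```python
# Illustrative sketch of the unified loading interface (paths and exact
# signatures are assumptions based on the bert4keras-style API).
import torch
from bert4torch.models import build_transformer_model
from bert4torch.tokenizers import Tokenizer

config_path = "pretrained/bert-base/bert4torch_config.json"   # architecture config
checkpoint_path = "pretrained/bert-base/pytorch_model.bin"     # pre-trained weights
vocab_path = "pretrained/bert-base/vocab.txt"

tokenizer = Tokenizer(vocab_path, do_lower_case=True)
model = build_transformer_model(config_path, checkpoint_path)  # a torch.nn.Module

token_ids, segment_ids = tokenizer.encode("bert4torch makes transformers simple")
with torch.no_grad():
    outputs = model([torch.tensor([token_ids]), torch.tensor([segment_ids])])
```

Per the description above, the same builder is meant to cover different architectures; only the config file and checkpoint change.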
Quick Start & Requirements
- Install: `pip install bert4torch`
- Requirements: Python and PyTorch (developed against v2.0, also compatible with v1.10). A GPU is recommended for LLM workloads.
- Setup: minimal for basic usage; LLM fine-tuning and deployment require significant computational resources and dataset preparation (a fine-tuning sketch follows this list).
- Links: Documentation, Torch4keras, Examples
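For fine-tuning, the library's Keras-inspired workflow (via Torch4keras) roughly follows a compile/fit pattern. The sketch below is illustrative only: the class layout, the `with_pool` argument, and the exact `compile`/`fit` signatures are assumptions and may differ between versions.

```python
# Illustrative Keras-style fine-tuning loop; names are assumptions based on the
# Torch4keras-backed API, not a verbatim example from the repository.
import torch.nn as nn
import torch.optim as optim
from bert4torch.models import build_transformer_model, BaseModel

class Classifier(BaseModel):
    def __init__(self, config_path, checkpoint_path, num_labels=2):
        super().__init__()
        self.bert = build_transformer_model(config_path, checkpoint_path, with_pool=True)
        self.classifier = nn.Linear(768, num_labels)

    def forward(self, token_ids, segment_ids):
        _, pooled = self.bert([token_ids, segment_ids])   # pooled [CLS] representation
        return self.classifier(pooled)

model = Classifier("config.json", "pytorch_model.bin")
model.compile(
    loss=nn.CrossEntropyLoss(),
    optimizer=optim.Adam(model.parameters(), lr=2e-5),
)
# train_dataloader: a torch DataLoader yielding ((token_ids, segment_ids), labels)
# model.fit(train_dataloader, epochs=3)
```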
Highlighted Details
- Supports a wide range of LLMs (ChatGLM, Llama, Baichuan, Qwen, etc.) and traditional transformers (BERT, RoBERTa, T5, etc.).
- One-click deployment of LLM services from the command line via `bert4torch-llm-server`.
- Integrates common training tricks (e.g. LoRA) and callbacks for efficient fine-tuning; see the LoRA sketch after this list.
- Offers a comprehensive table of supported pre-trained weights and their loading methods.
- Code is designed for ease of understanding and customization, with a focus on code reuse.
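As an illustration of the kind of training trick mentioned above, the snippet below applies LoRA to a causal LM using the Hugging Face `peft` library on a small public model. This is a generic sketch, not bert4torch's built-in LoRA mechanism, whose interface may differ.

```python
# Generic LoRA illustration using Hugging Face `peft`; bert4torch's own LoRA
# integration may expose a different interface.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("gpt2")

lora_config = LoraConfig(
    r=8,                          # rank of the low-rank update
    lora_alpha=16,                # scaling factor applied to the update
    lora_dropout=0.05,
    target_modules=["c_attn"],    # GPT-2's fused attention projection
    fan_in_fan_out=True,          # needed because GPT-2 uses Conv1D layers
    task_type="CAUSAL_LM",
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # only the LoRA adapters are trainable
```

Freezing the base weights and training only the low-rank adapters is what makes LLM fine-tuning feasible on the modest GPU setups noted in the requirements above.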
Maintenance & Community
- The project is primarily maintained by a single individual.
- Community support is available via WeChat (contact author for group invitation).
Licensing & Compatibility
- The repository does not explicitly state a license in the README. This requires clarification for commercial use or integration into closed-source projects.
Limitations & Caveats
- The project is largely maintained by a single individual, which could impact long-term development velocity and support.
- The absence of a clear license in the README is a significant caveat for adoption, especially for commercial applications.