gpt2-ml  by imcaspar

GPT-2 for multiple languages, including pretrained models

Created 5 years ago
1,713 stars

Top 24.9% on SourcePulse

Project Summary

This repository provides a GPT-2 implementation with multilingual support, along with a pretrained 1.5 billion parameter Chinese model. It targets researchers and developers working on large-scale Chinese text generation and analysis.

How It Works

The project adapts Grover's training scripts for GPT-2 and uses a ported BERT tokenizer, which makes it compatible with multilingual corpora. Training runs on Cloud TPUs, which is what made a model as large as the 1.5B parameter Chinese version practical to train.
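
To illustrate what a BERT-style tokenizer does with text, here is a minimal sketch of greedy longest-match-first WordPiece splitting, the general technique BERT tokenizers use. It is not the repository's ported code: the function name `wordpiece_tokenize` and the toy vocabulary are assumptions made up for this example, and a real vocabulary file holds tens of thousands of entries.

```python
from typing import List, Set


def wordpiece_tokenize(word: str, vocab: Set[str], unk: str = "[UNK]") -> List[str]:
    """Split one word into sub-word pieces, always taking the longest match."""
    pieces: List[str] = []
    start = 0
    while start < len(word):
        end = len(word)
        match = None
        # Shrink the candidate from the right until it is found in the vocabulary.
        while start < end:
            candidate = word[start:end]
            if start > 0:
                candidate = "##" + candidate  # marks a piece that continues a word
            if candidate in vocab:
                match = candidate
                break
            end -= 1
        if match is None:
            # No piece fits at this position: the whole word becomes unknown.
            return [unk]
        pieces.append(match)
        start = end
    return pieces


# Hypothetical toy vocabulary, for illustration only.
TOY_VOCAB = {"un", "##aff", "##able", "中", "##文"}

print(wordpiece_tokenize("unaffable", TOY_VOCAB))  # ['un', '##aff', '##able']
print(wordpiece_tokenize("中文", TOY_VOCAB))        # ['中', '##文']
print(wordpiece_tokenize("xyz", TOY_VOCAB))        # ['[UNK]']
```

Note that a full BERT tokenizer first runs a basic tokenization pass that, among other things, separates each CJK character into its own word before WordPiece is applied; that pass is omitted here to keep the sketch short.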

Quick Start & Requirements

  • Install/Run: Colab demo available for quick experimentation.
  • Prerequisites: Google Colab for the demo; Cloud TPUs if you intend to train a model yourself.
  • Resources: The pretrained models were trained on Chinese corpora of ~15GB and ~30GB.
  • Links: Colab Notebook

Highlighted Details

  • Features a 1.5 billion parameter GPT-2 model pretrained on Chinese text, with versions trained on ~15GB and ~30GB corpora.
  • Training ran on a Cloud TPU Pod v3-256 for 220,000 steps.
  • Includes simplified training scripts based on Grover.
  • Compatible with multilingual corpora via a ported BERT tokenizer.
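
As a rough sizing aid, the memory needed just to hold the weights of a 1.5 billion parameter model can be estimated from the parameter count. The sketch below is plain arithmetic under stated assumptions: the helper name `weight_footprint_gb` is made up for this example, the figures cover weights only (no optimizer state, activations, or checkpoint metadata), and nothing here is taken from the repository's actual checkpoint files.

```python
def weight_footprint_gb(num_params: float, bytes_per_param: int) -> float:
    """Return the raw weight size in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


NUM_PARAMS = 1.5e9  # parameter count quoted in the summary above

# Assumed storage widths: 4 bytes for float32, 2 bytes for float16/bfloat16.
for name, width in (("float32", 4), ("float16/bfloat16", 2)):
    print(f"{name}: ~{weight_footprint_gb(NUM_PARAMS, width):.1f} GB")
# float32: ~6.0 GB
# float16/bfloat16: ~3.0 GB
```

Checkpoints saved during training are usually larger than this estimate, because they tend to include optimizer state alongside the weights.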

Maintenance & Community

  • Developed by Zhibo Zhang.
  • Research supported by Google's TensorFlow Research Cloud (TFRC).

Licensing & Compatibility

  • The repository's license is not explicitly stated in the README.
  • The project is intended for academic research purposes.

Limitations & Caveats

The project is intended for academic research only, and its authors state that they draw no conclusive remarks from it. The license terms for commercial use or closed-source linking are not detailed.

Health Check

  • Last commit: 2 years ago
  • Responsiveness: 1 week
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 1 star in the last 30 days
