Chinese GPT3 pre-trained language model
Top 72.8% on sourcepulse
SkyText is a large Chinese GPT-3 pre-trained language model developed by Singularity-AI. It addresses the need for advanced Chinese NLP capabilities, offering functionalities beyond basic chat and Q&A, including content generation, translation, and creative writing. The model is suitable for researchers and developers working with Chinese language AI applications.
How It Works
SkyText leverages a novel Chinese encoding method optimized for the language's unique characteristics, differing from English-centric approaches. This innovation, combined with a rigorous 30+ step data cleaning process for its training corpus, aims to enhance the model's comprehension and performance on Chinese text.
Quick Start & Requirements
transformers
library.transformers>=4.18.0
.GPT2LMHeadModel
and AutoTokenizer
from "SkyWork/SkyText" or "SkyWork/SkyTextTiny".device=0
).Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The 13 billion parameter model is currently closed-source, with users directed to await the release of a new 10 billion parameter model.
2 years ago
Inactive