Large language model for NLP tasks
Yuan 1.0 is a large-scale pre-trained language model designed for zero-shot and few-shot learning tasks in Natural Language Processing. It targets researchers and developers seeking high-performance language models, offering state-of-the-art results on various NLP tasks with a focus on Chinese text.
How It Works
Yuan 1.0 is a 246-billion-parameter singleton language model trained on a custom 5 TB Chinese text corpus, described as the largest of its kind at the time. The model architecture was co-designed with the large-scale distributed training setup, and a calibration and label-expansion method improves zero-shot and few-shot performance; the generated text is reported to be difficult for human readers to distinguish from human-written articles.
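As a rough illustration of the calibration idea, the sketch below rescales a model's raw zero-shot label probabilities by its baseline bias toward each label (estimated from a content-free input). This is a generic calibration recipe, not Yuan 1.0's actual implementation; the function name, task, and numbers are hypothetical.

```python
# Illustrative sketch (not Yuan 1.0's code): divide out the model's
# prior bias toward each label, estimated from a content-free input
# such as an empty prompt, then renormalize.

def calibrate(label_probs, content_free_probs):
    """Rescale raw label probabilities by the content-free baseline
    and renormalize so the result sums to 1."""
    scaled = {label: p / content_free_probs[label]
              for label, p in label_probs.items()}
    total = sum(scaled.values())
    return {label: s / total for label, s in scaled.items()}

# Hypothetical sentiment task: the model leans toward "positive"
# even on an empty input, so calibration shifts mass to "negative".
raw = {"positive": 0.6, "negative": 0.4}
baseline = {"positive": 0.75, "negative": 0.25}
print(calibrate(raw, baseline))  # → {'positive': 0.333..., 'negative': 0.666...}
```

Calibrating against a content-free baseline is one standard way to reduce label bias in few-shot prompting; the paper's exact method may differ in detail.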
Quick Start & Requirements
pip install requests   # hashlib and json are in the Python standard library; no install needed
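Since API access is gated behind an application and a generated key, a typical client builds a signed request before calling the service. The sketch below shows that pattern only: the endpoint URL, parameter names, concatenation order, and choice of hash are placeholders, not the real Yuan 1.0 API specification.

```python
import hashlib
import time

# Placeholder endpoint -- substitute the URL issued with your API key.
API_URL = "https://example.com/yuan/api"

def sign_request(account, secret_key, timestamp):
    """Hex digest over account, key, and timestamp.
    The concatenation order and SHA-256 choice are illustrative only."""
    payload = f"{account}{secret_key}{timestamp}".encode("utf-8")
    return hashlib.sha256(payload).hexdigest()

def submit_prompt(account, secret_key, prompt):
    """Assemble a signed request body (dry run: nothing is sent).
    A real client would POST this as JSON with the requests library:
        requests.post(API_URL, json=body)
    """
    ts = str(int(time.time()))
    body = {"account": account, "timestamp": ts,
            "signature": sign_request(account, secret_key, ts),
            "prompt": prompt}
    return body
```

Keeping the signing step in a separate function makes it easy to swap in the provider's actual scheme once the key-generation documentation is in hand.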
Highlighted Details
Maintenance & Community
The project is associated with Inspur. Further community or maintenance details are not explicitly provided in the README.
Licensing & Compatibility
The README does not specify a license. API access is controlled via application and key generation.
Limitations & Caveats
Access to the model's full capabilities is gated behind an application process. The README does not detail specific hardware requirements for running inference locally, implying API usage is the primary access method.