awesome-pretrained-chinese-nlp-models by lonePatient

Resource list: Chinese NLP pretrained models, LLMs, multimodal models

Created 6 years ago

5,501 stars

Top 9.1% on SourcePulse

3 Experts Love This Project

hiyouga

Author of LLaMA-Factory

shizhediao

Author of LMFlow; Research Scientist at NVIDIA

huybery

Research Scientist at Alibaba Qwen

Project Summary

This repository is a curated list of Chinese pre-trained NLP models, including large language models (LLMs) and multimodal models, along with associated resources. It gives researchers and developers a single place to discover and access models for Chinese natural language processing tasks.

How It Works

The project collects and organizes information on Chinese NLP models, categorizing them by architecture (e.g., BERT, GPT, T5, RoFormer), domain (e.g., general, finance, medical, code), and modality (text-only or multimodal). Each entry links to Hugging Face, the model repository, papers, and project pages, so models can be located and evaluated quickly.
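
The categorization described above can be pictured as one small record per model plus a filter over those records. The following is a minimal Python sketch, not the repository's actual format: the ModelEntry class, its field names, and the sample entries are hypothetical and only illustrate grouping by architecture, domain, and modality.

```python
from dataclasses import dataclass, field


@dataclass
class ModelEntry:
    """One row of a hypothetical model catalog (field names are assumptions)."""
    name: str
    architecture: str  # e.g. "BERT", "GPT", "T5", "RoFormer"
    domain: str        # e.g. "general", "finance", "medical", "code"
    modality: str      # "text" or "multimodal"
    links: dict = field(default_factory=dict)  # e.g. {"hf": "...", "paper": "..."}


def filter_entries(entries, **criteria):
    """Return the entries whose attributes equal every given criterion."""
    return [
        entry for entry in entries
        if all(getattr(entry, key) == value for key, value in criteria.items())
    ]


# Placeholder entries; these names do not refer to real models.
CATALOG = [
    ModelEntry("example-bert-base", "BERT", "general", "text"),
    ModelEntry("example-gpt-finance", "GPT", "finance", "text"),
    ModelEntry("example-t5-medical", "T5", "medical", "text"),
    ModelEntry("example-gpt-vision", "GPT", "general", "multimodal"),
]

finance_models = filter_entries(CATALOG, domain="finance")
gpt_text_models = filter_entries(CATALOG, architecture="GPT", modality="text")
print([entry.name for entry in finance_models])
print([entry.name for entry in gpt_text_models])
```

Criteria are combined with a logical AND, so adding more keyword arguments narrows the result; calling filter_entries with no criteria returns the whole catalog.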

Quick Start & Requirements

Models are primarily accessed via Hugging Face (🤗HF) or ModelScope.
Installation typically involves using libraries like transformers or modelscope.
Specific hardware requirements (e.g., GPU, VRAM) depend on the model size and complexity.
Links to official documentation, demos, and quick-start guides are provided for individual models.
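
As a rough illustration of the points above, the sketch below estimates how much memory a model's weights occupy at a given precision and shows one common way to load a model with the transformers library. It is a sketch under stated assumptions, not guidance from the list itself: the function names are hypothetical, the default model id hfl/chinese-bert-wwm-ext is an assumed example, and the estimate covers weights only, so actual GPU requirements will be higher.

```python
# Approximate bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {
    "float32": 4.0,
    "float16": 2.0,
    "bfloat16": 2.0,
    "int8": 1.0,
    "int4": 0.5,
}


def weight_memory_gib(num_params, dtype="float16"):
    """Estimate memory for model weights only, in GiB.

    Activations, KV cache, and optimizer state are NOT included,
    so real requirements are higher than this figure.
    """
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


def load_model(model_id="hfl/chinese-bert-wwm-ext"):
    """Load a tokenizer and model from the Hugging Face Hub.

    Requires `pip install transformers torch` and network access.
    The default model id is an assumed example, not a recommendation.
    """
    # Imported lazily so the estimate above works without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModel.from_pretrained(model_id)
    return tokenizer, model


if __name__ == "__main__":
    # Weights of a 7-billion-parameter model stored in float16.
    print(f"{weight_memory_gib(7e9, 'float16'):.1f} GiB")
```

Halving the precision halves the weight footprint, which is why quantized variants of a model fit on smaller GPUs. Individual model pages remain the authoritative source for loading instructions and hardware requirements.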

Highlighted Details

Comprehensive coverage of foundational NLP models (BERT, RoBERTa, etc.) and modern LLMs.
Extensive lists of Chinese-specific models, including domain-specific and multimodal variants.
Curated resources for datasets, evaluation benchmarks (e.g., C-Eval, FlagEval), and related "awesome" lists.
Regular updates to include the latest models and research advancements.

Maintenance & Community

The project is community-driven, with contributions from various institutions and individuals.
Links to Hugging Face, ModelScope, and other relevant platforms are provided for community engagement.

Licensing & Compatibility

Model licenses vary, with many available under permissive licenses (e.g., Apache 2.0) allowing commercial use.
Users should verify the specific license of each model before deployment.

Limitations & Caveats

The list covers too many models for each one to be evaluated directly; users must assess suitability for their own needs.
Access to some models may require registration or application.

Health Check

Last Commit: 4 weeks ago
Responsiveness: 1 day
Pull Requests (30d): 0
Issues (30d): 0

Star History

25 stars in the last 30 days

Explore Similar Projects

Mengzi3 by Langboat

LLM for multilingual generation, especially Chinese

Created 1 year ago

Updated 1 year ago

Starred by Chip Huyen (Author of "AI Engineering", "Designing Machine Learning Systems").

awesome-japanese-llm by llm-jp

Japanese LLM list: models, benchmarks, datasets

Created 2 years ago

Updated 1 day ago

llms by IbrahimSobh

Collection of resources for large language models

Created 2 years ago

Updated 3 months ago

Mengzi by Langboat

Chinese language models for NLP tasks, emphasizing efficiency

Created 4 years ago

Updated 3 years ago

nlp_notes by YangBin1729

NLP notes for ML/DL principles, examples, and model deployment

Created 6 years ago

Updated 5 years ago

Pre-trained-Models by loujie0822

NLP pre-trained model overview

Created 6 years ago

Updated 5 years ago

Chinese-XLNet by ymcui

Chinese XLNet pre-trained models for NLP tasks

Created 6 years ago

Updated 6 months ago

Starred by Alexander Wu (Founder of MetaGPT).

GLM by THUDM

General language model for NLU, generation, and blank-filling tasks

Created 4 years ago

Updated 2 years ago

Starred by Yaowei Zheng (Author of LLaMA-Factory), Junyang Lin (Core Maintainer at Alibaba Qwen), and 1 more.

Fengshenbang-LM by IDEA-CCNL

Chinese foundation model ecosystem for AI infrastructure

Created 4 years ago

Updated 1 year ago

Chinese-BERT-wwm by ymcui

Pre-trained language models for Chinese NLP tasks

Created 6 years ago

Updated 6 months ago

Starred by Jeremy Howard (Cofounder of fast.ai), Alex Cheema (Cofounder of EXO Labs), and 22 more.

unilm by microsoft

Foundation models for language, vision, speech, and multimodal tasks

Created 6 years ago

Updated 3 weeks ago

Starred by Alexander Borzunov (Research Scientist at OpenAI), Stas Bekman (Author of "Machine Learning Engineering Open Book"; Research Engineer at Snowflake), and 2 more.

nlp_course by yandexdataschool

NLP course materials

Created 7 years ago

Updated 1 month ago
