This project provides open-source Chinese LLaMA and Alpaca large language models for Chinese NLP research. It addresses the need for stronger Chinese language understanding and instruction following by expanding the original LLaMA vocabulary with Chinese tokens, further pre-training on Chinese data, and fine-tuning with Chinese instruction datasets. The models are aimed at researchers and developers working on Chinese NLP tasks.
How It Works
The project extends the original LLaMA models with a larger Chinese vocabulary, making encoding and decoding of Chinese text far more efficient. Chinese LLaMA models are further pre-trained on large-scale Chinese corpora, while Chinese Alpaca models additionally undergo instruction fine-tuning on Chinese instruction datasets. This markedly improves the models' ability to understand and follow instructions, enabling ChatGPT-style interaction for Chinese language tasks.
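The efficiency gain from the expanded vocabulary is easy to verify: the extended tokenizer maps Chinese text to far fewer tokens than the original LLaMA tokenizer, which falls back to byte-level pieces for most Chinese characters. Below is a minimal sketch; the paths are placeholders for tokenizers you have downloaded locally.

```python
# Minimal sketch comparing tokenization efficiency. The paths below are
# placeholders (assumptions) -- substitute your local tokenizer directories.
from transformers import AutoTokenizer

text = "大规模语言模型在中文自然语言处理任务中表现出色。"

# Original LLaMA tokenizer (converted to Hugging Face format).
orig_tok = AutoTokenizer.from_pretrained("path/to/original-llama-7b-hf")
# Extended Chinese tokenizer shipped with the Chinese-LLaMA LoRA weights.
cn_tok = AutoTokenizer.from_pretrained("path/to/chinese-llama-lora-7b")

# The original tokenizer splits most Chinese characters into multiple byte
# pieces; the extended tokenizer encodes them as whole characters or words.
print(len(orig_tok.tokenize(text)))
print(len(cn_tok.tokenize(text)))
```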
Quick Start & Requirements
- Installation: Models are distributed as LoRA weights and must be merged with the original LLaMA weights before use; a minimal merging sketch follows this list, and full merging and deployment instructions are in the project's wiki.
- Prerequisites: Original LLaMA model weights (obtained via application), Python, and optionally CUDA for GPU acceleration.
- Resources: Merged models can be quantized to 4-bit for local CPU/GPU deployment, with sizes ranging from 3.9 GB (7B) to 17.2 GB (33B).
- Links: Model Downloads, Merging Models, Local Deployment
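As noted above, the LoRA weights must be merged into the base model before deployment. The following is a hedged sketch of the general 🤗 peft merging flow; the paths are placeholders, and the project's own merge script (see the wiki) is the authoritative procedure, particularly for handling the expanded vocabulary.

```python
# Sketch of merging Chinese LoRA weights into the original LLaMA base model.
# Paths are placeholders; the project's merge script in the wiki is canonical.
import torch
from peft import PeftModel
from transformers import LlamaForCausalLM, LlamaTokenizer

base = LlamaForCausalLM.from_pretrained(
    "path/to/original-llama-7b-hf", torch_dtype=torch.float16
)

# The extended tokenizer ships with the LoRA weights. Its vocabulary is larger
# than the original LLaMA vocabulary, so the embeddings must be resized first.
tokenizer = LlamaTokenizer.from_pretrained("path/to/chinese-alpaca-lora-7b")
base.resize_token_embeddings(len(tokenizer))

# Load the LoRA weights on top of the base model, then fold them in.
model = PeftModel.from_pretrained(base, "path/to/chinese-alpaca-lora-7b")
merged = model.merge_and_unload()  # bakes the LoRA deltas into the base weights

# Save the merged model together with the extended tokenizer.
merged.save_pretrained("path/to/merged-chinese-alpaca-7b")
tokenizer.save_pretrained("path/to/merged-chinese-alpaca-7b")
```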
Highlighted Details
- Offers multiple model sizes (7B, 13B, 33B) in both base (LLaMA) and instruction-tuned (Alpaca) variants, with "Pro" versions that mitigate the short-reply issue of earlier instruction-tuned releases.
- Supports integration with popular frameworks such as 🤗transformers, llama.cpp, text-generation-webui, LangChain, and privateGPT; see the inference sketch after this list.
- Provides training scripts for pre-training and instruction fine-tuning, allowing users to further train models.
- Achieved competitive results on the C-Eval benchmark for Chinese language understanding.
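Once merged, the model can be served through any of the frameworks above. The sketch below loads the merged model in 4-bit via 🤗 transformers with bitsandbytes (an alternative to the llama.cpp quantization path mentioned earlier); the path and the prompt template are assumptions, so consult the project's wiki for the exact template used during instruction tuning.

```python
# Hedged inference sketch: 4-bit loading with transformers + bitsandbytes.
# The model path is a placeholder, and the Alpaca-style prompt template is an
# assumption -- verify it against the project's wiki before relying on it.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

path = "path/to/merged-chinese-alpaca-7b"
tokenizer = AutoTokenizer.from_pretrained(path)
model = AutoModelForCausalLM.from_pretrained(
    path,
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
    device_map="auto",
)

# Standard Alpaca-style instruction prompt (assumed format).
prompt = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n用一句话介绍一下大熊猫。\n\n### Response:\n"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(
    **inputs, max_new_tokens=128, do_sample=True, temperature=0.7
)
# Print only the newly generated tokens, skipping the echoed prompt.
print(tokenizer.decode(
    output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
))
```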
Maintenance & Community
- The project is actively maintained, with recent releases including Llama-3 based models (Chinese-LLaMA-3-8B, Llama-3-Chinese-8B-Instruct).
- Community discussions are available via GitHub Issues and Discussions.
Licensing & Compatibility
- The original LLaMA license prohibits commercial use, and the project's LoRA weights are likewise restricted to academic research; commercial use is not permitted.
Limitations & Caveats
- Models may generate unpredictable, harmful, or biased content.
- Due to compute and data constraints, training is not exhaustive, and the models' Chinese understanding still has room for improvement.
- No interactive online demo is currently available; local deployment is necessary.