Awesome-Chinese-LLM by HqWu-HITCS

A collection of Chinese LLMs focused on smaller, privately deployable models with lower training costs

Created 2 years ago
21,223 stars

Top 2.1% on SourcePulse

Project Summary

This repository is a curated collection of open-source Chinese Large Language Models (LLMs), focusing on smaller, deployable, and cost-effective models. It serves researchers, developers, and power users by cataloging base models, domain-specific fine-tuned models, datasets, and tutorials, aiming to foster the development and application of Chinese LLMs.

How It Works

The project is a directory of more than 100 resources related to Chinese LLMs. It groups them into models (text and multimodal), applications (domain-specific fine-tuning, LangChain applications, and more), datasets (pre-training, SFT, preference), training and fine-tuning frameworks, inference and deployment frameworks, evaluation benchmarks, and tutorials. The README also includes a table comparing key features of popular base models such as ChatGLM, LLaMA, and Qwen.

Quick Start & Requirements

This repository is a collection of links and information, not a runnable application itself. Users are directed to individual GitHub repositories for specific models and tools, each with its own installation and usage instructions.

Highlighted Details

  • Comprehensive catalog of over 100 Chinese LLM resources.
  • Detailed comparison table of major Chinese LLM base models.
  • Extensive categorization covering models, applications, datasets, frameworks, evaluation, and tutorials.
  • Focus on models that are smaller, privately deployable, and have lower training costs.

Maintenance & Community

The project is actively maintained and welcomes community contributions via Pull Requests. Users can find links to related repositories and discussions.

Licensing & Compatibility

Licensing varies by the individual projects linked from the repository. Consult the license of each model, dataset, or tool before using it commercially or in a closed-source product.

Limitations & Caveats

As a curated list, this repository does not provide direct functionality. Users must navigate to individual project repositories to assess their specific features, requirements, and licenses. The quality and status of linked projects may vary.

Health Check

  • Last commit: 4 months ago
  • Responsiveness: 1 day
  • Pull requests (30d): 1
  • Issues (30d): 0
  • Star history: 283 stars in the last 30 days

Explore Similar Projects

Starred by Rodrigo Nader (Cofounder of Langflow), Shizhe Diao (Author of LMFlow; Research Scientist at NVIDIA), and 11 more.

Awesome-LLM by Hannibal046

Top 0.3%
25k stars
Curated list of Large Language Model resources
Created 2 years ago
Updated 1 month ago