GLM-130B by zai-org

Bilingual model for research and evaluation

created 3 years ago
7,685 stars

Top 6.9% on sourcepulse

View on GitHub
Project Summary

GLM-130B is an open-source, 130-billion-parameter, bilingual (English/Chinese) language model designed for researchers and developers working with large-scale NLP models. It reports strong results on standard English and Chinese benchmarks and supports efficient inference through INT4 quantization and optimized inference libraries.
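
As a rough illustration of why weight quantization matters at 130B-parameter scale, the sketch below shows simple per-row symmetric 4-bit quantization in PyTorch. This is a toy example under our own naming, not GLM-130B's actual INT4 kernels or API; it only demonstrates the precision-for-memory trade-off the summary refers to.

```python
import torch

def quantize_int4_symmetric(weight: torch.Tensor):
    """Per-row symmetric quantization of a weight matrix to 4-bit integers.

    Toy illustration only: GLM-130B ships its own INT4 weight-quantized
    kernels; this just shows the basic precision-for-memory trade-off.
    """
    # One scale per output row; the signed 4-bit range is [-8, 7].
    max_abs = weight.abs().amax(dim=1, keepdim=True).clamp(min=1e-8)
    scale = max_abs / 7.0
    q = torch.clamp(torch.round(weight / scale), -8, 7).to(torch.int8)
    return q, scale

def dequantize(q: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    # Recover an approximate full-precision weight for matmul at inference.
    return q.to(scale.dtype) * scale

if __name__ == "__main__":
    w = torch.randn(4, 8)
    q, s = quantize_int4_symmetric(w)
    print("max abs error:", (w - dequantize(q, s)).abs().max().item())
```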

How It Works

GLM-130B utilizes the General Language Model (GLM) pre-training approach, which combines autoregressive blank infilling with bidirectional context. This allows it to excel at both left-to-right generation and filling in masked segments of text. The model is designed for efficient deployment, supporting INT4 quantization to enable inference on consumer-grade hardware.
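
To make the blank-infilling objective concrete, here is a toy, string-level sketch of how an input/target pair could be laid out: a span is cut out of the text and replaced by a [MASK] placeholder, the model reads the corrupted context bidirectionally, then generates the missing span autoregressively. The function name and the [sop] separator are illustrative assumptions; the real model operates on token IDs with its own special tokens (e.g., [gMASK] for long generation).

```python
def build_blank_infilling_example(text: str, span_start: int, span_end: int):
    """Toy, string-level illustration of GLM-style autoregressive blank infilling.

    Part A (the corrupted text) is attended to bidirectionally; Part B (the
    missing span) is generated left-to-right after a start-of-piece marker.
    """
    span = text[span_start:span_end]                   # the blank to recover
    corrupted = text[:span_start] + "[MASK]" + text[span_end:]
    model_input = corrupted + " [sop]"                 # illustrative separator
    target = span
    return model_input, target

if __name__ == "__main__":
    text = "GLM-130B is a bilingual language model."
    start = text.find("bilingual")
    inp, tgt = build_blank_infilling_example(text, start, start + len("bilingual"))
    print("input :", inp)   # GLM-130B is a [MASK] language model. [sop]
    print("target:", tgt)   # bilingual
```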

Quick Start & Requirements

Health Check
Last commit: 2 years ago
Responsiveness: Inactive
Pull Requests (30d): 0
Issues (30d): 0
Star History: 27 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), Jared Palmer (Ex-VP of AI at Vercel; Founder of Turborepo; Author of Formik, TSDX), and 1 more.

mpt-30B-inference by abacaj
0% · 575 stars
CPU inference code for MPT-30B
created 2 years ago · updated 2 years ago
Starred by Lysandre Debut (Chief Open-Source Officer at Hugging Face), Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), and 1 more.

AQLM by Vahe1994
0.1% · 1k stars
PyTorch code for LLM compression via Additive Quantization (AQLM)
created 1 year ago · updated 2 months ago
Starred by Chip Huyen (Author of AI Engineering, Designing Machine Learning Systems), Georgios Konstantopoulos (CTO, General Partner at Paradigm), and 2 more.

GPTQ-for-LLaMa by qwopqwop200
0.0% · 3k stars
4-bit quantization for LLaMA models using GPTQ
created 2 years ago · updated 1 year ago