chatglm-web  by NCZkevin

Web UI for local ChatGLM deployment

created 2 years ago
470 stars

Top 65.7% on sourcepulse

GitHubView on GitHub
Project Summary

This project provides a self-hostable web interface for the ChatGLM language model, aiming to replicate the user experience of ChatGPT. It's designed for users who want to run a powerful conversational AI locally, offering offline capabilities and the flexibility to use custom-tuned GLM models.

How It Works

The project utilizes FastAPI for the backend API and Vue 3 for the frontend. It supports streaming output from ChatGLM models, allowing users to adjust parameters, manage conversation history, and save outputs as images. The architecture is forked from existing popular ChatGPT web UIs, incorporating features like knowledge base Q&A, though some advanced features from the original repositories are still under development.

Quick Start & Requirements

  • Backend: python main.py (with optional arguments for device, quantization, host, port). Requires Python 3.8+.
  • Frontend: pnpm bootstrap then pnpm dev. Requires Node.js (v16 or v18 recommended).
  • Dependencies: pnpm for frontend, requirements.txt for backend.
  • Hardware: GPU with 6GB VRAM (INT4 quantization) to 13GB VRAM (FP16) for inference.
  • Knowledge Base: Run python gen_data.py before starting the API.
  • Docs: https://github.com/NCZkevin/chatglm-web

Highlighted Details

  • Supports ChatGLM-6B and potentially other models.
  • Offers offline and self-deployable operation.
  • Features include streaming output, parameter tuning, and context selection.
  • Includes a knowledge base Q&A functionality.

Maintenance & Community

The project is a fork of Chanzhaoyu/chatgpt-web and WenJing95/chatgpt-web. Contributions are guided by a contribution guide.

Licensing & Compatibility

MIT License. Permissive for commercial use and integration with closed-source applications.

Limitations & Caveats

Some features from parent repositories (e.g., permissions, prompt store, Langchain integration) are marked as "to be implemented." Docker deployment instructions are pending. The project relies on specific Node.js versions and may require manual configuration for certain frontend behaviors like typewriter effects when behind reverse proxies.

Health Check
Last commit

2 years ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
8 stars in the last 90 days

Explore Similar Projects

Starred by Jared Palmer Jared Palmer(Ex-VP of AI at Vercel; Founder of Turborepo; Author of Formik, TSDX), Jeff Hammerbacher Jeff Hammerbacher(Cofounder of Cloudera), and
1 more.

promptable by cfortuner

0%
2k
TS/JS library for building full-stack AI apps
created 2 years ago
updated 2 years ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Andre Zayarni Andre Zayarni(Cofounder of Qdrant), and
2 more.

RealChar by Shaunwei

0.1%
6k
Real-time AI character/companion creation and interaction codebase
created 2 years ago
updated 1 year ago
Starred by Addy Osmani Addy Osmani(Engineering Leader on Google Chrome), Victor Taelin Victor Taelin(Author of Bend, Kind, HVM), and
1 more.

chatbox by chatboxai

0.3%
36k
Desktop client app for AI models/LLMs
created 2 years ago
updated 6 days ago
Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Pietro Schirano Pietro Schirano(Founder of MagicPath), and
1 more.

SillyTavern by SillyTavern

3.2%
17k
LLM frontend for power users
created 2 years ago
updated 3 days ago
Feedback? Help us improve.