HuixiangDou by InternLM

LLM-based assistant for group chat technical support

Created 2 years ago

2,479 stars

Top 18.3% on SourcePulse

View on GitHub

1 Expert Loves This Project

Chip Huyen

Author of "AI Engineering", "Designing Machine Learning Systems"

Project Summary

HuixiangDou is an LLM-based technical assistant designed to overcome the challenges of group chat scenarios and provide knowledge-based assistance. It targets developers and researchers looking for efficient, customizable AI solutions for information retrieval and Q&A, offering a flexible architecture that can be deployed across various platforms with minimal resource requirements.

How It Works

HuixiangDou employs a three-stage pipeline: preprocessing, rejection, and response, to manage group chat interactions effectively without flooding users with messages. It supports hybrid retrieval methods, combining dense retrieval for documents and sparse retrieval for code, and integrates knowledge graphs and internet search capabilities. This approach aims to provide precise and contextually relevant answers by leveraging multiple information sources.

Quick Start & Requirements

Install: pip install -r requirements.txt
Prerequisites: Python 3.8+, faiss-gpu (for GPU), apt packages for document parsing (e.g., poppler-utils, tesseract-ocr). CPU-only requires requirements-cpu.txt.
Configuration: Requires setting up LLM API keys or local LLM configurations in config.ini.
Resources: Standard Edition (text-only) requires 2GB GPU memory; Multimodal Edition (image/text retrieval) requires 10GB GPU memory. CPU-only mode is available.
Docs: Read the Docs, OpenXLab Web, Gradio Demo

Highlighted Details

Supports a wide range of file formats (Excel, HTML, Markdown, PDF, TXT, Word) and retrieval methods (dense, sparse, knowledge graph, internet search, image/text).
Offers integrations with WeChat, Feishu, and a web UI/API for broader application.
Claims industrial-grade and commercially viable source code with verified functionality for one year.
Recent updates include support for coreference resolution, multimodal retrieval, and simplified deployment.

Maintenance & Community

The project is actively developed with recent updates in 2024 and planned features for 2025. Links to community resources like BiliBili and YouTube are provided for demonstrations.

Licensing & Compatibility

The project's licensing is not explicitly stated in the README, which may pose a compatibility concern for commercial or closed-source use.

Limitations & Caveats

The README mentions that WeChat integration has associated costs. Specific LLM configurations and hardware requirements (especially for multimodal features) need careful setup. The project's licensing status requires clarification for commercial adoption.

Health Check

Last Commit

3 months ago

Responsiveness

1 day

Pull Requests (30d)

Issues (30d)

Star History

19 stars in the last 30 days