Web-enhanced question answering system using a 10B GLM
Top 26.7% on sourcepulse
WebGLM is an efficient web-enhanced question-answering system designed for researchers and developers seeking to integrate real-time web search and retrieval into large language models. It leverages the 10-billion-parameter General Language Model (GLM) to provide accurate and human-like answers by augmenting it with web search capabilities.
How It Works
WebGLM employs a three-pronged approach: an LLM-augmented Retriever to fetch relevant web content, a bootstrapped Generator that uses the GLM to formulate answers based on retrieved information, and a Human Preference-aware Scorer to evaluate and prioritize response quality. This architecture aims to improve the efficiency and cost-effectiveness of deploying QA systems in real-world scenarios.
Quick Start & Requirements
pip install -r requirements.txt
, playwright install
--searcher bing
with Playwright). Retriever checkpoint download required.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Model_License
.Limitations & Caveats
The README mentions that the current version of ChatGLM2-6B (related to the GLM family) has limited understanding of single-round ultra-long documents, which is a focus for future optimization. The use of open-sourced data is restricted to research purposes.
4 months ago
1 week