Open-source toolkit for building specialized/general-purpose chat models
Top 5.8% on sourcepulse
OpenChatKit provides an open-source toolkit for building and deploying conversational AI models. It offers pre-trained chat models, fine-tuning scripts, and an extensible retrieval system, targeting developers and researchers aiming to create specialized or general-purpose dialogue agents.
How It Works
The kit leverages instruction-tuned large language models, including a 20B parameter GPT-NeoXT-Chat-Base-20B and a 7B parameter Pythia-Chat-Base-7B. It supports fine-tuning on custom datasets and integrates a retrieval-augmented generation (RAG) capability using a Faiss index for incorporating external knowledge.
Quick Start & Requirements
mamba env create -f environment.yml
and activate it with conda activate OpenChatKit
.python inference/bot.py --model togethercomputer/Pythia-Chat-Base-7B
.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The retrieval augmentation feature is experimental and requires significant time to load the index. Specific hardware requirements for running the larger 20B parameter model are not detailed.
1 year ago
Inactive