openchat  by hyunwoongko

Open source chatting framework for neural network-based dialogue

created 4 years ago
432 stars

Top 69.9% on sourcepulse

GitHubView on GitHub
Project Summary

OpenChat provides an easy-to-use Python framework for building conversational AI applications, supporting over 40 dialogue models based on neural networks. It targets developers and researchers looking to quickly integrate diverse chatbot capabilities, from general conversation to specialized tasks like safety and knowledge grounding, with minimal code.

How It Works

The framework simplifies interaction with various pre-trained language models by abstracting away complex setup and inference logic. Users instantiate an OpenChat object, specifying a model name and device (CPU or GPU). It supports multiple decoding strategies (greedy, beam, top-k, nucleus) and allows fine-tuning of parameters like top_k, no_repeat_ngram_size, and length_penalty for customized output generation.

Quick Start & Requirements

  • Install via pip: pip install openchat
  • Requires Python. GPU acceleration is supported via device="cuda" or device="cuda:n".
  • Official documentation and supported models list are available via links in the README.

Highlighted Details

  • Supports 40+ dialogue models including GPT-Neo, Blender, DialoGPT, Dodecathlon, and safety models.
  • Enables custom persona settings for ConvAI2 and topic settings for Wizard of Wikipedia models.
  • Offers specialized models for offensive and sensitive topic classification.
  • Allows fine-grained control over decoding parameters for output generation.

Maintenance & Community

The project is maintained by Hyunwoong Ko. No specific community channels (like Discord/Slack) or roadmap links are provided in the README.

Licensing & Compatibility

Licensed under the Apache License, Version 2.0. This license is permissive and generally compatible with commercial use and closed-source applications.

Limitations & Caveats

The README does not specify hardware requirements beyond GPU support, nor does it detail performance benchmarks or potential limitations of the supported models. Community support and project roadmap are not explicitly detailed.

Health Check
Last commit

2 years ago

Responsiveness

1+ week

Pull Requests (30d)
0
Issues (30d)
0
Star History
0 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.