OpenVoiceChat by Finity-Alpha

Natural voice conversations with LLMs

Created 2 years ago
256 stars

Top 98.7% on SourcePulse

Project Summary

OpenVoiceChat enables natural, conversational voice interactions with Large Language Models (LLMs). It is designed for researchers and developers seeking an open-source alternative to commercial voice AI solutions, offering extensive customization for Speech-to-Text (STT), Text-to-Speech (TTS), and LLM backends.

How It Works

The project provides a modular architecture allowing users to integrate various STT, TTS, and LLM models. This flexibility is achieved through well-abstracted APIs, facilitating easy extension and customization. The system supports interruptions, enabling more natural conversational flow.
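To make the modular idea concrete, here is a minimal sketch of how swappable STT, LLM, and TTS stages with interruption support could be wired together. Every class and function name below (SpeechToText, Chatbot, TextToSpeech, run_turn, and the toy backends) is hypothetical and chosen for illustration; none of it is taken from the OpenVoiceChat API, so consult the project documentation for the real interfaces.

```python
from abc import ABC, abstractmethod
from typing import Callable, Iterator, List


# Hypothetical interfaces: illustrative only, not OpenVoiceChat's classes.
class SpeechToText(ABC):
    @abstractmethod
    def transcribe(self, audio: bytes) -> str:
        """Turn captured audio into text."""


class Chatbot(ABC):
    @abstractmethod
    def reply(self, text: str) -> Iterator[str]:
        """Yield the response one sentence at a time."""


class TextToSpeech(ABC):
    @abstractmethod
    def speak(self, sentence: str) -> None:
        """Play (or here, record) one sentence."""


# Toy backends so the sketch runs without models or audio hardware.
class EchoSTT(SpeechToText):
    def transcribe(self, audio: bytes) -> str:
        return audio.decode("utf-8")


class CannedChatbot(Chatbot):
    def reply(self, text: str) -> Iterator[str]:
        yield f"You said: {text}."
        yield "Here is a second sentence."
        yield "And a third."


class RecordingTTS(TextToSpeech):
    def __init__(self) -> None:
        self.spoken: List[str] = []

    def speak(self, sentence: str) -> None:
        self.spoken.append(sentence)


def run_turn(
    stt: SpeechToText,
    llm: Chatbot,
    tts: TextToSpeech,
    audio: bytes,
    interrupted: Callable[[], bool],
) -> List[str]:
    """Run one conversational turn, stopping early if the user interrupts."""
    text = stt.transcribe(audio)
    delivered: List[str] = []
    for sentence in llm.reply(text):
        # Check before each sentence so an interruption cuts the reply short.
        if interrupted():
            break
        tts.speak(sentence)
        delivered.append(sentence)
    return delivered


if __name__ == "__main__":
    flags = iter([False, True])  # user interrupts before the second sentence
    print(run_turn(EchoSTT(), CannedChatbot(), RecordingTTS(), b"hello",
                   interrupted=lambda: next(flags)))
```

Because each stage depends only on its abstract interface, replacing one backend (for example, a different TTS engine) should not require changes to the others. Checking for an interruption between sentences is one simple design choice; a real system would more likely monitor the microphone concurrently.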

Quick Start & Requirements

  • Install via pip: pip install openvoicechat
  • For specific backends: pip install openvoicechat[piper,openai,transformers]
  • Supported backends include piper (Linux/macOS only), vosk, openai, tortoise, xtts, and transformers.
  • Run locally: python main.py local
  • Documentation: https://www.finityalpha.com/OpenVoiceChat/docs/
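Since piper is listed as Linux/macOS only, an install script may want to skip that extra on Windows. The sketch below builds the pip command accordingly; the helper name pip_install_command and the WINDOWS_UNSUPPORTED set are assumptions made for illustration and are not part of the package.

```python
import platform
from typing import Iterable, Optional

# Assumption drawn from the backend list above: piper is Linux/macOS only.
WINDOWS_UNSUPPORTED = {"piper"}


def pip_install_command(extras: Iterable[str], system: Optional[str] = None) -> str:
    """Build a pip command, dropping extras that do not work on Windows."""
    system = system or platform.system()  # "Windows", "Linux", or "Darwin"
    usable = [
        extra for extra in extras
        if not (system == "Windows" and extra in WINDOWS_UNSUPPORTED)
    ]
    if not usable:
        return "pip install openvoicechat"
    return f"pip install openvoicechat[{','.join(usable)}]"


if __name__ == "__main__":
    print(pip_install_command(["piper", "openai", "transformers"]))
```

Passing system explicitly makes the helper easy to check for a platform other than the one it runs on.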

Highlighted Details

  • Supports a wide array of STT, TTS, and LLM models.
  • Designed for extensibility and ease of use.
  • Enables natural conversational interactions with LLMs.
  • Supports conversational interruptions.

Licensing & Compatibility

  • The README does not specify a license. Compatibility for commercial or closed-source use is undetermined.

Limitations & Caveats

The piper TTS backend is explicitly stated as not working on Windows. Because no license is specified, the terms for commercial use and redistribution are unclear.

Health Check

  • Last commit: 9 months ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 4 stars in the last 30 days
