RealtimeVoiceChat by KoljaB

Real-time voice chat with AI using streaming audio

created 3 months ago
2,858 stars

Top 17.0% on sourcepulse

Project Summary

This project provides a real-time, voice-driven conversational AI experience, enabling users to speak naturally with an LLM and receive spoken responses. It's designed for users seeking a fluid, interactive AI companion, offering low-latency communication through a client-server architecture.

How It Works

The system captures voice via the browser, streams audio chunks to a Python backend using WebSockets, and transcribes speech to text using RealtimeSTT. The text is processed by an LLM (Ollama or OpenAI), and the AI's response is synthesized into speech by RealtimeTTS. Audio is streamed back to the browser for playback, with support for interruptions and dynamic silence detection for natural turn-taking.
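The flow described above can be sketched as a single conversational turn. This is a minimal illustration, not the project's actual code: `transcribe`, `generate_reply`, `synthesize`, `handle_turn`, and `run_turn` are hypothetical placeholder names, and the three stage functions are trivial stand-ins for RealtimeSTT, the LLM backend, and RealtimeTTS respectively. It only shows how streaming stages can be chained and how an interruption flag can stop playback early.

```python
import asyncio


async def transcribe(audio_chunks):
    """Placeholder STT stage (stand-in for RealtimeSTT): decode chunks to text."""
    return " ".join(chunk.decode("utf-8") for chunk in audio_chunks)


async def generate_reply(text):
    """Placeholder LLM stage (stand-in for Ollama/OpenAI): stream words one by one."""
    for word in f"You said: {text}".split():
        await asyncio.sleep(0)  # yield to the event loop, as a real token stream would
        yield word


async def synthesize(word):
    """Placeholder TTS stage (stand-in for RealtimeTTS): encode a word as bytes."""
    return word.encode("utf-8")


async def handle_turn(audio_chunks, interrupted):
    """Run one turn (STT -> LLM -> TTS), stopping early if the user interrupts."""
    text = await transcribe(audio_chunks)
    audio_out = []
    async for word in generate_reply(text):
        if interrupted.is_set():
            break  # the user started speaking again: drop the rest of the reply
        audio_out.append(await synthesize(word))
    return audio_out


def run_turn(audio_chunks, interrupt_first=False):
    """Synchronous helper for trying the sketch outside a server."""
    async def _main():
        interrupted = asyncio.Event()
        if interrupt_first:
            interrupted.set()
        return await handle_turn(audio_chunks, interrupted)

    return asyncio.run(_main())
```

In a real deployment, the input chunks would arrive over a WebSocket and the synthesized audio would be sent back over the same connection instead of being collected in a list.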

Quick Start & Requirements

  • Docker Installation (Recommended):
    1. docker compose build
    2. docker compose up -d
    3. docker compose exec ollama ollama pull hf.co/bartowski/huihui-ai_Mistral-Small-24B-Instruct-2501-abliterated-GGUF:Q4_K_M
  • Prerequisites:
    • NVIDIA GPU with CUDA 12.1 (highly recommended for performance).
    • Docker Engine and Docker Compose v2+ (for Docker setup).
    • Python 3.9+ (for manual setup).
    • Linux recommended for optimal Docker GPU integration.
    • A PyTorch build matching your CUDA version (see the PyTorch installation instructions).
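Before starting, it may help to confirm that the basic tools from the prerequisites list are present. The following is a rough sketch using only the Python standard library; `check_prerequisites` is a hypothetical helper, not part of the project. It only checks that the executables are on `PATH`, so it cannot confirm the CUDA version, the Docker Compose version, or that GPU passthrough works inside containers.

```python
import shutil
import sys


def check_prerequisites(min_python=(3, 9)):
    """Return a dict mapping each prerequisite to True (found) or False (missing)."""
    return {
        # Python 3.9+ is listed for the manual setup
        "python": sys.version_info >= min_python,
        # Docker CLI is needed for the Docker setup
        "docker": shutil.which("docker") is not None,
        # nvidia-smi on PATH suggests an NVIDIA driver is installed (assumption:
        # this does not verify the CUDA version)
        "nvidia-smi": shutil.which("nvidia-smi") is not None,
    }


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'found' if ok else 'missing'}")
```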

Highlighted Details

  • Low-latency focus via audio chunk streaming and WebSockets.
  • Pluggable LLM backends (Ollama default, OpenAI support).
  • Customizable Text-to-Speech engines (Kokoro, Coqui, Orpheus).
  • Web Interface using Vanilla JS and Web Audio API.
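One common way to make LLM backends pluggable is a small shared interface plus a registry keyed by name. The sketch below illustrates that general pattern only; `LLMBackend`, `EchoBackend`, `UpperBackend`, `BACKENDS`, and `make_backend` are hypothetical names, and the toy backends stand in for real Ollama or OpenAI clients. The project's own abstraction may look different.

```python
from typing import Callable, Dict, Iterator, Protocol


class LLMBackend(Protocol):
    """Anything that can stream a reply for a prompt counts as a backend."""

    def stream(self, prompt: str) -> Iterator[str]: ...


class EchoBackend:
    """Toy backend: streams the prompt back word by word."""

    def stream(self, prompt: str) -> Iterator[str]:
        yield from prompt.split()


class UpperBackend:
    """Toy backend: streams the upper-cased prompt word by word."""

    def stream(self, prompt: str) -> Iterator[str]:
        yield from prompt.upper().split()


# Registry mapping a configuration name to a backend factory (hypothetical names).
BACKENDS: Dict[str, Callable[[], LLMBackend]] = {
    "echo": EchoBackend,
    "upper": UpperBackend,
}


def make_backend(name: str) -> LLMBackend:
    """Look up a backend by name, failing clearly on an unknown name."""
    try:
        return BACKENDS[name]()
    except KeyError:
        raise ValueError(f"unknown backend: {name!r}") from None
```

With this shape, the rest of the pipeline depends only on `stream`, so switching providers becomes a configuration change rather than a code change.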

Maintenance & Community

  • Contributions are welcome via issues and pull requests.

Licensing & Compatibility

  • Core codebase: MIT License.
  • External TTS engines and LLM providers have their own licenses; users must comply.

Limitations & Caveats

Performance on CPU-only systems or weaker GPUs will be significantly slower. Manual installation, especially on non-Windows systems or with different CUDA versions, may require troubleshooting.

Health Check

  • Last commit: 3 weeks ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 5
  • Star history: 2,380 stars in the last 90 days
