Discover and explore top open-source AI tools and projects—updated daily.
KoljaBReal-time voice chat with AI using streaming audio
Top 14.6% on SourcePulse
This project provides a real-time, voice-driven conversational AI experience, enabling users to speak naturally with an LLM and receive spoken responses. It's designed for users seeking a fluid, interactive AI companion, offering low-latency communication through a client-server architecture.
How It Works
The system captures voice via the browser, streams audio chunks to a Python backend using WebSockets, and transcribes speech to text using RealtimeSTT. The text is processed by an LLM (Ollama or OpenAI), and the AI's response is synthesized into speech by RealtimeTTS. Audio is streamed back to the browser for playback, with support for interruptions and dynamic silence detection for natural turn-taking.
Quick Start & Requirements
docker compose builddocker compose up -ddocker compose exec ollama ollama pull hf.co/bartowski/huihui-ai_Mistral-Small-24B-Instruct-2501-abliterated-GGUF:Q4_K_MHighlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
CPU-only or weaker GPU performance will be significantly slower. Manual installation, especially on non-Windows systems or with different CUDA versions, may require troubleshooting.
3 months ago
1 day
gradio-app
OpenBMB