Voice chat for low-latency AI companion interaction
This project provides a low-latency AI voice chat companion, enabling real-time spoken interaction with AI models. It is designed for users seeking a responsive and natural voice-based AI experience, leveraging advanced speech recognition and synthesis technologies.
How It Works
The system uses faster_whisper for efficient speech-to-text conversion and ElevenLabs' streaming API for text-to-speech synthesis. This combination allows near real-time processing of spoken input and generation of AI responses, minimizing latency for a more fluid conversation. Two modes are offered: voice_talk_vad.py for automatic speech detection and voice_talk.py for manual recording control via the spacebar.
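The end-to-end loop is easiest to picture as three stages: transcribe the recording, generate a reply, and stream it back as audio. The sketch below is illustrative rather than the repository's actual code; it assumes the OpenAI 1.x Python client and the pre-1.0 elevenlabs SDK's generate/stream helpers (call shapes differ between SDK versions), and the model size, voice name, and function name are placeholders.

```python
# Illustrative transcribe -> respond -> speak pipeline (not the project's code).
from faster_whisper import WhisperModel
from openai import OpenAI
from elevenlabs import generate, stream  # pre-1.0 elevenlabs SDK helpers

whisper = WhisperModel("base.en", device="cpu", compute_type="int8")
client = OpenAI()  # reads OPENAI_API_KEY from the environment

def respond_to_recording(wav_path: str) -> None:
    # 1. Speech-to-text with faster_whisper
    segments, _ = whisper.transcribe(wav_path)
    user_text = " ".join(seg.text for seg in segments).strip()

    # 2. Generate a reply with the OpenAI chat API
    reply = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": user_text}],
    ).choices[0].message.content

    # 3. Stream the reply through ElevenLabs text-to-speech (plays as it arrives)
    stream(generate(text=reply, voice="Rachel", stream=True))
```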
Quick Start & Requirements
Install the dependencies: pip install openai elevenlabs pyaudio wave keyboard faster_whisper numpy torch
Then run python voice_talk_vad.py or python voice_talk.py.
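For the manual mode, the core idea is a spacebar-gated capture loop that writes a WAV file for transcription. The snippet below is a rough approximation of that idea, not code from voice_talk.py; the sample rate, chunk size, and output filename are assumptions.

```python
# Sketch of spacebar-controlled recording (illustrative, not voice_talk.py itself).
import wave
import keyboard
import pyaudio

RATE, CHUNK = 16000, 1024

def record_while_space_held(path: str = "input.wav") -> str:
    pa = pyaudio.PyAudio()
    stream = pa.open(format=pyaudio.paInt16, channels=1, rate=RATE,
                     input=True, frames_per_buffer=CHUNK)
    sample_width = pa.get_sample_size(pyaudio.paInt16)

    print("Hold SPACE to talk...")
    keyboard.wait("space")                 # block until the spacebar is pressed
    frames = []
    while keyboard.is_pressed("space"):    # capture audio while it stays held
        frames.append(stream.read(CHUNK))

    stream.stop_stream(); stream.close(); pa.terminate()

    with wave.open(path, "wb") as wf:      # save the capture for transcription
        wf.setnchannels(1)
        wf.setsampwidth(sample_width)
        wf.setframerate(RATE)
        wf.writeframes(b"".join(frames))
    return path
```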
Highlighted Details
Low latency through faster_whisper and ElevenLabs input streaming.
Core dependencies: openai, elevenlabs, pyaudio, keyboard, and faster_whisper.
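The presence of numpy and torch in the dependency list suggests the VAD mode performs per-frame speech detection on the microphone stream. As a purely illustrative stand-in (not the project's actual detector), a simple energy-threshold check looks like this:

```python
# Naive energy-based voice activity check (illustrative only; the real VAD is
# likely more robust). The threshold value is an assumption.
import numpy as np

def is_speech(frame_bytes: bytes, threshold: float = 500.0) -> bool:
    samples = np.frombuffer(frame_bytes, dtype=np.int16).astype(np.float32)
    if samples.size == 0:
        return False
    rms = float(np.sqrt(np.mean(samples ** 2)))  # root-mean-square frame energy
    return rms > threshold
```

In a detector along these lines, frames that pass the check would be buffered and, after a short stretch of silence, flushed to the transcription step.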
Maintenance & Community
The project acknowledges the developers of faster_whisper, ElevenLabs, and OpenAI. Contributions are welcomed via pull requests; opening an issue first is encouraged for significant changes.
Licensing & Compatibility
The repository does not explicitly state a license; suitability for commercial use or closed-source linking is therefore unspecified.
Limitations & Caveats
Performance depends on internet connection speed; the demo was run on a 10 Mbit/s connection. Compatibility with specific operating systems or hardware beyond a standard Python environment is not specified.