asiff00: On-device speech-to-speech conversational AI
Summary
This project delivers a real-time, on-CPU conversational AI system enabling two-way speech communication. It targets users seeking local, responsive AI interactions without cloud dependencies, offering fluid conversations with immediate responses and natural interruption handling through a continuous streaming architecture.
How It Works
A multi-threaded pipeline passes audio through Voice Activity Detection (Pyannote), Speech Recognition (Whisper), a Language Model (qwen2.5 served by Ollama), a custom TextChunker, and Voice Synthesis (Kokoro). The components communicate via queues, so each stage runs independently and the system stays responsive. Two techniques reduce latency: priority-based text chunking, and prompting the LLM to lead with filler words, which supports natural, immediate interaction and interruption handling.
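The queue-connected stages and the chunking idea can be illustrated with a minimal sketch. This is not the project's code: chunk_text, chunker_stage, fake_tts, and run_pipeline are hypothetical names, the priority order (sentence punctuation, then clause punctuation, then a word-count cap) is an assumption about what "priority-based" might mean, and the speech stages are replaced by a stub so the example runs without any models.

```python
import queue
import re
import threading

SENTINEL = None  # marks the end of a stream on every queue


def chunk_text(buffer, max_words=12):
    """Split the first speakable chunk off the buffer.

    Assumed priority order: sentence-ending punctuation, then clause
    punctuation, then a hard cut after max_words complete words.
    Returns (chunk, rest); chunk is "" when more text is needed.
    """
    patterns = (
        r"[.!?]\s",                         # highest priority: sentence end
        r"[,;:]\s",                         # next: clause boundary
        r"^\s*(?:\S+\s+){%d}" % max_words,  # fallback: word-count cap
    )
    for pattern in patterns:
        match = re.search(pattern, buffer)
        if match:
            cut = match.end()
            return buffer[:cut].strip(), buffer[cut:]
    return "", buffer


def chunker_stage(inbox, outbox):
    """Accumulate streamed LLM tokens and emit chunks as soon as they form."""
    buffer = ""
    while True:
        token = inbox.get()
        if token is SENTINEL:
            if buffer.strip():
                outbox.put(buffer.strip())  # flush whatever is left
            outbox.put(SENTINEL)
            return
        buffer += token
        while True:
            chunk, buffer = chunk_text(buffer)
            if not chunk:
                break
            outbox.put(chunk)


def stage(fn, inbox, outbox):
    """Generic worker: apply fn to each item and forward the result."""
    while True:
        item = inbox.get()
        if item is SENTINEL:
            outbox.put(SENTINEL)
            return
        outbox.put(fn(item))


def fake_tts(text):
    """Stand-in for voice synthesis; a real stage would return audio."""
    return f"<audio:{text}>"


def run_pipeline(tokens):
    """Feed tokens through chunker and TTS stages running on separate threads."""
    token_q, chunk_q, audio_q = queue.Queue(), queue.Queue(), queue.Queue()
    workers = [
        threading.Thread(target=chunker_stage, args=(token_q, chunk_q)),
        threading.Thread(target=stage, args=(fake_tts, chunk_q, audio_q)),
    ]
    for worker in workers:
        worker.start()
    for token in tokens:
        token_q.put(token)
    token_q.put(SENTINEL)
    results = []
    while (item := audio_q.get()) is not SENTINEL:
        results.append(item)
    for worker in workers:
        worker.join()
    return results


if __name__ == "__main__":
    print(run_pipeline(["Well, ", "I think ", "so. ", "Bye"]))
```

In this sketch a leading filler such as "Well," becomes its own chunk, so synthesis can start before the rest of the sentence has been generated. That is one plausible reading of how filler-word prompting and chunking might combine to cut perceived latency.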
Quick Start & Requirements
Prerequisites include espeak-ng (sudo apt install -y espeak-ng on Linux) and Ollama (https://ollama.ai/). Clone the repo, run git lfs pull for models, configure .env with a HuggingFace token, and install dependencies via pip install -r requirements.txt.
Start the model with Ollama (ollama run qwen2.5:0.5b-instruct-q8_0), then run python speech_to_speech.py.
Video: https://youtu.be/x92FLnwf-nA
Highlighted Details
Maintenance & Community
The provided README does not detail specific contributors, community channels (e.g., Discord, Slack), or a public roadmap.
Licensing & Compatibility
The repository's license is not explicitly stated in the README. This lack of clarity presents a significant barrier for commercial use or integration into closed-source projects.
Limitations & Caveats
The system is optimized and tested for on-CPU execution, with the README noting that GPU utilization would likely yield substantial performance gains. The project appears to be a personal implementation ("in my test system"). The absence of a clear license is a critical adoption blocker.