Voice interface for ChatGPT
This project enables voice-based conversations with ChatGPT, allowing users to speak to the AI and receive spoken responses. It is designed for individuals seeking a more natural, hands-free interaction with large language models.
How It Works
The system captures audio input via a microphone, converts speech to text using an underlying LLM API, processes the text with ChatGPT, converts ChatGPT's text response back to speech, and plays the audio output. This creates a continuous, spoken dialogue loop.
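The loop described above can be sketched in a few lines of Python. This is a minimal illustration, not the project's actual code: the names conversation_loop, record, transcribe, ask, and speak are hypothetical, and each stage is passed in as a plain callable so that no particular audio, speech-to-text, or ChatGPT library is assumed.

```python
from typing import Callable, Optional


def conversation_loop(
    record: Callable[[], Optional[bytes]],
    transcribe: Callable[[bytes], str],
    ask: Callable[[str], str],
    speak: Callable[[str], None],
) -> int:
    """Run record -> transcribe -> ask -> speak until record() returns None.

    All four callables are hypothetical stand-ins for the real stages
    (microphone capture, speech-to-text, ChatGPT, text-to-speech).
    Returns the number of completed turns.
    """
    turns = 0
    while True:
        audio = record()           # capture one utterance from the microphone
        if audio is None:          # None signals that the session should end
            break
        text = transcribe(audio).strip()
        if not text:               # nothing intelligible was said; listen again
            continue
        reply = ask(text)          # send the transcript to the chat model
        speak(reply)               # play the reply as audio
        turns += 1
    return turns


if __name__ == "__main__":
    # Demo with stub stages: two fake utterances, then end of session.
    clips = iter([b"hello", b"how are you", None])
    conversation_loop(
        record=lambda: next(clips),
        transcribe=lambda audio: audio.decode("utf-8"),
        ask=lambda text: "You said: " + text,
        speak=print,
    )
```

Keeping the stages as injected callables makes the loop easy to exercise without a microphone or network access, and leaves room for swapping in whichever speech and chat backends the real program uses.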
Quick Start & Requirements
Install system dependencies: sudo apt install portaudio19-dev python3-pyaudio ffmpeg espeak
Install Python dependencies: pip install -r requirements.txt
Configure: copy config.json.example to config.json and add a session_token
Run: python chatgpt.py
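Before the first run it can help to confirm that config.json exists and contains a session_token. The check below is a sketch under assumptions: only the file name and the session_token key come from the steps above; the helper name load_config and the assumption that the file is a single JSON object are illustrative, and the project may read its configuration differently.

```python
import json
from pathlib import Path


def load_config(path: str = "config.json") -> dict:
    """Load the config file and verify that a session_token is present.

    Assumes the file holds one JSON object; the real layout may differ.
    """
    config = json.loads(Path(path).read_text(encoding="utf-8"))
    if not isinstance(config, dict):
        raise ValueError(f"{path} must contain a JSON object")
    token = config.get("session_token")
    if not isinstance(token, str) or not token.strip():
        raise ValueError(f"{path} is missing a non-empty 'session_token'")
    return config
```

Calling load_config() from the project directory raises FileNotFoundError if config.json.example was never copied, and ValueError if the token was left blank.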
Maintenance & Community
No specific community channels, roadmap, or contributor information is provided in the README.
Licensing & Compatibility
The project's license is not specified in the README.
Limitations & Caveats
The project is currently command-line based and lacks a web interface. Future development plans include adding interruption capabilities, suppressing audio errors, filtering common AI phrases, and optimizing performance with GPU acceleration for speech-to-text.