AI smart speaker for voice-driven conversations
Top 90.4% on sourcepulse
This project provides a DIY AI smart speaker leveraging OpenAI's GPT models for conversational AI, coupled with speech-to-text (STT) and text-to-speech (TTS) capabilities. It targets hobbyists and developers looking to build custom voice-controlled assistants with web search integration via Langchain agents.
How It Works
The system utilizes a combination of Python scripts for different deployment targets. PC/Mac versions (chat.py
, test.py
) directly use the microphone and speakers, integrating OpenAI for responses and gTTS
for audio output. The Raspberry Pi version (pi.py
) employs Picovoice for efficient wake-word detection and integrates with a ReSpeaker 4-Mic Array for enhanced audio input and visual feedback via APA102 LEDs. Web search is enabled through Tavily API integration.
Quick Start & Requirements
pip install openai pyaudio SpeechRecognition gTTS playsound python-dotenv pyobjc
(Mac)brew install portaudio
on macOS.python chat.py
or python test.py
pip install openai pyaudio SpeechRecognition gTTS pydub python-dotenv apa102-pi gpiozero
and pip install -r requirements.txt
. Install portaudio19-dev
, ffmpeg
, python3-dev
, libasound2-dev
, python3-rpi.gpio
. Follow Seeed ReSpeaker guide.python3 pi.py
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
8 months ago
Inactive