ChatGPT-OpenAI-Smart-Speaker by Olney1

AI smart speaker for voice-driven conversations

Created 3 years ago

310 stars

Top 86.9% on SourcePulse

1 Expert Loves This Project

mxcl

Author of Homebrew

Project Summary

This project provides a DIY AI smart speaker leveraging OpenAI's GPT models for conversational AI, coupled with speech-to-text (STT) and text-to-speech (TTS) capabilities. It targets hobbyists and developers looking to build custom voice-controlled assistants with web search integration via Langchain agents.

How It Works

The system utilizes a combination of Python scripts for different deployment targets. PC/Mac versions (chat.py, test.py) directly use the microphone and speakers, integrating OpenAI for responses and gTTS for audio output. The Raspberry Pi version (pi.py) employs Picovoice for efficient wake-word detection and integrates with a ReSpeaker 4-Mic Array for enhanced audio input and visual feedback via APA102 LEDs. Web search is enabled through Tavily API integration.

Quick Start & Requirements

PC/Mac:
- Install: pip install openai pyaudio SpeechRecognition gTTS playsound python-dotenv pyobjc (Mac)
- Prerequisites: Python 3.7.3+, OpenAI API key, working microphone/speakers. brew install portaudio on macOS.
- Run: python chat.py or python test.py
Raspberry Pi (4b recommended):
- Install: pip install openai pyaudio SpeechRecognition gTTS pydub python-dotenv apa102-pi gpiozero and pip install -r requirements.txt. Install portaudio19-dev, ffmpeg, python3-dev, libasound2-dev, python3-rpi.gpio. Follow Seeed ReSpeaker guide.
- Prerequisites: Raspberry Pi 4b, ReSpeaker 4-Mic Array (or alternative mic/speaker), OpenAI API key, Tavily Search API key, PicoVoice Access Key & Custom Voice Model. Python 3.9+ recommended.
- Run: python3 pi.py
- Docs: https://wiki.seeedstudio.com/ReSpeaker_4_Mic_Array_for_Raspberry_Pi/

Highlighted Details

Supports wake-word detection ("Jeffers" by default) via PicoVoice for Raspberry Pi.
Integrates web search capabilities using Langchain agents and Tavily API.
Provides visual feedback on Raspberry Pi using APA102 LEDs with ReSpeaker array.
Allows customization of OpenAI model, language, and response temperature.

Maintenance & Community

Project appears actively developed, with a Medium post detailing future plans: https://medium.com/@ben_olney/openai-smart-speaker-with-raspberry-pi-5e284d21a53e

Licensing & Compatibility

License: Not explicitly stated in the README.

Limitations & Caveats

ReSpeaker hardware is listed as retired by Seeed Studio and may have compatibility issues with Raspberry Pi 5.
Raspberry Pi setup involves numerous dependencies and potential troubleshooting steps, particularly for audio and hardware integration.
Windows and Linux setups may require additional dependencies beyond those listed for macOS.

Health Check

Last Commit

1 year ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

3 stars in the last 30 days

Explore Similar Projects

alibabacloud-bailian-speech-demo by aliyun

Speech AI SDK demos for AlibabaCloud Bailian

Created 1 year ago

Updated 3 weeks ago

Starred by

Georgi Gerganov

Georgi Gerganov(Author of llama.cpp, whisper.cpp).

pi-card by nkasmanoff

Voice assistant for Raspberry Pi

Created 1 year ago

Updated 1 year ago

gpt-voice-conversation-chatbot by Adri6336

Voice chatbot for engaging spoken conversations with ChatGPT/GPT-4

Created 2 years ago

Updated 1 year ago

Starred by

Travis Fischer

Travis Fischer(Founder of Agentic).

ollama-voice-mac by apeatling

Offline voice assistant for macOS

Created 2 years ago

Updated 4 months ago

voice-assistant-whisper-chatgpt by bhattbhavesh91

AI-powered voice assistant creation

Created 3 years ago

Updated 2 years ago

AIUI by lspahija

Voice interface for AI models

Created 2 years ago

Updated 1 year ago

fast-voice-assistant by dsa

AI voice assistant demo with <500ms response

Created 1 year ago

Updated 1 year ago

voice-chat-ai by bigsk1

Voice chat app for interacting with AI characters using speech

Created 1 year ago

Updated 5 days ago

Starred by

Teknium

Teknium(Cofounder of Nous Research).

ChatWaifu by cjyaddone

Chatbot for simulating conversations with waifu-style characters

Created 3 years ago

Updated 1 year ago

Babagaboosh by DougDougGithub

Simple app for verbal conversation with GPT-4o

Created 2 years ago

Updated 1 year ago

Starred by

Teknium

Teknium(Cofounder of Nous Research).

Bing-GPT-Voice-Assistant by Ai-Austin

Voice assistant using dual wake words

Created 2 years ago

Updated 2 years ago

mi-gpt by idootop

Voice assistant for integrating smart speakers with LLMs

Created 1 year ago

Updated 4 months ago

Feedback? Help us improve.