Simple app for verbal conversation with GPT-4o
Top 66.8% on sourcepulse
Babagaboosh is a Python application enabling verbal conversations with OpenAI's GPT-4o model. It's designed for users who want to interact with advanced AI through speech, offering a straightforward way to engage in spoken dialogue.
How It Works
The application orchestrates a pipeline involving speech-to-text, AI language processing, and text-to-speech. It captures user audio via the microphone, transcribes it using Microsoft Azure TTS, sends the text to GPT-4o for a response, and then converts the AI's reply into speech using Elevenlabs. This multi-service approach allows for natural, spoken interactions with the AI.
Quick Start & Requirements
pip install -r requirements.txt
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
11 months ago
1 day