Web app for conversing with ChatGPT via speech
SpeechGPT is an open-source web application enabling users to converse with ChatGPT via voice, targeting language learners and general users seeking interactive AI experiences. It offers a privacy-first, mobile-friendly interface with extensive language support and flexible speech input/output options.
How It Works
The application leverages web technologies to provide a conversational interface with ChatGPT. It integrates both browser-based speech recognition and synthesis, alongside optional, more advanced services from Azure Speech Services and Amazon Polly for enhanced accuracy and naturalness. Data is processed and stored locally, prioritizing user privacy.
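The choice between built-in browser speech and the optional cloud services can be sketched as a small fallback rule. This is a minimal illustration, not SpeechGPT's actual code: the type names, setting fields, and the `resolveBackend` function are all hypothetical, and it assumes a cloud service is used only when its credentials are filled in, with browser speech as the default otherwise.

```typescript
// Hypothetical names throughout; none of these are taken from the SpeechGPT source.
type SpeechBackend = "browser" | "azure" | "polly";

interface SpeechSettings {
  preferred: SpeechBackend;
  azureKey?: string;
  azureRegion?: string;
  awsAccessKeyId?: string;
  awsSecretAccessKey?: string;
  awsRegion?: string;
}

// True when a setting holds a non-empty, non-whitespace string.
function filled(value?: string): boolean {
  return typeof value === "string" && value.trim().length > 0;
}

// True when every credential the given backend needs is present.
// Browser speech needs no credentials, so it is always considered configured.
function isConfigured(backend: SpeechBackend, s: SpeechSettings): boolean {
  if (backend === "azure") {
    return filled(s.azureKey) && filled(s.azureRegion);
  }
  if (backend === "polly") {
    return filled(s.awsAccessKeyId) && filled(s.awsSecretAccessKey) && filled(s.awsRegion);
  }
  return true;
}

// Use the preferred backend if it is fully configured; otherwise fall back
// to the browser's built-in speech support.
function resolveBackend(s: SpeechSettings): SpeechBackend {
  return isConfigured(s.preferred, s) ? s.preferred : "browser";
}
```

Under these assumptions, a user who selects Azure but leaves the region blank would still get working speech through the browser, rather than a failed request.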
Quick Start & Requirements
docker run -d -p 8080:8080 --name speechgpt hahahumble/speechgpt
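Optional Amazon Polly support requires AWS credentials with the AmazonPollyFullAccess policy attached.
Once the container is running, open http://localhost:8080/ in a browser.
Highlighted Details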
Maintenance & Community
No specific contributors, sponsorships, or community links (Discord/Slack) are mentioned in the README.
Licensing & Compatibility
Limitations & Caveats
The application requires an OpenAI API key, which incurs usage-based costs. The optional cloud speech services require managing cloud provider credentials and may add their own costs. The README does not provide performance benchmarks or describe known limitations of the built-in speech capabilities.