Spoken English pronunciation correction assistant using Gemini
Top 35.8% on sourcepulse
This project provides an AI-powered English pronunciation tutor using Google Gemini. It's designed for English language learners seeking real-time feedback on their speaking, including pronunciation correction, grammar suggestions, and interactive dialogue practice.
How It Works
The system leverages Google Gemini for AI-driven pronunciation assessment and feedback. It captures user speech via microphone, processes it for real-time recognition, and then provides targeted guidance on pronunciation and grammar. The architecture supports intelligent scene switching for varied practice scenarios.
Quick Start & Requirements
pip install -r requirements.txt
python starter.py
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project requires specific system dependencies (like portaudio19-dev
on Ubuntu/Debian) to be installed before Python dependencies. The ElevenLabs API key is optional but required for voice output features.
6 months ago
1 day