Discover and explore top open-source AI tools and projects—updated daily.
gkamradtVoice bot demo using speech and language models
Top 74.7% on SourcePulse
QuickAgent is an alpha-stage Python demo showcasing a voice-controlled chatbot. It integrates Text-to-Speech (TTS), Speech-to-Text (STT), and a Large Language Model (LLM) for conversational interaction, targeting users interested in real-time voice AI applications.
How It Works
The bot leverages streaming audio processing for both STT and TTS to minimize latency. It's configured to use Deepgram for audio services and Groq for its LLM, enabling a fluid, conversational experience. The core logic is contained within the QuickAgent.py script, with reusable components in the building_blocks directory.
Quick Start & Requirements
python3 QuickAgent.pyHighlighted Details
Maintenance & Community
No specific community channels, contributors, or roadmap details are provided in the README.
Licensing & Compatibility
The license is not specified in the README.
Limitations & Caveats
This is an alpha demo, indicating potential instability and incomplete features. The project relies on specific third-party services (Deepgram, Groq) which may incur costs and require API key management.
1 year ago
Inactive
janhq