Voice assistant based on OpenAI ChatGPT
Top 72.5% on sourcepulse
SpeakGPT is an open-source AI assistant designed for users who want a versatile interface to interact with various large language models and AI services. It supports text generation, image creation, and multimodal input, aiming to provide a comprehensive and customizable AI interaction experience.
How It Works
SpeakGPT leverages a flexible API integration architecture, allowing it to connect with multiple LLM providers including OpenAI, GROQ, Azure, and OpenRouter. This approach enables users to access a wide range of models like GPT, LLAMA, MIXTRAL, GEMMA, and Gemini, offering choice and avoiding vendor lock-in. It supports features like voice input via Whisper and Google, image generation with DALL-E, and GPT-4 Vision for image analysis.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The project is part of a Bachelor Thesis by Dmytro Ostapenko. Attribution is required for use. Users are encouraged to report bugs and request features via GitHub Issues.
Licensing & Compatibility
Licensed under the Apache License, Version 2.0. This license permits commercial use and linking with closed-source projects, provided the terms of the license are followed.
Limitations & Caveats
The project is noted as part of a Bachelor Thesis, and attribution is required. Some planned features like device routines and chat history sync are not yet implemented. OpenRouter support is limited to text generation.
1 month ago
1 day