Self-hosted web app for audio transcription, summarization, and chat
Top 32.2% on sourcepulse
Speakr is a self-hosted web application for transcribing audio recordings, generating summaries and titles, and interacting with the content via chat. It targets individuals and teams seeking to securely manage and analyze their audio data, offering a private alternative to cloud-based transcription services.
How It Works
Speakr leverages OpenAI-compatible APIs for both Speech-to-Text (STT) and Large Language Models (LLMs). Users upload audio files, which are processed in the background. STT APIs convert audio to text, and LLMs then generate summaries, titles, and provide conversational interaction based on the transcript. The architecture supports configurable transcription and output languages, user-specific prompts, and integration of user professional context for more relevant AI responses.
Quick Start & Requirements
git clone https://github.com/murtaza-nasir/speakr.git
docker-compose.yml
with API keys (OpenAI-compatible for STT and LLM) and desired models.docker compose up -d
.Highlighted Details
Maintenance & Community
The project is maintained by Murtaza Nasir. Feedback, bug reports, and feature suggestions are welcomed via GitHub Issues. A Contributor License Agreement (CLA) will be required for future code contributions.
Licensing & Compatibility
Dual-licensed under GNU Affero General Public License v3.0 (AGPLv3) and a separate commercial license. AGPLv3 requires sharing source code of modified versions if accessed over a network. Commercial licensing is available for proprietary integration.
Limitations & Caveats
Local development and Linux systemd deployment methods are explicitly stated as not currently working. Users must rely on the Docker installation. The AGPLv3 license has significant implications for commercial use, requiring source code disclosure of network-accessible modifications.
1 day ago
Inactive