Self-hosted service for offline transcription, diarization, and LLM summarization
This project provides a self-hosted, offline transcription and diarization service with LLM-based summarization. It targets users needing to process audio files, offering a web interface and SSH drop zones for easy integration into workflows. The service leverages Whisper for transcription/diarization, Ollama with Mistral for summarization, and Meilisearch for full-text search, aiming for a turnkey solution.
How It Works
The system utilizes Docker for deployment, bundling transcription, diarization, summarization, and search functionalities. Audio files can be uploaded via a web UI or dropped via SSH. Whisper-diarization handles speaker identification and transcription, while Ollama integrates with Mistral to generate summaries based on a customizable prompt. Meilisearch provides fast indexing and retrieval of transcribed text.
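The summarization step can be sketched as follows. This is a minimal illustration, not the project's actual code: it assumes Ollama is reachable on its default port (11434) with the `mistral` model pulled, and it uses Ollama's `/api/generate` endpoint. The segment layout (`speaker` and `text` keys), the default prompt, and the function names are hypothetical, chosen only to show how a customizable prompt could wrap a diarized transcript.

```python
import json
import urllib.request

# Assumption: Ollama listening on its default local port.
OLLAMA_URL = "http://localhost:11434/api/generate"

# Hypothetical default prompt; the real service lets the user customize this.
DEFAULT_PROMPT = "Summarize the following transcript in a few bullet points:\n\n{transcript}"


def format_transcript(segments):
    """Render diarized segments as 'Speaker: text' lines (hypothetical layout)."""
    return "\n".join(f"{s['speaker']}: {s['text']}" for s in segments)


def build_summary_request(segments, prompt_template=DEFAULT_PROMPT, model="mistral"):
    """Build the JSON payload for Ollama's /api/generate endpoint."""
    return {
        "model": model,
        "prompt": prompt_template.format(transcript=format_transcript(segments)),
        "stream": False,  # ask for a single JSON object instead of a token stream
    }


def summarize(segments, url=OLLAMA_URL, **kwargs):
    """Send the transcript to Ollama and return the generated summary text."""
    payload = json.dumps(build_summary_request(segments, **kwargs)).encode("utf-8")
    request = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

Keeping the payload construction separate from the network call means a custom prompt can be inspected without a running Ollama instance, for example `build_summary_request(segments, prompt_template="TL;DR:\n{transcript}")`.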
Quick Start & Requirements
./start-nobuild.sh (for Docker images) or ./install.sh followed by ./run.sh (for local build).
ts-gpu image, ~26GB).
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats