Discover and explore top open-source AI tools and projects—updated daily.
Local-first, privacy-focused speech-to-text and summarization platform for internal networks
Top 60.0% on SourcePulse
IntraScribe is a self-hosted, privacy-focused speech-to-text and summarization platform designed for internal network deployment in enterprises, schools, and government organizations. It offers real-time transcription, speaker diarization, batch processing, AI-powered summarization, and title generation, with a fully decoupled architecture allowing for flexible integration of various audio capture and transmission methods. The platform prioritizes data privacy and compliance by keeping all data within the local network.
How It Works
IntraScribe utilizes a modular architecture. Real-time transcription is handled via WebRTC for audio streaming from the browser to the backend, with results returned through Server-Sent Events (SSE). For higher quality and structured output, audio is cached, uploaded to Supabase Storage, and then processed for speaker diarization using pyannote.audio
and re-transcription. AI summarization and title generation are managed by LiteLLM
, allowing for configurable models and fallback strategies. Data persistence and real-time updates are managed through Supabase, leveraging Postgres for data, Auth for authentication, Storage for files, and Realtime for event subscriptions.
Quick Start & Requirements
.env.local
for web, .env
for backend), install backend dependencies with uv
, and start the backend and frontend.qwen3:8b
(configurable).pyannote.audio
.mkcert
is recommended for intra-network use.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
2 days ago
Inactive