noScribe by kaixxx

GUI tool for local AI-powered audio transcription

Created 2 years ago

1,696 stars

Top 24.8% on SourcePulse

2 Experts Love This Project

swyxio

Editor of Latent Space

MagMueller

Cofounder of Browser Use

Project Summary

noScribe is a free, open-source desktop application for automated audio transcription, primarily targeting qualitative social researchers and journalists. It leverages OpenAI's Whisper and pyannote for transcription and speaker identification, offering a local, privacy-focused solution with an integrated editor for transcript refinement.

How It Works

noScribe utilizes Whisper for speech-to-text conversion and pyannote for speaker diarization. Users can select transcription quality (precise or fast) and configure options like language detection, pause marking, speaker identification, disfluency inclusion, and timestamp generation. The application processes audio locally, ensuring data privacy.
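As a rough illustration of how a speech-to-text pass and a speaker-diarization pass can be combined, the sketch below labels each transcribed segment with the speaker whose turn overlaps it most. This is not noScribe's actual code: the `Segment` and `Turn` classes, the `assign_speakers` function, the speaker labels, and the sample data are all hypothetical, and the real Whisper and pyannote outputs are richer than these simplified stand-ins.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """A transcribed span of audio (times in seconds). Hypothetical stand-in."""
    start: float
    end: float
    text: str


@dataclass
class Turn:
    """A diarization result: one speaker talking from start to end. Hypothetical."""
    start: float
    end: float
    speaker: str


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(segments, turns, unknown="UNKNOWN"):
    """Label each segment with the speaker whose turn overlaps it the most."""
    labeled = []
    for seg in segments:
        best_speaker, best_overlap = unknown, 0.0
        for turn in turns:
            shared = overlap(seg.start, seg.end, turn.start, turn.end)
            if shared > best_overlap:
                best_speaker, best_overlap = turn.speaker, shared
        labeled.append((best_speaker, seg.text))
    return labeled


# Made-up example data, not real model output.
segments = [
    Segment(0.0, 4.0, "How did you get started?"),
    Segment(4.5, 9.0, "I began as a volunteer."),
]
turns = [
    Turn(0.0, 4.2, "S00"),
    Turn(4.2, 9.5, "S01"),
]

for speaker, text in assign_speakers(segments, turns):
    print(f"{speaker}: {text}")
```

Choosing the turn with the largest overlap is one simple heuristic; segments that overlap no turn at all keep the `UNKNOWN` label so they can be fixed by hand in an editor.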

Quick Start & Requirements

Installation: Downloadable executables are provided for Windows, macOS (Apple Silicon and Intel), and Linux.
Prerequisites:
- Windows: NVIDIA CUDA toolkit (for GPU acceleration; requires an NVIDIA GPU with 6 GB+ VRAM).
- macOS: Rosetta 2 for Intel-based components on Apple Silicon.
Resource Footprint: Download size is approximately 3.7 GB. Transcription of a one-hour interview can take up to three hours and requires significant CPU resources.
Links:
- Releases: https://drive.switch.ch/index.php/s/HtKDKYRZRNaYBeI
- noScribeEditor Source: https://github.com/kaixxx/noScribeEditor
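The processing-time figure in the resource footprint can be turned into a quick planning estimate. The sketch below assumes that processing time scales linearly with audio length and treats the "one hour of audio can take up to three hours" figure as a worst-case factor of 3; both the linear scaling and the `worst_case_hours` helper are illustrative assumptions, and actual speed depends on hardware and the chosen quality setting.

```python
def worst_case_hours(audio_minutes, slowdown=3.0):
    """Upper-bound processing time in hours.

    slowdown=3.0 encodes the "one hour of audio can take up to three hours"
    figure; linear scaling with audio length is an assumption.
    """
    if audio_minutes < 0:
        raise ValueError("audio_minutes must be non-negative")
    return audio_minutes * slowdown / 60.0


for minutes in (30, 60, 90):
    print(f"{minutes} min of audio: up to {worst_case_hours(minutes):.1f} h")
```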

Highlighted Details

Runs entirely locally; no data is sent to the internet.
Supports ~60 languages, with best performance for English, Spanish, Italian, Portuguese, and German.
Integrated editor allows synchronized playback of audio with transcript text for easy correction.
Handles speaker identification and can mark overlapping speech (experimental).

Maintenance & Community

Developed by Kai Dröge.
Translations are community-contributed and may require review.
Source code available on GitHub.

Licensing & Compatibility

License: GPL-3.0.
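Compatibility: Commercial use is permitted. GPL-3.0 is a copyleft license, so anyone distributing noScribe or a derivative work must make the source available under GPL-3.0; this generally rules out incorporating it into closed-source software that is distributed.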

Limitations & Caveats

Requires a powerful computer for reasonable processing times; slower machines may require overnight processing.
Transcription quality is highly dependent on audio quality.
Known issues include potential AI text repetition loops on long files and experimental support for multilingual audio and overlapping speech. Non-verbal expressions are not transcribed.

Health Check

Last Commit

1 week ago

Responsiveness

1 day

Pull Requests (30d)

1

Issues (30d)

3

Star History

52 stars in the last 30 days

Explore Similar Projects

Stage-Whisper by Stage-Whisper

Transcription app for journalists using OpenAI's Whisper ASR

Created 3 years ago

Updated 2 years ago

open-dubbing by Softcatala

AI dubbing system for videos

Created 1 year ago

Updated 6 months ago

Starred by

Eugene Yan (AI Scientist at AWS).

Whisper-transcription_and_diarization-speaker-identification- by lablab-ai

Audio transcription/diarization using Whisper and pyannote-audio

Created 3 years ago

Updated 3 years ago

LiveWhisper by Nikorasu

Live transcription tool using OpenAI's Whisper

Created 3 years ago

Updated 5 months ago

AudioToText by Carleslc

CLI tool for audio transcription and translation

Created 2 years ago

Updated 2 years ago

aTrain by JuergenFleiss

GUI tool for offline speech transcription, speaker diarization

Created 2 years ago

Updated 3 days ago

Speech-Translate by Dadangdut33

Speech-to-text app using Whisper for transcription and translation

Created 3 years ago

Updated 2 years ago

Starred by

Georgi Gerganov (Author of llama.cpp, whisper.cpp).

transcriber_app by davabase

Real-time speech-to-text transcription app

Created 3 years ago

Updated 3 years ago

Starred by

Travis Fischer (Founder of Agentic).

whispo by egoist

AI-powered dictation tool

Created 1 year ago

Updated 1 year ago

Easy-Voice-Toolkit by Spr-Aachen

Local AI voice toolkit for audio processing, recognition, transcription, and conversion

Created 2 years ago

Updated 3 weeks ago

Starred by

Emile Vauge (Founder of Traefik).

Scriberr by rishikanthc

Self-hosted app for local AI audio transcription

Created 1 year ago

Updated 4 days ago

Starred by

Travis Fischer (Founder of Agentic).

writeout.ai by beyondcode

Web app for audio transcription and translation

Created 2 years ago

Updated 2 years ago
