Chrome extension for video word search
This project provides a Chrome extension for searching video content by transcribing audio to text, enabling users to find specific words or phrases within videos. It is targeted at content creators, researchers, and anyone who needs to quickly locate information within video lectures, interviews, or other spoken-word content. The primary benefit is efficient content retrieval, saving significant time compared to manual searching.
How It Works
The system transcribes audio to text with OpenAI's Whisper model, run through faster-whisper for speed. A Flask API runs locally and uses the user's GPU for transcription. The extension accepts search queries and marks hits on the YouTube timeline: exact matches in green, and phonetically similar matches (found using Levenshtein distance) in purple. Results are stored as JSON, so later searches on the same URL are faster.
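As a rough illustration of the matching step, the sketch below sorts transcript words into exact and near matches by edit distance. It is not the project's code: the Word record, the normalize and search helpers, and the max_distance threshold are all hypothetical, and it assumes the transcript is already available as a list of words with start times.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str     # transcribed word (hypothetical record shape)
    start: float  # start time in seconds


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (row-by-row dynamic programming)."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string as the row to save memory
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def normalize(token: str) -> str:
    """Lowercase a token and strip surrounding punctuation."""
    return token.strip(".,!?;:\"'()").lower()


def search(words, query, max_distance=1):
    """Return (exact, near) lists of start times for a single-word query.

    max_distance is an assumed threshold, not a value taken from the project.
    """
    q = normalize(query)
    exact, near = [], []
    for w in words:
        t = normalize(w.text)
        if t == q:
            exact.append(w.start)   # would be drawn in green
        elif levenshtein(t, q) <= max_distance:
            near.append(w.start)    # would be drawn in purple
    return exact, near
```

Calling search(words, "cat") on a transcript containing "cat", "sat." and "Cut" would place the first in the exact list and the other two in the near list, since each differs from the query by one substitution after normalization.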
Quick Start & Requirements
Run python app.py in the pythons folder to start the Flask API.
Highlighted Details
Maintenance & Community
The project is presented as a personal side project by the author, Evan Wildenhain, with an open invitation for others to build upon it. No specific community channels or active maintenance plans are detailed.
Licensing & Compatibility
The README does not explicitly state a license. The project's open nature suggests a permissive intent, but the absence of a LICENSE file requires clarification for commercial or closed-source use.
Limitations & Caveats
The extension currently supports only YouTube videos. The first search on a video may not highlight the timeline while the model is still processing; a second search displays the cached results instantly. The author describes phonetic similarity matching as "pretty much useless" because Whisper's transcription is effective enough on its own.
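The difference between the first and second search is what a transcribe-once, cache-by-URL pattern would produce. The sketch below shows one way such a cache could work. It is an assumption, not the project's implementation: get_transcript, cache_path, the transcribe callback, and the hashed file naming are all hypothetical.

```python
import hashlib
import json
from pathlib import Path


def cache_path(cache_dir: Path, url: str) -> Path:
    """Map a video URL to a JSON file name (hashing avoids unsafe characters)."""
    digest = hashlib.sha256(url.encode("utf-8")).hexdigest()
    return cache_dir / f"{digest}.json"


def get_transcript(url, transcribe, cache_dir):
    """Return the transcript for url, running transcribe only on a cache miss.

    transcribe is any callable that takes a URL and returns JSON-serializable
    data; in a real setup it would wrap the slow speech-to-text step.
    """
    cache_dir = Path(cache_dir)
    path = cache_path(cache_dir, url)
    if path.exists():
        # Cache hit: later searches on the same URL skip transcription.
        return json.loads(path.read_text(encoding="utf-8"))
    words = transcribe(url)  # slow path, taken on the first search only
    cache_dir.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(words), encoding="utf-8")
    return words
```

With this shape, the expensive call happens once per URL, and every later query against the same video reads the stored JSON.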