Tool for turning multimedia into searchable knowledge
Top 45.5% on sourcepulse
FrogBase transforms multimedia content into navigable knowledge graphs, targeting both developers and non-technical users. It simplifies the workflow of downloading, transcribing, embedding, and indexing audio-visual data, enabling efficient searching and knowledge discovery.
How It Works
FrogBase orchestrates a pipeline leveraging yt_dlp
for media downloads, OpenAI's Whisper for speech-to-text transcription, and Sentence Transformers for embedding text segments. These embeddings are then indexed using hnswlib
for efficient similarity search. This integrated approach streamlines the process of creating searchable knowledge bases from diverse online media.
Quick Start & Requirements
ffmpeg
(e.g., sudo apt install ffmpeg
).pip install frogbase
.pip install streamlit
and run streamlit run ui/01_🏠_Home.py
.Highlighted Details
yt_dlp
.hnswlib
.Maintenance & Community
whisper-ui
.Licensing & Compatibility
Limitations & Caveats
This repository is a pre-release version and is known to be very unstable. Stable releases are available in 1.x versions.
1 year ago
1+ week