Web app for local speech-to-text using Whisper
Top 87.6% on sourcepulse
This project provides a simple web application for converting audio to subtitles using OpenAI's Whisper model. It is designed for users who need a straightforward way to generate transcriptions from audio files via a web interface, suitable for local testing and development.
How It Works
The application leverages OpenAI's Whisper model to perform speech-to-text conversion. It exposes a web interface, allowing users to upload audio files and receive subtitle outputs. The architecture is a standard Python web application, likely using a framework like Flask or FastAPI, serving the Whisper model's inference capabilities.
Quick Start & Requirements
git clone git@github.com:Kabanosk/whisper-website.git
cd whisper-website
python3 -m venv venv
then source venv/bin/activate
pip install -r requirements.txt
cd src && python3 run.py
http://127.0.0.1:8000/
docker build -t app .
docker run --name app_container -p 80:80 app
http://127.0.0.1:80/
Highlighted Details
Maintenance & Community
No specific information on contributors, community channels, or roadmap is provided in the README.
Licensing & Compatibility
The README does not specify a license. Compatibility for commercial use or closed-source linking is not detailed.
Limitations & Caveats
The project appears to be a basic implementation for local use. It does not detail performance benchmarks, scalability options, or advanced features. The lack of explicit licensing information may pose a barrier to commercial adoption.
5 months ago
Inactive