generate-subtitles by mayeaux

Web app for audio/video transcription and translation

Created 3 years ago

811 stars

Top 43.5% on SourcePulse

Project Summary

This project provides a user-friendly web UI for generating transcripts and automatic translations of audio and video content. It leverages OpenAI's Whisper for transcription and LibreTranslate for translations, with integrated video downloading via yt-dlp. The target audience includes content creators, researchers, and anyone needing to process multimedia for accessibility or analysis.

How It Works

The application utilizes OpenAI's Whisper model for accurate speech-to-text transcription. Optionally, LibreTranslate can be integrated for automatic translation of the generated transcripts into various languages. Video content is handled through yt-dlp, enabling direct downloading and processing from URLs. The architecture is built around a Node.js backend serving a web interface.

Quick Start & Requirements

Installation: Clone the repository, install Node.js 14+, install Whisper AI, install yt-dlp, run npm install, and then npm start.
Prerequisites: Node.js 14+, OpenAI Whisper (CPU or GPU), yt-dlp. GPU acceleration (CUDA) is highly recommended for performance.
Setup: Requires manual installation of Whisper and yt-dlp.
Docs: Whisper Setup, yt-dlp.

Highlighted Details

Integrates Whisper AI for transcription and LibreTranslate for translation.
Supports automatic video downloading via yt-dlp.
Offers a user-friendly web interface for easy operation.
Recommends VastAI for GPU-accelerated cloud instances.

Maintenance & Community

The project is maintained by mayeaux.
No specific community links (Discord/Slack) or roadmap are provided in the README.

Licensing & Compatibility

The README does not explicitly state a license. The repository's license file should be consulted for details.

Limitations & Caveats

CPU-based transcription is significantly slower than GPU-based.
The provided VastAI setup script is noted as "not perfect yet."
Port forwarding configuration for cloud instances requires careful attention.

Health Check

Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

2 stars in the last 30 days

Explore Similar Projects

yt-transcriber by pmarreck

YouTube transcription TUI app

Created 1 year ago

Updated 2 months ago

pytvzhen by CuSO4Gem

CLI tool for fast YouTube English video translation to Chinese

Created 1 year ago

Updated 1 year ago

openlrc by zh-plus

Python library for audio transcription and translation to LRC files

Created 2 years ago

Updated 17 hours ago

AudioToText by Carleslc

CLI tool for audio transcription and translation

Created 3 years ago

Updated 2 years ago

Speech-Translate by Dadangdut33

Speech-to-text app using Whisper for transcription and translation

Created 3 years ago

Updated 2 years ago

Starred by

Travis Fischer

Travis Fischer(Founder of Agentic) and

Didier Lopes

Didier Lopes(Founder of OpenBB).

yt-whisper by m1guelpf

CLI tool for generating YouTube subtitles

Created 3 years ago

Updated 2 years ago

Starred by

Travis Fischer

Travis Fischer(Founder of Agentic).

writeout.ai by beyondcode

Web app for audio transcription and translation

Created 3 years ago

Updated 2 years ago

SoniTranslate by R3gm

Gradio web UI for video translation with synchronized audio

Created 2 years ago

Updated 2 months ago

Linly-Dubbing by Kedreamix

AI dubbing/translation tool for multi-language video content creation

Created 1 year ago

Updated 11 months ago

YouDub-webui by liuzhao1225

WebUI for video translation/dubbing

Created 2 years ago

Updated 2 months ago

Starred by

Abubakar Abid

Abubakar Abid(Cofounder of Gradio).

voice-pro by abus-aikorea

WebUI for speech recognition, translation, and dubbing

Created 1 year ago

Updated 2 months ago

pyvideotrans by jianchang512

Video translation CLI tool

Created 2 years ago

Updated 19 hours ago

Feedback? Help us improve.