AutoSub by abhirooptalasila

CLI tool for generating video subtitles

Created 5 years ago

653 stars

Top 51.3% on SourcePulse

1 Expert Loves This Project

shizhediao

Author of LMFlow; Research Scientist at NVIDIA

Project Summary

AutoSub is a command-line interface (CLI) tool designed to automatically generate subtitle files (SRT, VTT, TXT) for any video. It targets users who download video content and require subtitles, offering a convenient solution for creating them using open-source speech-to-text engines.

How It Works

AutoSub leverages Mozilla DeepSpeech or Coqui STT for speech-to-text inference. It first uses FFmpeg to extract audio from the video, ensuring a 16kHz sampling rate compatible with DeepSpeech. Then, it employs pyAudioAnalysis to segment the audio based on silence, creating smaller, manageable files. Inference is performed on each audio segment, and the resulting text is compiled into subtitle files.

Quick Start & Requirements

Install via pip: pip install . (after cloning the repo and optionally activating a virtual environment).
Prerequisites: Python 3, FFmpeg. For GPU support, install requirements-gpu.txt and ensure appropriate CUDA version.
Model files are downloaded automatically if not present.
See: DeepSpeech Examples, Coqui STT

Highlighted Details

Supports both DeepSpeech and Coqui STT engines.
Audio segmentation via silence detection for efficient processing.
Customizable subtitle split duration and output formats (SRT, VTT, TXT).
Docker support for CPU and GPU builds.

Maintenance & Community

Project is maintained by abhirooptalasila.
References include DeepSpeech and pyAudioAnalysis repositories.

Licensing & Compatibility

The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

Requires manual download of model and scorer files for languages other than English.
Performance on a dual-core i5 with 8GB RAM for a 70-minute video was approximately 40 minutes, suggesting potentially long processing times for longer videos on less powerful hardware.

Health Check

Last Commit

2 years ago

Responsiveness

Inactive

Pull Requests (30d)

0

Issues (30d)

0

Star History

0 stars in the last 30 days

Explore Similar Projects

fcpx-auto-captions by shaishaicookie

FCPX tool for auto-captioning videos using OpenAI's Whisper

Created 2 years ago

Updated 2 years ago

auto_ai_subtitle by qinL-cdy

Subtitle generator for videos

Created 2 years ago

Updated 2 years ago

VideoSubtitleGenerator by buxuku

CLI tool for local video subtitle generation and translation

Created 2 years ago

Updated 7 months ago

PreenCut by roothch

AI-powered tool for video retrieval and clipping

Created 7 months ago

Updated 4 months ago

Starred by

Andy Konwinski

Andy Konwinski (Cofounder of Perplexity, Databricks) and

Paul Gauthier

Paul Gauthier(Founder of Aider).

ArxivPapers by imelnyk

ArXiv paper to video/audio converter

Created 1 year ago

Updated 1 year ago

Auto-YouTube-Shorts-Maker by Binary-Bytes

CLI tool for automated YouTube Shorts creation

Created 2 years ago

Updated 1 year ago

decipher by dsymbol

CLI tool for AI-powered video subtitling

Created 3 years ago

Updated 1 year ago

bulk_transcribe_youtube_videos_from_playlist by Dicklesworthstone

CLI tool for bulk YouTube video transcription

Created 2 years ago

Updated 10 months ago

Auto-Synced-Translated-Dubs by ThioJoe

CLI tool for auto-synced, translated video dubs

Created 3 years ago

Updated 18 hours ago

auto-subtitle by m1guelpf

CLI tool for automatic video subtitling

Created 3 years ago

Updated 1 year ago

FunClip by modelscope

Video clipping tool using LLM-based AI

Created 2 years ago

Updated 6 months ago

autocut by mli

CLI tool for subtitle-based video editing

Created 3 years ago

Updated 1 year ago

Feedback? Help us improve.