CLI tool for subtitle synchronization, translation, and transcription
Top 64.7% on sourcepulse
Subaligner is a Python toolkit for synchronizing, translating, and transcribing subtitles for audiovisual content. It targets content creators, localization specialists, and researchers needing to align or generate subtitles, offering automated workflows to improve accessibility and multilingual support.
How It Works
Subaligner employs a dual-stage alignment process. The first stage performs a global shift estimation, while the second stage refines this by aligning individual subtitle segments in parallel. For transcription, it leverages models like Whisper, supporting various sizes and offering advanced prompting techniques for improved accuracy. Translation is handled by models such as Helsinki-NLP and M2M100.
Quick Start & Requirements
pip install subaligner
(recommended within a virtual environment).subaligner[llm]
for translation/transcription, subaligner[stretch]
for forced alignment.[stretch]
installations.baxtree/subaligner
available.Highlighted Details
Maintenance & Community
The project acknowledges contributions from packages like librosa, tensorflow, scikit-learn, transformers, and openai-whisper. Further community interaction details are not explicitly provided in the README.
Licensing & Compatibility
The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The installation of optional features ([stretch]
, [dev]
, [harmony]
) requires setuptools<65.0.0
, which may conflict with other project dependencies. The README does not detail specific performance benchmarks or limitations regarding supported hardware (e.g., GPU requirements for transcription models).
3 weeks ago
Inactive