Whisper-TikTok by MatteoFasulo

TikTok video generator using Whisper, Edge TTS, and FFMPEG

Created 2 years ago

315 stars

Top 85.8% on SourcePulse

Project Summary

This project provides an end-to-end pipeline for generating TikTok videos with AI. It targets content creators who want to automate video production: it generates natural-sounding voiceovers, transcribes them into subtitles, and assembles the final video with customizable subtitle styling. The main benefit is a large reduction in the manual effort needed to produce short-form videos.

How It Works

The system orchestrates a pipeline involving several AI models and tools. It starts by fetching a background video (randomly or from a specified URL) and uses Microsoft Edge Cloud TTS for natural voiceovers. OpenAI's Whisper model transcribes the generated audio into SRT subtitles, which are then embedded into the background video using FFMPEG. The process is configurable via a JSON file and offers command-line options for customization.
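To make the subtitle step more concrete, here is a minimal sketch of how timed transcript segments could be serialized as SRT. The `(start, end, text)` tuples and the helper names `format_timestamp` and `segments_to_srt` are illustrative assumptions, not the project's own code; in the actual pipeline the timings would come from Whisper.

```python
def format_timestamp(seconds: float) -> str:
    """Convert a time in seconds to the SRT form HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(segments) -> str:
    """Serialize (start, end, text) tuples as numbered SRT cues.

    Cues are separated by a blank line, as the SRT format expects.
    """
    cues = []
    for index, (start, end, text) in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    return "\n".join(cues)


if __name__ == "__main__":
    # Hypothetical segments standing in for a Whisper transcription result.
    demo = [(0.0, 1.25, "Hello there."), (1.25, 3.0, "Welcome back.")]
    print(segments_to_srt(demo))
```

Keeping the timestamp arithmetic in integer milliseconds avoids rounding drift when a segment boundary falls close to a whole second.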

Quick Start & Requirements

Install dependencies: pip install -U -r requirements.txt
Requires FFMPEG to be installed and available in the system's PATH.
A CUDA-capable GPU is recommended for the Whisper model; without one, transcription falls back to the CPU.
TikTok upload requires a TikTok account and a cookies.txt file generated via a provided guide.
Local Web-UI: streamlit run app.py --server.port=8501 --server.address=0.0.0.0
Command-Line: python main.py [OPTIONS]
Online Demo: https://huggingface.co/spaces/MatteoFasulo/Whisper-TikTok-Demo
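The requirements above can be checked before a first run. The following is a small preflight sketch, not part of the project: the function names `ffmpeg_available` and `pick_device` are hypothetical, and it assumes that Whisper runs on PyTorch, so `torch.cuda.is_available()` is used to detect a usable GPU.

```python
import shutil


def ffmpeg_available() -> bool:
    """Return True if an `ffmpeg` executable can be found on PATH."""
    return shutil.which("ffmpeg") is not None


def pick_device() -> str:
    """Return "cuda" when PyTorch reports a usable GPU, otherwise "cpu"."""
    try:
        import torch  # optional here; only needed to probe for CUDA
    except ImportError:
        return "cpu"  # PyTorch missing, so assume CPU-only
    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    print("ffmpeg on PATH:", ffmpeg_available())
    print("Whisper device:", pick_device())
```

If the first line prints `False`, install FFmpeg or fix PATH before launching the Web-UI or the command-line tool.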

Highlighted Details

Leverages Microsoft Edge Cloud TTS for natural-sounding voiceovers.
Utilizes OpenAI Whisper for accurate audio transcription and subtitle generation.
Supports customization of subtitles (font, color, size, position) via FFMPEG.
Includes an optional feature to upload generated videos directly to TikTok.
Offers both a local Web-UI (Streamlit) and command-line interface.
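As an illustration of the subtitle customization mentioned above, the sketch below builds an FFmpeg command that renders an SRT file onto a video with the `subtitles` filter and its `force_style` option. The function name `build_burn_command`, the default values, and the file names are assumptions for this example; the project's actual FFmpeg invocation may differ.

```python
def build_burn_command(video, subtitles, output,
                       font="Arial", size=24,
                       colour="&H00FFFFFF", alignment=2):
    """Build an FFmpeg argument list that renders subtitles onto a video.

    The style keys are ASS style fields: PrimaryColour uses the
    &HAABBGGRR form, and Alignment=2 is assumed to mean bottom center.
    Paths containing ':' or quotes would need extra escaping for the filter.
    """
    style = (
        f"FontName={font},FontSize={size},"
        f"PrimaryColour={colour},Alignment={alignment}"
    )
    video_filter = f"subtitles={subtitles}:force_style='{style}'"
    return [
        "ffmpeg", "-y",        # overwrite the output file if it exists
        "-i", video,           # background video
        "-vf", video_filter,   # draw the subtitles onto the frames
        "-c:a", "copy",        # keep the audio stream unchanged
        output,
    ]


if __name__ == "__main__":
    # Hypothetical file names; pass the list to subprocess.run to execute it.
    print(build_burn_command("background.mp4", "subs.srt", "final.mp4"))
```

Returning an argument list rather than a single shell string keeps the command safe to pass to `subprocess.run` without shell quoting.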

Maintenance & Community

Key dependencies include edge-tts and stable-ts.
Contributions are welcomed via Contributing Guidelines.
Upcoming features include OpenAI API integration and Reddit content extraction.

Licensing & Compatibility

Licensed under the Apache License, Version 2.0.
This permissive license allows commercial use and integration with closed-source projects.

Limitations & Caveats

TikTok upload requires manually generating a cookies file and may break when TikTok changes its API.
Whisper supports GPU acceleration; when it falls back to the CPU, processing is significantly slower.

Health Check

Last Commit: 1 month ago
Responsiveness: Inactive
Pull Requests (30d): 0
Issues (30d): 0

Star History

2 stars in the last 30 days

Explore Similar Projects

easyvideotrans by sutro-planet

Web backend for AI video translation and dubbing

Created 1 year ago

Updated 2 months ago

auto_ai_subtitle by qinL-cdy

Subtitle generator for videos

Created 2 years ago

Updated 2 years ago

cliptalk by disingn

API for TikTok video analysis

Created 2 years ago

Updated 1 year ago

Auto-YouTube-Shorts-Maker by Binary-Bytes

CLI tool for automated YouTube Shorts creation

Created 2 years ago

Updated 1 year ago

Starred by

Kyle Mathews (author of Gatsby)

frogbase by hayabhay

Tool for turning multimedia into searchable knowledge

Created 3 years ago

Updated 2 years ago

flycut-caption by x007xyz

AI-powered video subtitle editor component

Created 3 months ago

Updated 2 months ago

SmartSub by buxuku

Cross-platform tool for batch generating & translating video/audio subtitles

Created 1 year ago

Updated 2 days ago

Chenyme-AAVT by chenyme

All-in-one tool for media translation automation

Created 2 years ago

Updated 9 months ago

SoniTranslate by R3gm

Gradio web UI for video translation with synchronized audio

Created 2 years ago

Updated 1 month ago

short-video-maker by gyoridavid

CLI tool for automated short-form video creation

Created 9 months ago

Updated 6 months ago

KrillinAI by krillinai

Video tool for translation and dubbing using LLMs

Created 1 year ago

Updated 1 month ago

Starred by

Chip Huyen (author of "AI Engineering" and "Designing Machine Learning Systems")

VideoLingo by Huanshere

AI tool for automated video translation, localization, and dubbing

Created 1 year ago

Updated 7 months ago
