wtffmpeg  by scottvr

Translate natural language to FFmpeg commands

Created 1 month ago
314 stars

Top 85.9% on SourcePulse

GitHubView on GitHub
Project Summary

This tool translates natural language descriptions of video and audio tasks into executable FFmpeg commands, targeting users who frequently work with FFmpeg but find its syntax complex. It offers a local, interactive, and customizable solution to streamline media manipulation workflows.

How It Works

The tool leverages a local Large Language Model (LLM) running via llama-cpp-python to interpret natural language prompts. It then generates corresponding FFmpeg commands, which are presented to the user for review and optional execution. The use of llama-cpp-python allows for GPU acceleration by offloading model layers, significantly speeding up inference.

Quick Start & Requirements

  • Install: Clone the repository, chmod +x wtffmpeg.py, and run ./wtffmpeg.py. Optionally, create a symlink for wtff.
  • Prerequisites: Python 3.8+, llama-cpp-python compiled with hardware acceleration (CUDA for NVIDIA, Metal for Apple Silicon, or CPU optimizations). A GGUF-formatted LLM model is required (e.g., Phi-3-mini-4k-instruct-gguf).
  • Setup: Installation involves cloning, setting up llama-cpp-python with hardware acceleration, and then installing the project. Model download is separate.
  • Docs: https://github.com/scottvr/wtffmpeg

Highlighted Details

  • Local LLM execution for privacy and offline use.
  • GPU acceleration via llama-cpp-python for faster inference.
  • Interactive mode for multiple commands and direct execution confirmation.
  • Customizable LLM models and system prompts.
  • Option to copy commands to clipboard or execute directly (-x).

Maintenance & Community

The project is a personal endeavor, described as "humorous performance art" that "borders on being actually useful." No specific community channels or roadmap are detailed.

Licensing & Compatibility

The repository does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The tool is experimental ("YMMV," "Use at your own risk"). Generated commands may require manual correction if they contain invalid FFmpeg syntax or flags. The author is not responsible for any data loss or damage.

Health Check
Last Commit

1 month ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
9 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Jiaming Song Jiaming Song(Chief Scientist at Luma AI).

MoneyPrinterTurbo by harry0703

0.4%
40k
AI tool for one-click short video generation from text prompts
Created 1 year ago
Updated 3 months ago
Feedback? Help us improve.