Discover and explore top open-source AI tools and projects—updated daily.
scottvrTranslate natural language to FFmpeg commands
Top 84.3% on SourcePulse
This tool translates natural language descriptions of video and audio tasks into executable FFmpeg commands, targeting users who frequently work with FFmpeg but find its syntax complex. It offers a local, interactive, and customizable solution to streamline media manipulation workflows.
How It Works
The tool leverages a local Large Language Model (LLM) running via llama-cpp-python to interpret natural language prompts. It then generates corresponding FFmpeg commands, which are presented to the user for review and optional execution. The use of llama-cpp-python allows for GPU acceleration by offloading model layers, significantly speeding up inference.
Quick Start & Requirements
chmod +x wtffmpeg.py, and run ./wtffmpeg.py. Optionally, create a symlink for wtff.llama-cpp-python compiled with hardware acceleration (CUDA for NVIDIA, Metal for Apple Silicon, or CPU optimizations). A GGUF-formatted LLM model is required (e.g., Phi-3-mini-4k-instruct-gguf).llama-cpp-python with hardware acceleration, and then installing the project. Model download is separate.Highlighted Details
llama-cpp-python for faster inference.-x).Maintenance & Community
The project is a personal endeavor, described as "humorous performance art" that "borders on being actually useful." No specific community channels or roadmap are detailed.
Licensing & Compatibility
The repository does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.
Limitations & Caveats
The tool is experimental ("YMMV," "Use at your own risk"). Generated commands may require manual correction if they contain invalid FFmpeg syntax or flags. The author is not responsible for any data loss or damage.
3 months ago
1 day
lxe
harry0703