flycut-caption  by x007xyz

AI-powered video subtitle editor component

Created 3 weeks ago

New!

555 stars

Top 57.8% on SourcePulse

GitHubView on GitHub
Project Summary

A React component designed for comprehensive video subtitle editing, FlyCut Caption offers AI-powered speech-to-text generation, visual editing, and video clipping capabilities. It targets developers seeking to integrate advanced subtitle functionalities into their applications, providing a robust solution for intelligent subtitle creation and manipulation with local AI processing.

How It Works

FlyCut Caption leverages a modern tech stack including React 19, TypeScript, Vite, and Tailwind CSS. Its core innovation lies in performing AI speech recognition locally within the browser using Hugging Face's Transformers.js and the Whisper model. To ensure a smooth user experience, the computationally intensive ASR processing is offloaded to Web Workers, preventing the main interface from blocking. The component provides a visual, real-time editing environment for subtitle segments, synchronized with a video player, and supports multi-format export.

Quick Start & Requirements

  • Primary Install/Run: Clone the repository (git clone https://github.com/x007xyz/flycut-caption.git), navigate into the directory, and run pnpm install followed by pnpm dev to start the development server.
  • Prerequisites: Node.js 18+ and pnpm (recommended) are required.
  • Access: The development server is accessible at http://localhost:5173.
  • Build: Production builds are generated using pnpm build.
  • Documentation: Usage guides and API references are available within the README.

Highlighted Details

  • Local AI Speech Recognition: Utilizes the Whisper model via Transformers.js for high-precision, multi-language speech-to-text directly in the browser.
  • Visual Subtitle Editing: Features an intuitive interface for selecting, deleting, and manipulating subtitle segments with real-time video preview.
  • Componentized Internationalization: Built with support for multiple languages (Chinese, English, custom) and flexible language pack management.
  • Multi-format Export: Supports exporting subtitles as SRT or JSON, and exporting processed video files.
  • Subtitle Styling: Allows customization of font, color, position, and background styles.

Maintenance & Community

The provided README does not contain specific details regarding maintainers, community channels (like Discord/Slack), or a public roadmap.

Licensing & Compatibility

The project is licensed under the MIT License with additional terms. Personal, educational, and commercial use, along with modification and distribution, are permitted. However, removing or altering logos, watermarks, or brand elements within the software interface is prohibited without explicit written permission from the FlyCut Team.

Limitations & Caveats

The component specifies browser support for Chrome 88+, Firefox 78+, Safari 14+, and Edge 88+, indicating limited compatibility with older browser versions. The use of React 19, which is still in development, may pose a risk for production stability. Additionally, the prohibition on removing brand elements might be a consideration for some commercial integrations.

Health Check
Last Commit

13 hours ago

Responsiveness

Inactive

Pull Requests (30d)
1
Issues (30d)
6
Star History
580 stars in the last 22 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems") and Jiaming Song Jiaming Song(Chief Scientist at Luma AI).

MoneyPrinterTurbo by harry0703

1.2%
46k
AI tool for one-click short video generation from text prompts
Created 1 year ago
Updated 4 months ago
Feedback? Help us improve.