skills  by wlzh

AI toolkit for media automation and content generation

Created 2 months ago
497 stars

Top 62.4% on SourcePulse

GitHubView on GitHub
Project Summary

Summary The wlzh/skills repository offers a curated collection of practical, open-source automation tools and scripts, primarily developed for personal utility and shared for broader adoption. It targets users seeking to streamline content creation, media processing, and data aggregation, providing solutions for YouTube-to-podcast conversion, AI voice modification, code analysis, and web scraping.

How It Works This project employs a modular "skills" architecture, where each script addresses a distinct task. Core functionalities integrate AI models for voice cloning (RVC), image generation (ModelScope, Gemini), and speech recognition (FunASR). Automation leverages scripting, FFmpeg for media, and APIs for services like YouTube and X (Twitter). The design emphasizes composability, enabling users to chain skills for complex automated processes.

Quick Start & Requirements

  • Installation: Each skill operates independently. Basic execution: python3 ~/.claude/skills/ /scripts/script.py [arguments].
  • Prerequisites: Dependencies are skill-specific; consult individual READMEs. Python 3.10 noted for audiocut-keyword.
  • Resources: No universal hardware requirements (e.g., GPU/CUDA) specified, though AI skills may need them.
  • Links: Video and text tutorials available via YouTube and Twitter/X.

Highlighted Details

  • youtube-to-xiaoyuzhou: Automates YouTube video-to-podcast conversion with AI covers, keyword filtering, voice changing, and scheduled publishing.
  • voice-changer / video-voice-changer: Uses RVC AI for realistic voice conversion on audio/video, supporting diverse languages and presets.
  • code-roaster: Delivers humorous, Gordon Ramsay-style code reviews, analyzing quality with configurable intensity.
  • youtube-to-blog-post: Transforms YouTube videos into SEO-optimized Hexo blog posts, auto-extracting content and metadata.
  • quark-mswnlz-publisher: Automates publishing from Quark Drive to GitHub, managing resource organization, link generation, and site updates.
  • x-fetcher / wespy-fetcher: Scrapes X (Twitter) and WeChat content, saving posts/articles as Markdown with media downloads and interaction data.

Maintenance & Community The repository encourages community engagement via Issues and Pull Requests. Tutorials are provided via YouTube and Twitter/X. As a personal project ("大部分是自己实际需要搞得"), maintenance levels may vary across individual skills.

Licensing & Compatibility

  • License: MIT License.
  • Compatibility: Permissive MIT license, suitable for commercial use and closed-source integration.

Limitations & Caveats Setup complexity and dependencies vary significantly across skills, requiring consultation of individual documentation. The project's origin as a personal collection may imply varying support levels and potential for unannounced changes to specific tools.

Health Check
Last Commit

1 week ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
321 stars in the last 30 days

Explore Similar Projects

Feedback? Help us improve.