NanoBanana-PPT-Skills  by op7418

AI-driven PPT generation with dynamic video transitions

Created 2 weeks ago

New!

971 stars

Top 38.1% on SourcePulse

GitHubView on GitHub
Project Summary

Summary NanoBanana PPT Skills automates high-quality presentation creation using AI, generating images and videos from document analysis. It targets users seeking efficient, automated design, offering a complete pipeline from content extraction to interactive video output, significantly reducing manual effort.

How It Works The system leverages Google's Nano Banana Pro (Gemini 3 Pro Image Preview) for slide image generation and KeLing AI for video transitions. It intelligently analyzes input documents, structures content, produces visuals in selected styles, and uses FFmpeg for synthesizing presentations with interactive playback and full video export.

Quick Start & Requirements Installation is recommended via Claude Code prompt or manual steps: clone the repo (https://github.com/op7418/NanoBanana-PPT-Skills), set up a Python virtual environment (venv), and install dependencies (pip install google-genai pillow python-dotenv). Essential API keys for Google AI (GEMINI_API_KEY) and optionally KeLing AI (KLING_ACCESS_KEY, KLING_SECRET_KEY) are required, configured via .env. FFmpeg system installation is needed for video generation.

Highlighted Details

  • AI-powered generation of high-resolution (2K/4K) PPT images and transition videos, supporting distinct visual styles and extensibility.
  • Features interactive video playback and full MP4 export, synthesizing all transitions and slides.
  • Integrates as a Claude Code Skill for conversational PPT generation.

Maintenance & Community Maintained by creator 歸藏 (op7418), the project encourages contributions via GitHub Issues. Update logs indicate ongoing development, with version 2.0.0 introducing significant video features.

Licensing & Compatibility Released under the permissive MIT License, allowing broad usage, modification, distribution, and commercial application, provided the original copyright notice is included. Compatible with closed-source projects.

Limitations & Caveats Core functionality depends on external API keys (Google AI, KeLing AI), potentially incurring costs. Advanced video features require FFmpeg installation. Given its recent release history (v2.0.0 dated Jan 11, 2026), users should anticipate evolving features or undiscovered issues.

Health Check
Last Commit

1 week ago

Responsiveness

Inactive

Pull Requests (30d)
2
Issues (30d)
2
Star History
979 stars in the last 18 days

Explore Similar Projects

Feedback? Help us improve.