mulmocast-cli  by receptron

AI-powered multi-modal content generation tool

Created 5 months ago
354 stars

Top 78.8% on SourcePulse

GitHubView on GitHub
Project Summary

MulmoCast is an AI-native presentation platform designed for collaborative content creation between humans and AI. It allows users to generate rich, multi-modal presentations, including videos, podcasts, slideshows, and manga, from a single script. The target audience includes content creators, educators, and businesses looking to streamline and diversify their presentation outputs.

How It Works

MulmoCast utilizes a proprietary intermediate language called MulmoScript, a JSON-based format that defines content structure, visuals, and narrative. AI models, such as ChatGPT and Claude, generate MulmoScript from various inputs (text, URLs, interactive prompts). The MulmoCast CLI then processes this script to produce outputs across multiple formats by orchestrating AI-driven generation of audio, images, and video. This approach enables efficient, multi-format content creation from a unified source.

Quick Start & Requirements

  • Installation: npm install -g mulmocast
  • Prerequisites: ffmpeg (install via Homebrew or download from ffmpeg.org).
  • API Keys: OPENAI_API_KEY is required. Optional keys include GOOGLE_PROJECT_ID, ANTHROPIC_API_TOKEN, REPLICATE_API_TOKEN, NIJIVOICE_API_KEY, ELEVENLABS_API_KEY, and BROWSERLESS_API_TOKEN.
  • Setup: Requires Node.js and npm. API key setup is necessary for AI generation.
  • Docs: MulmoScript Format, Image Generation

Highlighted Details

  • Supports generating content in various formats: video, podcast, slideshow, PDF, manga, and swipe anime.
  • Offers interactive script generation via CLI using various templates (e.g., children_book, ghibli_comic).
  • Includes caching mechanisms to avoid redundant generation of audio and image assets.
  • Supports multiple LLM providers (OpenAI, Anthropic, Gemini, Groq) and TTS providers (Nijivoice, ElevenLabs).

Maintenance & Community

The project appears to be actively maintained by receptron. Further community engagement details (Discord, Slack, etc.) are not explicitly provided in the README.

Licensing & Compatibility

The README does not specify a license. Users should verify licensing before commercial use or integration into closed-source projects.

Limitations & Caveats

The project is in beta (v0.1.x). Specific AI models like gpt-image-1 may require OpenAI organization verification. Some templates, like sensei_and_taro, require specific API keys (Nijivoice). The effectiveness and quality of generated content are dependent on the underlying AI models and user-provided prompts.

Health Check
Last Commit

1 day ago

Responsiveness

1 day

Pull Requests (30d)
98
Issues (30d)
4
Star History
27 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Jeffrey Morgan Jeffrey Morgan(Cofounder of Ollama), and
3 more.

modelfusion by vercel

0%
1k
TypeScript library for building AI applications
Created 2 years ago
Updated 1 year ago
Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Pietro Schirano Pietro Schirano(Founder of MagicPath), and
1 more.

SillyTavern by SillyTavern

1.1%
18k
LLM frontend for power users
Created 2 years ago
Updated 1 day ago
Feedback? Help us improve.