mulmocast-cli by receptron

AI-powered multi-modal content generation tool

Created 1 year ago

464 stars

Top 64.6% on SourcePulse

Project Summary

MulmoCast is an AI-native presentation platform designed for collaborative content creation between humans and AI. It allows users to generate rich, multi-modal presentations, including videos, podcasts, slideshows, and manga, from a single script. The target audience includes content creators, educators, and businesses looking to streamline and diversify their presentation outputs.

How It Works

MulmoCast utilizes a proprietary intermediate language called MulmoScript, a JSON-based format that defines content structure, visuals, and narrative. AI models, such as ChatGPT and Claude, generate MulmoScript from various inputs (text, URLs, interactive prompts). The MulmoCast CLI then processes this script to produce outputs across multiple formats by orchestrating AI-driven generation of audio, images, and video. This approach enables efficient, multi-format content creation from a unified source.

Quick Start & Requirements

Installation: npm install -g mulmocast
Prerequisites: ffmpeg (install via Homebrew or download from ffmpeg.org).
API Keys: OPENAI_API_KEY is required. Optional keys include GOOGLE_PROJECT_ID, ANTHROPIC_API_TOKEN, REPLICATE_API_TOKEN, NIJIVOICE_API_KEY, ELEVENLABS_API_KEY, and BROWSERLESS_API_TOKEN.
Setup: Requires Node.js and npm. API key setup is necessary for AI generation.
Docs: MulmoScript Format, Image Generation

Highlighted Details

Supports generating content in various formats: video, podcast, slideshow, PDF, manga, and swipe anime.
Offers interactive script generation via CLI using various templates (e.g., children_book, ghibli_comic).
Includes caching mechanisms to avoid redundant generation of audio and image assets.
Supports multiple LLM providers (OpenAI, Anthropic, Gemini, Groq) and TTS providers (Nijivoice, ElevenLabs).

Maintenance & Community

The project appears to be actively maintained by receptron. Further community engagement details (Discord, Slack, etc.) are not explicitly provided in the README.

Licensing & Compatibility

The README does not specify a license. Users should verify licensing before commercial use or integration into closed-source projects.

Limitations & Caveats

The project is in beta (v0.1.x). Specific AI models like gpt-image-1 may require OpenAI organization verification. Some templates, like sensei_and_taro, require specific API keys (Nijivoice). The effectiveness and quality of generated content are dependent on the underlying AI models and user-provided prompts.

mulmocast-cli by receptron

Explore Similar Projects

english-compiler by uilicious

octo-terminal-releases by johunsang

auto-shorts by alamshafil

payload-ai by ashbuilds

AI-ContentCraft by nicekate

client by google-gemini-php

modelfusion by vercel

cli by MiniMax-AI

video-podcast-maker by Agents365-ai

story-spark-ai by ronisarkarexe

baoyu-skills by JimLiu

SillyTavern by SillyTavern