Discover and explore top open-source AI tools and projects—updated daily.
axtonliuAI for contextual article illustrations and cover generation
New!
Top 83.9% on SourcePulse
This project addresses the time-consuming nature of creating contextual illustrations for articles and content. It targets newsletter writers, YouTube creators, and technical bloggers by offering an AI-powered solution that automatically generates relevant visuals, including high-CTR YouTube thumbnails, in minutes, thereby enhancing content engagement and visual consistency.
How It Works
The core innovation is a "Tri-Engine System" that intelligently selects between Gemini (for creative visuals), Excalidraw (for hand-drawn diagrams), or Mermaid (for structured diagrams) based on content analysis. This is complemented by "Smart Position Detection" to identify optimal placement points within articles and a "Cover Learning System" that incorporates YouTube thumbnail best practices. This multi-engine approach automates diverse illustration needs, from conceptual art to technical diagrams, with a focus on contextual relevance and efficiency.
Quick Start & Requirements
Installation involves cloning the repository into the Claude Code Skills directory: git clone https://github.com/axtonliu/smart-illustrator.git ~/.claude/skills/smart-illustrator. Key prerequisites include the Claude Code CLI, Bun runtime, and Mermaid CLI (npm install -g @mermaid-js/mermaid-cli). For Excalidraw export, additional dependencies require running npm install && npx playwright install firefox within the scripts directory. An optional Gemini API Key is needed for creative visuals. A full walkthrough is available at https://youtu.be/TbyJ3imLuXQ.
Highlighted Details
Maintenance & Community
The project is explicitly marked as "Experimental" and a "public prototype." The primary focus is on demonstrating system integration rather than active codebase maintenance. Contributions are welcomed for reproducible bug reports, documentation, and small PRs, but feature requests may not be addressed due to limited maintenance capacity. No specific community channels (e.g., Discord, Slack) are listed.
Licensing & Compatibility
The project is released under the permissive MIT License. This license generally allows for broad compatibility, including commercial use and integration into closed-source projects, with minimal restrictions beyond attribution.
Limitations & Caveats
As an experimental prototype, the system does not yet cover all input scales or edge cases, and output quality can fluctuate based on model versions and input structure. The developer's primary goal is showcasing system mechanics, not maintaining the codebase, meaning users encountering issues should provide detailed, reproducible bug reports.
2 days ago
Inactive
Yutong-Zhou-cv
QwenLM