jaaz  by 11cafe

Open-source AI design agent

created 2 months ago
1,696 stars

Top 25.6% on sourcepulse

GitHubView on GitHub
Project Summary

Jaaz is an open-source, locally deployable AI design agent designed for creative professionals and power users. It aims to provide a unified platform for image generation, editing, and storyboarding, acting as a local alternative to tools like Lovart, with the flexibility to integrate both local and cloud-based AI models.

How It Works

Jaaz leverages a hybrid model deployment strategy, allowing users to connect to cloud APIs (OpenAI, Replicate, Claude) or run models locally via Ollama and ComfyUI. Its core functionality is driven by a Smart Prompt Agent powered by LLMs, which interprets user ideas to generate optimized prompts for image and storyboard creation. Interactive editing features, such as object insertion and style transfer, are enabled through integrations like Flux Kontext, offering chat-based control.

Quick Start & Requirements

  • Install: Clone the repository (git clone https://github.com/11cafe/localart), navigate to the server directory, install dependencies (pip install -r requirements.txt), and run (python main.py). For development, additional npm commands are required.
  • Prerequisites: Python, Node.js (for development), and optionally Ollama or API keys for cloud services (OpenAI, Replicate, Claude, Google).
  • Resources: Local model execution may require significant GPU resources.
  • Docs: 中文入门指南

Highlighted Details

  • Supports local model execution via Ollama and ComfyUI for 100% free usage.
  • Integrates with various image generation models (GPT-4O, Recraft, Flux, Google Imagen) through Replicate.
  • Features interactive image editing with object insertion and style transfer via Flux Kontext.
  • Planned Video Agent for video generation and editing using models like Wan2.1 and Kling.

Maintenance & Community

  • Actively maintained, with a call to "Star Jaaz on GitHub" for updates.
  • Community links (Discord/Slack) are not explicitly provided in the README.

Licensing & Compatibility

  • The README does not explicitly state a license. The repository structure suggests a dual-frontend/backend setup. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

The "Infinite Canvas & Storyboarding" feature is marked as "Coming Soon." The README mentions npm install --force for development, which can sometimes indicate potential dependency conflicts or instability.

Health Check
Last commit

19 hours ago

Responsiveness

Inactive

Pull Requests (30d)
79
Issues (30d)
11
Star History
1,733 stars in the last 90 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), Pietro Schirano Pietro Schirano(Founder of MagicPath), and
1 more.

SillyTavern by SillyTavern

3.2%
17k
LLM frontend for power users
created 2 years ago
updated 3 days ago
Feedback? Help us improve.