Discover and explore top open-source AI tools and projects—updated daily.
Intelligent photo retouching agent
Top 53.0% on SourcePulse
JarvisArt is an MLLM-driven agent designed to automate and enhance photo retouching tasks. It targets users who want to leverage professional-grade editing capabilities through natural language commands, aiming to democratize artistic photo manipulation by coordinating over 200 Adobe Lightroom tools.
How It Works
JarvisArt employs a novel two-stage training framework. It begins with Chain-of-Thought supervised fine-tuning to establish foundational reasoning skills. This is followed by Group Relative Policy Optimization for Retouching (GRPO-R), a technique designed to improve the agent's decision-making and proficiency in utilizing a wide array of editing tools. This approach allows JarvisArt to mimic professional artist workflows and understand complex retouching instructions.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The project is actively updated, with recent releases including inference code, Gradio and Hugging Face demos. A WeChat discussion group is available for user support and feedback.
Licensing & Compatibility
The project is released under an unspecified license. The README mentions plans to release the MMArt dataset with an open license, but this is not yet complete. Compatibility with commercial or closed-source applications is not specified.
Limitations & Caveats
The MMArt dataset and full training code are not yet released. The project relies on Adobe Lightroom, which is proprietary software, potentially limiting its standalone usability and integration into non-Lightroom workflows.
1 week ago
Inactive