ai_webui  by jasonaidm

Universal AI creation web UI

Created 2 years ago
260 stars

Top 97.8% on SourcePulse

Project Summary

AI-WEBUI offers a comprehensive, browser-based platform for AI-powered content creation, targeting users involved in short video production and general AI media processing. It integrates functionalities like image segmentation, object tracking, speech recognition, speech synthesis, and video manipulation, aiming to streamline complex workflows.

How It Works

The project leverages a modular architecture, allowing users to run specific AI functions or combine multiple models for advanced tasks. It supports various input types (video, audio, image, text) and provides configuration files to manage different operational modes, from single-function demos to full-suite operation. This approach enables flexibility and caters to varying hardware capabilities.

Quick Start & Requirements

  • Install: Clone the repository, create a conda environment (conda create -n aiwebui python=3.11), activate it (conda activate aiwebui), install dependencies (pip install -r requirements.txt), and install ffmpeg (apt install ffmpeg -y).
  • Run: Execute python webui.py -c ./configs/webui_configs.yml. Access the interface at http://localhost:9090/?__theme=dark.
  • Prerequisites: Python 3.11, conda, ffmpeg. GPU with sufficient VRAM is recommended for optimal performance, especially for combined functions. Model files must be downloaded separately.
  • Docs: Chinese-language documentation (中文文档)
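
The install and run steps above can be checked before launch with a small preflight script. This is a minimal sketch, not part of the project: the function names `preflight` and `launch_command` are hypothetical, and the only facts taken from the steps above are the Python version, the ffmpeg requirement, the config path, and the `webui.py -c` invocation.

```python
import shutil
import sys
from pathlib import Path

# Config path copied from the Run step; adjust if your checkout differs.
CONFIG_PATH = Path("./configs/webui_configs.yml")


def preflight(config_path=CONFIG_PATH, required_python=(3, 11)):
    """Return a list of human-readable problems; an empty list means ready.

    Hypothetical helper: it only checks the prerequisites listed above.
    """
    problems = []
    if sys.version_info[:2] != required_python:
        found = ".".join(str(part) for part in sys.version_info[:2])
        wanted = ".".join(str(part) for part in required_python)
        problems.append(f"expected Python {wanted}, found {found}")
    if shutil.which("ffmpeg") is None:
        problems.append("ffmpeg not found on PATH")
    if not Path(config_path).is_file():
        problems.append(f"config file not found: {config_path}")
    return problems


def launch_command(config_path=CONFIG_PATH):
    """Build the argument list equivalent to the Run step (does not execute it)."""
    return [sys.executable, "webui.py", "-c", str(config_path)]


if __name__ == "__main__":
    for problem in preflight():
        print("WARNING:", problem)
    print("Launch with:", " ".join(launch_command()))
```

Run from the repository root inside the activated conda environment, the script prints any missing prerequisites and the launch command. It does not check for model weights, which still have to be downloaded separately.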

Highlighted Details

  • Supports image segmentation (panorama, coordinate-based, text-prompted), speech recognition/synthesis, chatbot (text/voice), and video processing (watermark removal, object tracking, frame extraction, translation, BGM addition).
  • Offers single-function demos for resource-constrained environments.
  • Provides downloadable model weights for various AI tasks, including ChatGLM2-6B, SAM, FastSAM, ProPainter, and Whisper.
  • Configuration files allow fine-grained control over model loading and function execution.

Maintenance & Community

The project encourages contributions via issues for suggestions and feature requests. It references several foundational AI projects, including Segment-and-Track-Anything, ProPainter, ChatGLM2-6B, segment-anything, FastSAM, and Whisper.

Licensing & Compatibility

The repository does not explicitly state a license. The included models have their own licenses, which may affect commercial use or closed-source integration.

Limitations & Caveats

Users with less than 8GB of GPU memory may experience performance issues, particularly when using larger models. The project requires manual download and placement of model weight files.
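
One way to act on the 8GB caveat is to read the total GPU memory and decide between a single-function demo and the full suite. The sketch below is illustrative only: the helper names and the "single-function" / "full-suite" labels are assumptions rather than project options, and the mapping from memory to mode is a rule of thumb based on the caveat above. It assumes an NVIDIA GPU with `nvidia-smi` installed.

```python
import shutil
import subprocess

# Threshold taken from the caveat above (8 GB expressed in MiB).
MIN_VRAM_MIB = 8 * 1024


def parse_total_vram_mib(smi_output):
    """Parse nvidia-smi CSV output: one integer (MiB) per GPU, one per line."""
    return [int(line.strip()) for line in smi_output.splitlines() if line.strip()]


def query_total_vram_mib():
    """Return total memory per GPU in MiB, or an empty list if unavailable."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.total", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return []
    return parse_total_vram_mib(result.stdout)


def suggest_mode(vram_mib_per_gpu, minimum=MIN_VRAM_MIB):
    """Hypothetical rule of thumb: below the threshold, prefer single-function demos."""
    if not vram_mib_per_gpu or max(vram_mib_per_gpu) < minimum:
        return "single-function"
    return "full-suite"


if __name__ == "__main__":
    print("Suggested mode:", suggest_mode(query_total_vram_mib()))
```

Some cards report slightly less than their nominal capacity, so the threshold may need lowering for a GPU marketed as 8GB.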

Health Check

  • Last Commit: 1 year ago
  • Responsiveness: Inactive
  • Pull Requests (30d): 0
  • Issues (30d): 0
  • Star History: 4 stars in the last 30 days
