WebRPA  by pmh1314520

Visual automation tool for web and desktop tasks

Created 1 month ago
1,189 stars

Top 32.5% on SourcePulse

GitHubView on GitHub
Project Summary

A powerful, visual, no-code automation tool designed for building complex workflows through a drag-and-drop interface. It targets users needing to automate web scraping, form filling, testing, and data processing tasks without extensive programming knowledge. The tool offers a comprehensive suite of modules, integrating AI capabilities and system-level automation, significantly accelerating development and reducing the barrier to entry for automation projects.

How It Works

The project employs a client-server architecture with a React-based frontend and a Python FastAPI backend. Users interact with a visual editor, dragging and connecting pre-built modules representing various automation actions. The backend executes these workflows using Playwright for browser automation and integrates with numerous libraries for data manipulation, database access, file operations, media processing, and AI services. This modular, visual approach allows for rapid workflow construction and modification.

Quick Start & Requirements

  • Installation: Download the latest 7z release package, extract it, and run 双击启动WebRPA本地服务.bat.
  • Environment: Strictly Windows 10/11. The project bundles Python 3.13 and Node.js, eliminating the need for separate installations.
  • Access: Backend service available at http://localhost:8000, frontend at http://localhost:5173.
  • Development: Requires navigating to backend and frontend directories, installing dependencies (pip install -r requirements.txt for backend, npm install for frontend), and running development servers (python run.py for backend, npm run dev for frontend).

Highlighted Details

  • Extensive module library (170+ modules) covering web, data, file, media, AI, system, and database operations.
  • Advanced AI integrations, including natural language-driven data extraction, AI element selectors, and support for multiple LLM providers (OpenAI, Gemini, etc.).
  • Comprehensive media processing capabilities, including format conversion, compression, editing, and M3U8 video downloading.
  • Includes a macro recorder for capturing and replaying mouse and keyboard actions, alongside real mouse/keyboard simulation.

Maintenance & Community

The project is primarily developed by 青云制作_彭明航. No specific community links (Discord/Slack) or detailed maintenance schedules are provided in the README.

Licensing & Compatibility

Licensed under CC BY-NC-SA 4.0. This license requires attribution, prohibits commercial use without a separate license, and mandates that derivative works be shared under the same terms. Commercial licenses are available for purchase from the author.

Limitations & Caveats

The tool is exclusively designed for and limited to Windows operating systems. The README notes that source code in the repository will not be updated, directing users to download the latest version from the Releases page.

Health Check
Last Commit

2 days ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
23
Star History
1,021 stars in the last 30 days

Explore Similar Projects

Starred by Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), Abubakar Abid Abubakar Abid(Cofounder of Gradio), and
3 more.

owl by camel-ai

0.2%
19k
Multi-agent framework for real-world task automation
Created 11 months ago
Updated 1 day ago
Feedback? Help us improve.