RPA  by A9T9

Open-source RPA software with computer vision and OCR

created 8 years ago
1,668 stars

Top 25.9% on sourcepulse

GitHubView on GitHub
Project Summary

Ui.Vision RPA is an open-source Robotic Process Automation tool designed for automating web and desktop tasks. It targets individual users and businesses seeking to automate repetitive processes, offering features like computer vision, OCR, and LLM integration, alongside Selenium IDE compatibility.

How It Works

Ui.Vision functions as a browser extension, enabling it to interact with web pages and applications. It leverages a combination of JavaScript for browser automation, computer vision for visual recognition of UI elements, and OCR for text extraction. The integration with LLMs and Anthropic's models suggests advanced capabilities for understanding and interacting with content contextually.

Quick Start & Requirements

  • Install via Chrome, Firefox, or Edge web stores.
  • For development: Node.js v20.11.1, npm v10.2.4.
  • Build commands: npm i -f, npm run build, npm run build-ff, npm run ext.
  • Official Homepage: https://ui.vision/

Highlighted Details

  • Cross-platform support for macOS, Linux, and Windows.
  • Includes Selenium IDE import/export functionality.
  • Features computer vision and OCR capabilities.
  • Integrates with LLMs, including Anthropic.

Maintenance & Community

  • User forum for questions and support.
  • Contact: TEAM AT UI.VISION

Licensing & Compatibility

  • Free for private and commercial purposes.

Limitations & Caveats

  • Building the extension is only necessary for developers; end-users should install from web stores.
Health Check
Last commit

3 months ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
134 stars in the last 90 days

Explore Similar Projects

Starred by Boris Cherny Boris Cherny(Creator of Claude Code; MTS at Anthropic), Chip Huyen Chip Huyen(Author of AI Engineering, Designing Machine Learning Systems), and
2 more.

TagUI by aisingapore

0.1%
6k
Free RPA tool for automating repetitive tasks on websites, desktop apps, and command lines
created 8 years ago
updated 5 months ago
Starred by Addy Osmani Addy Osmani(Engineering Leader on Google Chrome), Victor Taelin Victor Taelin(Author of Bend, Kind, HVM), and
1 more.

chatbox by chatboxai

0.3%
36k
Desktop client app for AI models/LLMs
created 2 years ago
updated 6 days ago
Feedback? Help us improve.