autoMate  by yuruotong1

AI-driven local automation assistant using natural language

Created 1 year ago
3,690 stars

Top 13.1% on SourcePulse

GitHubView on GitHub
Project Summary

This project provides an AI-driven local automation assistant that uses natural language to control computer operations, targeting users seeking to automate repetitive tasks without coding. It aims to redefine user-computer interaction by leveraging large language models for task planning and execution.

How It Works

autoMate functions as an AI+RPA tool, utilizing OmniParser for understanding requirements and planning tasks. It simulates human vision to comprehend screen content and makes autonomous decisions based on task objectives. The system supports local deployment, ensuring data security and privacy.

Quick Start & Requirements

  • Installation: Recommended via conda with Python 3.12.
    git clone https://github.com/yuruotong1/autoMate.git
    cd autoMate
    conda create -n "automate" python==3.12
    conda activate automate
    python install.py
    python main.py
    
  • Prerequisites: NVIDIA GPU recommended for optimal performance due to OCR usage. Minimum 4GB VRAM suggested. CUDA version compatibility with PyTorch is crucial.
  • Access: Web UI available at http://localhost:7888/.
  • Models: Supports OpenAI models (GPT-4o, etc.) and Yeka.

Highlighted Details

  • No-code automation via natural language descriptions.
  • Full interface control across any visual application.
  • Simplified installation, with support for Chinese environments.
  • Local deployment for data privacy.

Maintenance & Community

The project is in early stages with rapid iteration. Technical discussions and development notes are shared via a "Knowledge Planet" (QR code provided in README).

Licensing & Compatibility

The README does not explicitly state a license. Compatibility for commercial use or closed-source linking is not specified.

Limitations & Caveats

Performance may be slow without an NVIDIA GPU due to heavy OCR reliance. The project is in rapid development, implying potential for breaking changes. Support is currently limited to specific OpenAI and Yeka models due to multimodal and structured output requirements.

Health Check
Last Commit

4 months ago

Responsiveness

1 day

Pull Requests (30d)
0
Issues (30d)
0
Star History
45 stars in the last 30 days

Explore Similar Projects

Starred by Andrew Ng Andrew Ng(Founder of DeepLearning.AI; Cofounder of Coursera; Professor at Stanford), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
2 more.

vision-agent by landing-ai

0.1%
5k
Visual AI agent for generating runnable vision code from image/video prompts
Created 1 year ago
Updated 2 weeks ago
Starred by Boris Cherny Boris Cherny(Creator of Claude Code; MTS at Anthropic), Chip Huyen Chip Huyen(Author of "AI Engineering", "Designing Machine Learning Systems"), and
3 more.

TagUI by aisingapore

0.1%
6k
Free RPA tool for automating repetitive tasks on websites, desktop apps, and command lines
Created 8 years ago
Updated 6 months ago
Feedback? Help us improve.