lsdefine
Autonomous PC agent for desktop automation
Top 50.8% on SourcePulse
This project provides a minimalist (~3,300 lines) AI-powered agent framework enabling LLMs to gain physical-level control over a PC's operating system, including browser, terminal, file system, and input devices. It targets technically savvy users seeking autonomous desktop automation, offering a self-growing skill tree without heavy dependencies like Electron or Docker.
How It Works
The core is a 92-line Sense-Think-Act loop (agent_loop.py) that utilizes 7 atomic tools (code execution, file I/O, web interaction, user prompts) to execute instructions. New tasks are learned, codified into Standard Operating Procedures (SOPs), and stored persistently. This "seed" philosophy allows the agent to autonomously discover, build, and remember new capabilities, effectively growing its own skill tree from a minimal codebase.
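The loop and SOP storage described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the project's agent_loop.py: the names make_tools, fake_llm, run_agent, and save_sop, the two stand-in tools, and the JSON SOP layout are all hypothetical, and the LLM is replaced by a scripted stub so the sketch runs offline.

```python
import json
import tempfile
from pathlib import Path
from typing import Callable, Dict, List

# A tool takes one string argument and returns a string observation.
# The source mentions 7 atomic tools; only two stand-ins are sketched here,
# and their names and argument format are assumptions.
Tool = Callable[[str], str]


def make_tools(workdir: Path) -> Dict[str, Tool]:
    def write_file(arg: str) -> str:
        # Argument format "name|text" is invented for this sketch.
        name, _, text = arg.partition("|")
        (workdir / name).write_text(text, encoding="utf-8")
        return f"wrote {name}"

    def read_file(arg: str) -> str:
        return (workdir / arg).read_text(encoding="utf-8")

    return {"write_file": write_file, "read_file": read_file}


def fake_llm(task: str, history: List[dict]) -> dict:
    """Scripted stand-in for a real LLM call (hypothetical, not the project's API)."""
    script = [
        {"tool": "write_file", "arg": "note.txt|hello"},
        {"tool": "read_file", "arg": "note.txt"},
        {"done": True},
    ]
    return script[len(history)]


def run_agent(task: str, tools: Dict[str, Tool], think, max_steps: int = 10) -> List[dict]:
    """Sense-Think-Act: decide from past observations, act, record the result."""
    history: List[dict] = []
    for _ in range(max_steps):
        action = think(task, history)                        # Think (history carries what was sensed)
        if action.get("done"):
            break
        observation = tools[action["tool"]](action["arg"])   # Act
        history.append({**action, "observation": observation})  # Sense
    return history


def save_sop(task: str, history: List[dict], sop_dir: Path) -> Path:
    """Persist the successful steps as a reusable procedure (assumed JSON layout)."""
    sop_dir.mkdir(parents=True, exist_ok=True)
    steps = [{"tool": h["tool"], "arg": h["arg"]} for h in history]
    path = sop_dir / "sop.json"
    path.write_text(json.dumps({"task": task, "steps": steps}, indent=2), encoding="utf-8")
    return path


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        work = Path(tmp)
        hist = run_agent("write and read a note", make_tools(work), fake_llm)
        sop_path = save_sop("write and read a note", hist, work / "sops")
        print(sop_path.read_text(encoding="utf-8"))
```

Keeping the loop free of tool-specific logic is what would let a small core stay small: new capabilities arrive as stored procedures rather than as new code paths in the loop itself.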
Quick Start & Requirements
Install dependencies: pip install streamlit pywebview
Configure: copy mykey_template.py to mykey.py and insert your LLM API key.
Run: python launch.pyw for the GUI, or python agentmain.py for the CLI (tested on Android Termux).
Docs: see WELCOME_NEW_USER.md for the detailed bootstrap.
Highlighted Details
Maintenance & Community
No details on maintainers, community channels (e.g., Discord, Slack), or a project roadmap are available.
Licensing & Compatibility
Limitations & Caveats
The agent's functionality is dependent on an external LLM API key, introducing potential costs and third-party service reliance. The autonomous learning mechanism, while powerful, may require careful oversight to manage emergent behaviors or ensure task accuracy.