RunanywhereAI
On-device AI browser automation
Top 93.1% on SourcePulse
This project offers on-device AI-powered web automation, enabling users to perform complex tasks within their browser without relying on cloud services or API keys. It targets developers and power users seeking a private, offline solution for automating web interactions, such as data extraction and form filling, directly within their local environment.
How It Works
The system employs a multi-agent architecture: a Planner Agent decomposes the user's task into strategic steps, and a Navigator Agent executes them tactically. Both rely on WebLLM with WebGPU acceleration for local inference of large language models, so all AI processing remains on the user's device. The Navigator Agent analyzes the current web page's DOM to determine and execute actions such as clicking, typing, and data extraction. This process iterates until the user's task is completed or fails.
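The plan-then-act loop described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: every name here (Action, NavigatorDecision, Agents, Page, runTask) and the maxSteps budget are assumptions introduced for the example, and the model calls are modeled as plain synchronous functions, whereas real WebLLM inference would be asynchronous.

```typescript
// Hypothetical types for illustration; none of these names come from the project.
type Action =
  | { kind: "navigate"; url: string }
  | { kind: "click"; selector: string }
  | { kind: "type"; selector: string; text: string }
  | { kind: "extract"; selector: string }
  | { kind: "scroll"; deltaY: number }
  | { kind: "wait"; ms: number };

type NavigatorDecision =
  | { status: "act"; action: Action }
  | { status: "done"; result: string }
  | { status: "fail"; reason: string };

interface Agents {
  // Planner: turns the user's task into an ordered list of sub-goals.
  plan(task: string): string[];
  // Navigator: reads a text snapshot of the DOM and picks the next move.
  decide(subgoal: string, dom: string): NavigatorDecision;
}

interface Page {
  snapshotDom(): string;
  perform(action: Action): void;
}

interface TaskOutcome {
  ok: boolean;
  detail: string;
  steps: number;
}

// Runs each sub-goal until the Navigator reports done or fail.
// maxSteps is an assumed safety budget so the loop always terminates.
function runTask(task: string, agents: Agents, page: Page, maxSteps = 20): TaskOutcome {
  const subgoals = agents.plan(task);
  let steps = 0;
  let lastResult = "";
  for (const subgoal of subgoals) {
    for (;;) {
      if (steps >= maxSteps) {
        return { ok: false, detail: "step budget exhausted", steps };
      }
      const decision = agents.decide(subgoal, page.snapshotDom());
      if (decision.status === "fail") {
        return { ok: false, detail: decision.reason, steps };
      }
      if (decision.status === "done") {
        lastResult = decision.result;
        break; // move on to the next sub-goal
      }
      page.perform(decision.action);
      steps++;
    }
  }
  return { ok: true, detail: lastResult, steps };
}
```

Taking a fresh DOM snapshot before every decision means the Navigator always reasons about the page as it currently is, which matters because each action may change it.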
Quick Start & Requirements
Navigate to the local-browser directory and run npm install.
Build the extension with npm run build.
Open chrome://extensions, enable "Developer mode," and click "Load unpacked," selecting the dist folder.
Highlighted Details
Maintenance & Community
No specific details regarding maintainers, community channels (like Discord/Slack), or roadmap were provided in the README.
Licensing & Compatibility
The project is released under the MIT License, which generally permits commercial use and integration into closed-source projects.
Limitations & Caveats
This project is presented as a Proof-of-Concept (POC) and is not intended for production use. It lacks vision capabilities and relies solely on text-based DOM analysis. Functionality is limited to the currently active tab, and only basic actions are supported (navigate, click, type, extract, scroll, wait). Smaller LLMs may struggle with highly complex tasks. Certain browser pages (e.g., chrome://) may prevent content script execution.
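Two of these caveats lend themselves to simple guards, sketched below. The function names (isSupportedAction, isLikelyScriptable) are hypothetical and not taken from the project. The http/https-only rule is a deliberately conservative assumption: the README only names chrome:// as an example of a restricted page.

```typescript
// The six basic actions listed above; anything else a model emits is rejected.
const SUPPORTED_ACTIONS = ["navigate", "click", "type", "extract", "scroll", "wait"] as const;
type ActionKind = (typeof SUPPORTED_ACTIONS)[number];

// Hypothetical helper: checks a model-proposed action name against the supported set.
function isSupportedAction(kind: string): kind is ActionKind {
  return (SUPPORTED_ACTIONS as readonly string[]).indexOf(kind) !== -1;
}

// Hypothetical helper: conservative guess at whether a content script can run on a URL.
// Assumption: only http(s) pages are treated as scriptable; chrome:// and
// unparsable URLs are rejected.
function isLikelyScriptable(url: string): boolean {
  try {
    const protocol = new URL(url).protocol;
    return protocol === "http:" || protocol === "https:";
  } catch (_err) {
    return false; // not a valid absolute URL
  }
}
```

Validating action names before execution is one way to contain malformed output from smaller models, since an unsupported action can be reported as a failure instead of being attempted.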