GPT-4 browser automation tool for repetitive tasks
Taxy AI is an open-source browser extension that leverages GPT-4 to automate repetitive web tasks. It's designed for technical users and researchers who want to build or test browser automation workflows. Actions execute locally in the browser, and no data goes to external servers other than the simplified page content and instructions sent to the LLM.
How It Works
Taxy operates via an "Action Cycle." It extracts the DOM, simplifies it to interactive elements, and sends this, along with user instructions, to an LLM (GPT-3.5/GPT-4). The LLM can then choose to perform actions like click(id) or setValue(id, text). Taxy executes these actions using the chrome.debugger API, updates the DOM, and repeats the cycle until the task is complete, the user intervenes, or an error occurs. This approach allows for dynamic, instruction-based automation driven by powerful language models.
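The cycle described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not Taxy's actual code: the Action type, parseAction, runTask, the Page and Model interfaces, and the finish() completion signal are all hypothetical names introduced here. Only click(id), setValue(id, text), and the 50-action cap come from this page. The browser side is injected as an interface so the loop can run outside Chrome; in the extension, perform would presumably be backed by the chrome.debugger API.

```typescript
// Hypothetical action shape. The source names only click(id) and
// setValue(id, text); "finish" is an assumed completion signal.
type Action =
  | { kind: "click"; id: number }
  | { kind: "setValue"; id: number; text: string }
  | { kind: "finish" };

// Parse one LLM reply such as `click(12)` or `setValue(3, "hello")`.
// Returns null when the reply is not a recognized action.
function parseAction(reply: string): Action | null {
  const click = /^click\((\d+)\)$/.exec(reply);
  if (click) return { kind: "click", id: Number(click[1]) };

  const setValue = /^setValue\((\d+),\s*("[\s\S]*")\)$/.exec(reply);
  if (setValue) {
    try {
      // Assumes the text argument is written as a JSON string literal.
      const text: unknown = JSON.parse(setValue[2]);
      if (typeof text === "string") {
        return { kind: "setValue", id: Number(setValue[1]), text };
      }
    } catch (_err) {
      return null;
    }
  }

  if (/^finish\(\)$/.test(reply)) return { kind: "finish" };
  return null;
}

// Assumed abstraction over the page: a simplified-DOM snapshot and an executor.
interface Page {
  snapshot(): Promise<string>;
  perform(action: Action): Promise<void>;
}

// Assumed abstraction over the LLM call.
type Model = (instructions: string, dom: string, history: Action[]) => Promise<string>;

type Outcome = "complete" | "error" | "limit";

// Repeat snapshot -> ask model -> act until the model finishes,
// something fails, or the action cap is reached.
async function runTask(
  instructions: string,
  page: Page,
  model: Model,
  maxActions = 50
): Promise<{ outcome: Outcome; history: Action[] }> {
  const history: Action[] = [];
  while (history.length < maxActions) {
    const dom = await page.snapshot();
    const reply = await model(instructions, dom, history);
    const action = parseAction(reply.trim());
    if (action === null) return { outcome: "error", history };
    if (action.kind === "finish") return { outcome: "complete", history };
    try {
      await page.perform(action);
    } catch (_err) {
      return { outcome: "error", history };
    }
    history.push(action);
  }
  return { outcome: "limit", history };
}
```

User intervention is left out of the sketch; one way to add it would be a cancellation flag checked at the top of each iteration.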
Quick Start & Requirements
Run yarn install, then yarn start. Load the generated build folder as an unpacked extension in Chrome (chrome://extensions/).
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The extension is in a research preview state, meaning many workflows may fail or confuse the agent. It is not yet available on the Chrome Web Store, so it must be installed locally. Each task is capped at 50 actions.
6 months ago
Inactive