Autonomous GPT agent with browser access for task completion
Top 61.2% on sourcepulse
This project provides an experimental autonomous GPT agent capable of interacting with a web browser to accomplish user-defined tasks. It is designed for developers and researchers interested in AI-driven automation and agentic workflows, offering a practical demonstration of GPTs controlling browser actions.
How It Works
The agent uses GPT-4 to interpret natural language prompts and generate a sequence of actions. These actions are then executed with Puppeteer, a Node.js library that provides a high-level API to control Chrome or Chromium over the DevTools Protocol. This lets the agent navigate websites, extract information, and interact with web elements programmatically.
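The flow described above (model output turned into browser actions) can be sketched roughly as follows. This is an illustrative sketch, not the project's actual code: the `Action` shape, `parseActions`, and `runActions` are hypothetical names, and it assumes the model is asked to reply with a JSON array of actions. `PageLike` lists only the subset of Puppeteer's `Page` methods used here (`goto`, `click`, `type`), so a real Puppeteer page could be passed in.

```typescript
// Hypothetical action vocabulary; the project's real schema may differ.
type Action =
  | { kind: "goto"; url: string }
  | { kind: "click"; selector: string }
  | { kind: "type"; selector: string; text: string };

// Subset of Puppeteer's Page API used by this sketch.
interface PageLike {
  goto(url: string): Promise<unknown>;
  click(selector: string): Promise<void>;
  type(selector: string, text: string): Promise<void>;
}

// Validate the model's JSON reply before anything touches the browser.
function parseActions(reply: string): Action[] {
  const data: unknown = JSON.parse(reply);
  if (!Array.isArray(data)) {
    throw new Error("expected a JSON array of actions");
  }
  return data.map((item: unknown, i: number): Action => {
    if (typeof item !== "object" || item === null) {
      throw new Error(`invalid action at index ${i}`);
    }
    const { kind, url, selector, text } = item as Record<string, unknown>;
    if (kind === "goto" && typeof url === "string") {
      return { kind: "goto", url };
    }
    if (kind === "click" && typeof selector === "string") {
      return { kind: "click", selector };
    }
    if (kind === "type" && typeof selector === "string" && typeof text === "string") {
      return { kind: "type", selector, text };
    }
    throw new Error(`invalid action at index ${i}`);
  });
}

// Execute the actions one at a time, in the order the model produced them.
async function runActions(page: PageLike, actions: Action[]): Promise<void> {
  for (const action of actions) {
    switch (action.kind) {
      case "goto":
        await page.goto(action.url);
        break;
      case "click":
        await page.click(action.selector);
        break;
      case "type":
        await page.type(action.selector, action.text);
        break;
    }
  }
}
```

Rejecting unknown action kinds before execution keeps a malformed model reply from reaching the browser; whether the project validates output this way is not stated in the source.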
Quick Start & Requirements
1. Install dependencies: npm install
2. Create a .env file from .env.template, and add OPENAI_KEY and DATABASE_URL (Postgres).
3. Run the initial database migration: npx prisma migrate dev --name init
4. Start the project: npm run dev
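A small pre-flight check can catch a missing or malformed .env entry before the migration or the app fails with a less obvious error. The sketch below is an assumption-laden illustration: `missingEnvVars` and `looksLikePostgresUrl` are hypothetical helpers, not part of the project, and only the two variable names come from the setup steps.

```typescript
// Variable names taken from the setup steps; everything else is illustrative.
const REQUIRED_VARS = ["OPENAI_KEY", "DATABASE_URL"] as const;

// Return the names of required variables that are unset or blank.
function missingEnvVars(env: Record<string, string | undefined>): string[] {
  return REQUIRED_VARS.filter((name) => !env[name]?.trim());
}

// Rough shape check: Postgres connection strings start with one of these schemes.
function looksLikePostgresUrl(value: string): boolean {
  return value.startsWith("postgresql://") || value.startsWith("postgres://");
}
```

In a Node.js entry point, this could be called as `missingEnvVars(process.env)` after the .env file has been loaded, exiting early with a message that lists whatever is missing.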
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
The project is in its early stages, with sparse documentation and potentially messy code. Contributing guidelines mention that it's a work in progress.