natbot  by nat

Browser automation via GPT-3

created 2 years ago
1,929 stars

Top 23.1% on sourcepulse

GitHubView on GitHub
Project Summary

natbot enables users to control a web browser using natural language prompts, powered by GPT-3. This tool is designed for developers and researchers interested in AI-driven automation and human-computer interaction, offering a novel way to interact with the web.

How It Works

The core mechanism involves serializing the current browser's Document Object Model (DOM) into a text format. This serialized DOM, along with the user's natural language instruction, is fed into GPT-3. The model then generates a sequence of actions (e.g., clicks, typing) that are executed within the browser to fulfill the request.

Highlighted Details

  • Drives browser actions using GPT-3.
  • DOM serialization for LLM input.
  • Open to community contributions and improvements.

Maintenance & Community

This project appears to be a personal project with limited public maintenance signals. Community contributions are explicitly welcomed.

Licensing & Compatibility

The license is not specified in the provided README.

Limitations & Caveats

The project is described as having "lots of ideas for improvement," indicating it is likely in an early, experimental stage. Specific limitations include the need for better DOM serialization, prompt engineering, and agent capabilities like multi-tab support.

Health Check
Last commit

1 year ago

Responsiveness

1 week

Pull Requests (30d)
0
Issues (30d)
0
Star History
14 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.