browser-agent-py  by oxylabs

AI-powered browser automation for complex tasks

Created 1 month ago
647 stars

Top 51.6% on SourcePulse

Summary

Oxylabs' browser-agent-py is an AI-powered Python tool that automates complex web browsing tasks and data extraction from natural language instructions. It targets developers and power users who want to avoid the brittle selectors and hand-written scripts of traditional automation frameworks such as Selenium or Puppeteer, and who prefer a more intuitive, human-like way of interacting with dynamic web content.

How It Works

The agent interprets natural language prompts or structured step lists to execute multi-step browsing flows, including navigation, form filling, and interaction with JavaScript-rendered elements. It distinguishes itself by replacing static selectors with AI-driven understanding, allowing users to describe desired actions in plain English. The tool can then extract data into structured formats like JSON, Markdown, or HTML, or capture PNG screenshots, based on user-defined or auto-generated schemas.
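The idea of a structured step list plus an output format can be sketched in plain Python. This is a minimal illustration only: `BrowsingTask`, its fields, and the format strings are hypothetical names chosen for this example and are not part of the Oxylabs SDK.

```python
from dataclasses import dataclass
from typing import Optional

# Output kinds named in the summary; the exact string spellings are assumptions.
OUTPUT_FORMATS = {"json", "markdown", "html", "screenshot"}


@dataclass
class BrowsingTask:
    """Hypothetical container for one agent task (not an SDK class)."""

    url: str
    steps: list
    output_format: str = "json"
    schema: Optional[dict] = None

    def __post_init__(self):
        # Reject formats the summary does not mention.
        if self.output_format not in OUTPUT_FORMATS:
            raise ValueError(f"unsupported output format: {self.output_format!r}")
        # A schema describes structured data, so it is only accepted with JSON.
        if self.schema is not None and self.output_format != "json":
            raise ValueError("a schema is only meaningful with JSON output")

    def to_prompt(self) -> str:
        """Join the plain-English steps into one numbered instruction."""
        return "\n".join(f"{i}. {step}" for i, step in enumerate(self.steps, start=1))


task = BrowsingTask(
    url="https://example.com/shop",
    steps=[
        "Open the laptops category",
        "Sort the results by price, lowest first",
        "Return the name and price of the first three items",
    ],
    # Illustrative JSON Schema fragment describing the wanted fields.
    schema={
        "type": "object",
        "properties": {"name": {"type": "string"}, "price": {"type": "number"}},
    },
)
print(task.to_prompt())
```

Keeping the steps as a list rather than one long string makes it easy to reorder or reuse individual instructions before they are sent to the agent.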

Quick Start & Requirements

  • Installation: pip install oxylabs-ai-studio
  • Prerequisites: Python 3.10+ required. An Oxylabs API key is necessary; a free trial with 1000 credits is available.
  • Resources: The README links to the PyPI package page and a JavaScript SDK guide for further details.
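After installation, a first call might look like the sketch below. Treat it as an assumption-laden outline rather than the documented API: the `run(url=..., user_prompt=..., output_format=..., schema=...)` call shape, the import path in the comment, and the `OXYLABS_API_KEY` variable name are not confirmed by this summary, so check the package documentation before relying on them. A stand-in agent is used so the example runs without network access or credits.

```python
import os


def api_key_from_env(var="OXYLABS_API_KEY"):
    """Read the API key from the environment (the variable name is hypothetical)."""
    key = os.environ.get(var)
    if not key:
        raise RuntimeError(f"set {var} to your Oxylabs API key")
    return key


def run_task(agent, url, prompt, output_format="json", schema=None):
    """Run one browsing task through any object exposing a compatible run().

    The keyword names passed to run() are assumptions about the SDK.
    """
    kwargs = {"url": url, "user_prompt": prompt, "output_format": output_format}
    if schema is not None:
        kwargs["schema"] = schema
    return agent.run(**kwargs)


class EchoAgent:
    """Stand-in agent that returns its arguments instead of browsing."""

    def run(self, **kwargs):
        return kwargs


# With the real package, the agent would presumably be built along these lines
# (assumed import path and constructor, verify against the SDK docs):
#   from oxylabs_ai_studio.apps.browser_agent import BrowserAgent
#   agent = BrowserAgent(api_key=api_key_from_env())
result = run_task(
    EchoAgent(),
    url="https://example.com",
    prompt="Find the cheapest laptop and return its name and price",
)
print(result)
```

Passing the agent in as an argument keeps the wrapper testable offline and makes it simple to swap the stand-in for the real client later.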

Highlighted Details

  • AI-driven Control: Executes clicks, inputs, navigation, and scrolling via natural language.
  • Multi-step Task Automation: Supports complex browsing flows defined in plain English.
  • Flexible Outputs: Generates JSON (with schema extraction), Markdown, HTML, or PNG screenshots.
  • Dynamic Content Handling: Interacts seamlessly with JavaScript-rendered pages.
  • Use Cases: Applicable to e-commerce simulation, travel search, job scraping, and event discovery.

Maintenance & Community

Developed by Oxylabs AI Studio. Support and inquiries can be directed via hello@oxylabs.io, live chat, or their Discord community.

Licensing & Compatibility

The provided README does not specify the software license or offer compatibility notes for commercial use.

Limitations & Caveats

The agent may require advanced setup for websites employing sophisticated bot detection mechanisms. Users are responsible for ensuring compliance with target website Terms of Service and relevant laws. Post-trial usage incurs costs starting at $12/month for 3000 credits.

Health Check

  • Last commit: 1 month ago
  • Responsiveness: Inactive
  • Pull requests (30d): 0
  • Issues (30d): 0
  • Star history: 654 stars in the last 30 days
