AutoWebGLM  by THUDM

LLM-based web navigating agent (KDD'24)

created 1 year ago
871 stars

Top 42.1% on sourcepulse

GitHubView on GitHub
Project Summary

AutoWebGLM is an LLM-based agent designed for efficient automated web navigation. It targets researchers and developers building AI agents that interact with the web, offering improved webpage comprehension and task execution through novel algorithms and training methodologies.

How It Works

AutoWebGLM leverages the ChatGLM3-6B model, enhancing its web navigation capabilities with an HTML simplification algorithm that mimics human browsing to make pages more digestible for LLMs. It employs a hybrid human-AI training approach using curated web browsing data and utilizes reinforcement learning with rejection sampling to boost webpage comprehension, browser operation efficiency, and task decomposition.

Quick Start & Requirements

  • Install/Run: Refer to the ChatGLM3-6B repository for inference code. Evaluation code is provided.
  • Prerequisites: Requires modifications to WebArena and MiniWob++ environments.
  • Resources: Evaluation datasets available at AutoWebBench and Mind2Web.
  • Links: AutoWebBench, Mind2Web, ChatGLM3-6B, WebArena, MiniWob++

Highlighted Details

  • Introduces a novel HTML simplification algorithm inspired by human browsing patterns.
  • Features a hybrid human-AI training strategy for web browsing data.
  • Utilizes reinforcement learning and rejection sampling for enhanced agent performance.
  • Includes AutoWebBench, a bilingual benchmark for evaluating web navigation agents.

Maintenance & Community

The project is associated with THUDM and has a KDD'24 publication. Further development is encouraged by starring the repository.

Licensing & Compatibility

Licensed under Apache-2.0. Open-sourced data is for research purposes only.

Limitations & Caveats

The project requires modifications to existing environments (WebArena, MiniWob++), and the inference code is dependent on the separate ChatGLM3-6B repository.

Health Check
Last commit

10 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
0
Star History
18 stars in the last 90 days

Explore Similar Projects

Starred by Ying Sheng Ying Sheng(Author of SGLang), Jiayi Pan Jiayi Pan(Author of SWE-Gym; AI Researcher at UC Berkeley), and
1 more.

webarena by web-arena-x

1.1%
1k
Web environment for autonomous agent development
created 2 years ago
updated 5 months ago
Feedback? Help us improve.