LLM-based web navigating agent (KDD'24)
Top 42.1% on sourcepulse
AutoWebGLM is an LLM-based agent designed for efficient automated web navigation. It targets researchers and developers building AI agents that interact with the web, offering improved webpage comprehension and task execution through novel algorithms and training methodologies.
How It Works
AutoWebGLM leverages the ChatGLM3-6B model, enhancing its web navigation capabilities with an HTML simplification algorithm that mimics human browsing to make pages more digestible for LLMs. It employs a hybrid human-AI training approach using curated web browsing data and utilizes reinforcement learning with rejection sampling to boost webpage comprehension, browser operation efficiency, and task decomposition.
Quick Start & Requirements
Highlighted Details
Maintenance & Community
The project is associated with THUDM and has a KDD'24 publication. Further development is encouraged by starring the repository.
Licensing & Compatibility
Licensed under Apache-2.0. Open-sourced data is for research purposes only.
Limitations & Caveats
The project requires modifications to existing environments (WebArena, MiniWob++), and the inference code is dependent on the separate ChatGLM3-6B repository.
10 months ago
Inactive