Awesome-GUI-Agent  by showlab

GUI agent resource list

created 1 year ago
807 stars

Top 44.7% on sourcepulse

GitHubView on GitHub
Project Summary

This repository is a curated list of papers, projects, and resources focused on multi-modal Graphical User Interface (GUI) agents. It serves as a comprehensive knowledge base for researchers and developers aiming to build sophisticated digital assistants capable of interacting with graphical interfaces across various platforms like desktops and mobile devices.

How It Works

The project acts as a central hub, aggregating and categorizing academic papers, open-source projects, and datasets relevant to GUI agents. It covers key areas such as datasets and benchmarks for evaluating agent performance, specific models and agent architectures, and survey papers that provide broader overviews of the field. The goal is to facilitate the development of more capable and generalist AI agents that can understand and manipulate GUIs.

Quick Start & Requirements

This repository is a curated list and does not have a direct installation or execution command. It serves as a reference guide.

Highlighted Details

  • Extensive catalog of over 100 papers and benchmarks related to GUI agents, spanning from 2017 to late 2024/early 2025.
  • Categorization into Datasets/Benchmarks, Models/Agents, Surveys, and Projects for easy navigation.
  • Includes a functional "Awesome-Paper-Agent" demo that processes arXiv URLs to extract and format paper information.
  • Features a "Projects" section listing relevant open-source tools and frameworks for building GUI agents.

Maintenance & Community

The project is actively maintained and welcomes contributions via issues and pull requests. It references templates from "Awesome-Video-Diffusion" and "Awesome-MLLM-Hallucination."

Licensing & Compatibility

The repository itself does not specify a license, but the listed papers and projects will have their own respective licenses. Compatibility for commercial use would depend on the licenses of the individual resources cited.

Limitations & Caveats

As a curated list, the repository does not provide any implementation or code for GUI agents itself. The utility is purely informational, requiring users to explore and integrate the cited resources independently.

Health Check
Last commit

2 months ago

Responsiveness

Inactive

Pull Requests (30d)
0
Issues (30d)
1
Star History
163 stars in the last 90 days

Explore Similar Projects

Feedback? Help us improve.