Discover and explore top open-source AI tools and projects—updated daily.
UI agents for precise GUI interaction
Top 63.9% on SourcePulse
UI-Venus is an open-source UI agent designed for precise GUI element grounding and navigation across mobile, desktop, and web interfaces. It leverages Reinforcement Fine-Tuning (RFT) and a novel action-level reward design to achieve state-of-the-art performance on various benchmarks, enabling more robust and generalizable autonomous UI interaction.
How It Works
UI-Venus utilizes Reinforcement Fine-Tuning (RFT) with fine-grained, action-wise reward functions for GUI navigation. This approach improves credit assignment in long-horizon tasks and enables end-to-end learnable action prediction. The project also emphasizes data quality, employing a three-stage data refinement pipeline (Prompt Rewrite, Trace Editing, Trace Generation) to enhance training signal fidelity, leading to more robust and generalizable agents.
Quick Start & Requirements
pip install -r requirements.txt
requirements.txt
.screenspot_imgs
, screenspot_test
, model_name_or_path
, log_path
(for grounding) or model_path
, input_file
, output_file
(for navigation).examples/
directory.Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
3 weeks ago
Inactive