inclusionAI: UI agents for precise GUI interaction
Top 62.6% on SourcePulse
UI-Venus is an open-source UI agent designed for precise GUI element grounding and navigation across mobile, desktop, and web interfaces. It leverages Reinforcement Fine-Tuning (RFT) and a novel action-level reward design to achieve state-of-the-art performance on various benchmarks, enabling more robust and generalizable autonomous UI interaction.
How It Works
UI-Venus uses Reinforcement Fine-Tuning (RFT) with fine-grained, action-wise reward functions for GUI navigation. Scoring individual actions improves credit assignment in long-horizon tasks and enables end-to-end learnable action prediction. The project also emphasizes data quality: a three-stage data refinement pipeline (Prompt Rewrite, Trace Editing, Trace Generation) improves the fidelity of the training signal.
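To make the idea of an action-wise reward concrete, here is a minimal sketch in Python. It is not the UI-Venus reward function: the `Action` class, the `action_reward` and `trajectory_rewards` helpers, the action kinds, and the 0.5/0.5 weights are all hypothetical, chosen only to show how each step of a trajectory can receive its own score instead of one score for the whole episode.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

Box = Tuple[float, float, float, float]  # (left, top, right, bottom), hypothetical layout


@dataclass
class Action:
    """Hypothetical GUI action; not a UI-Venus data structure."""
    kind: str                                    # e.g. "click", "type", "scroll"
    point: Optional[Tuple[float, float]] = None  # click coordinates, if any
    text: Optional[str] = None                   # typed text, if any


def action_reward(pred: Action, target: Action, box: Optional[Box] = None,
                  w_type: float = 0.5, w_arg: float = 0.5) -> float:
    """Score one predicted action against its reference (assumed weights)."""
    if pred.kind != target.kind:
        return 0.0                               # wrong action type earns nothing
    reward = w_type                              # partial credit for the right type
    if target.kind == "click":
        # The click counts only if it lands inside the target element's box.
        if pred.point is not None and box is not None:
            x, y = pred.point
            left, top, right, bottom = box
            if left <= x <= right and top <= y <= bottom:
                reward += w_arg
    elif target.kind == "type":
        # Typed text is compared case-insensitively, ignoring outer whitespace.
        if pred.text is not None and target.text is not None:
            if pred.text.strip().lower() == target.text.strip().lower():
                reward += w_arg
    else:
        reward += w_arg                          # no arguments to check in this sketch
    return reward


def trajectory_rewards(preds: List[Action], targets: List[Action],
                       boxes: List[Optional[Box]]) -> List[float]:
    """One reward per step, so credit is assigned per action."""
    return [action_reward(p, t, b) for p, t, b in zip(preds, targets, boxes)]
```

Because each step is scored separately, a trajectory that clicks the right element but then types the wrong text still receives a signal for the correct click, which is the credit-assignment benefit described above.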
Quick Start & Requirements
Install: pip install -r requirements.txt
Dependencies: listed in requirements.txt.
Configuration: screenspot_imgs, screenspot_test, model_name_or_path, log_path (for grounding) or model_path, input_file, output_file (for navigation).
Examples: examples/ directory.
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
2 months ago
Inactive