gui-agents topic

List of gui-agents repositories

Awesome-GUI-Agent

1.0k
Stars
57
Forks
1.0k
Watchers

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

Agent-S

9.2k
Stars
1.0k
Forks
9.2k
Watchers

Agent S: an open agentic framework that uses computers like a human

UI-TARS-desktop

23.9k
Stars
2.3k
Forks
23.9k
Watchers

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

Screen-Point-and-Read

28
Stars
3
Forks
28
Watchers

Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"

gelab-zero

1.8k
Stars
145
Forks
1.8k
Watchers

GELab: GUI Exploration Lab. One of the best GUI agent solutions in the galaxy, built by the StepFun-GELab team and powered by Step’s research capabilities.

ScaleCUA

1.0k
Stars
65
Forks
1.0k
Watchers

ScaleCUA is a set of open-source computer use agents that can operate across platforms (Windows, macOS, Ubuntu, Android).

UGround

290
Stars
13
Forks
290
Watchers

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

UI-Ins

51
Stars
4
Forks
51
Watchers

Official implementation of UI-Ins: Enhancing GUI Grounding with Multi-Perspective Instruction-as-Reasoning

GUI-RCPO

51
Stars
0
Forks
51
Watchers

[AAAI 2026] Test-Time Reinforcement Learning for GUI Grounding via Region Consistency https://arxiv.org/abs/2508.05615

monday

32
Stars
1
Forks
32
Watchers

[CVPR 2025] Scalable Video-to-Dataset Generation for Cross-Platform Mobile Agents