Agentic reward modeling: Verifying gui agent via online proactive interaction.arXiv preprint arXiv:2602.00575

Chaoqun Cui, Jing Huang, Shijing Wang, Liming Zheng, Qingchao Kong, Zhixiong Zeng · arXiv 2602.00575

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

OpenComputer: Verifiable Software Worlds for Computer-Use Agents

cs.AI · 2026-05-19 · unverdicted · novelty 6.0

OpenComputer introduces a verifier-grounded framework with state verifiers, self-evolving layers, task synthesis, and auditable evaluation for 33 desktop apps and 1000 tasks to support computer-use AI agents.

GUI Agents with Reinforcement Learning: Toward Digital Inhabitants

cs.AI · 2026-04-30 · unverdicted · novelty 5.0

The paper delivers the first comprehensive overview of RL for GUI agents, organizing methods into offline, online, and hybrid strategies while analyzing trends in rewards, efficiency, and deliberation to outline a future roadmap.

citing papers explorer

Showing 2 of 2 citing papers.

OpenComputer: Verifiable Software Worlds for Computer-Use Agents cs.AI · 2026-05-19 · unverdicted · none · ref 5
OpenComputer introduces a verifier-grounded framework with state verifiers, self-evolving layers, task synthesis, and auditable evaluation for 33 desktop apps and 1000 tasks to support computer-use AI agents.
GUI Agents with Reinforcement Learning: Toward Digital Inhabitants cs.AI · 2026-04-30 · unverdicted · none · ref 15
The paper delivers the first comprehensive overview of RL for GUI agents, organizing methods into offline, online, and hybrid strategies while analyzing trends in rewards, efficiency, and deliberation to outline a future roadmap.

Agentic reward modeling: Verifying gui agent via online proactive interaction.arXiv preprint arXiv:2602.00575

fields

years

verdicts

representative citing papers

citing papers explorer