pith. sign in

hub Mixed citations

Mobile-agent-v3

Mixed citation behavior. Most common role is background (57%).

28 Pith papers citing it
Background 57% of classified citations

hub tools

citation-role summary

background 4 baseline 2 method 1

citation-polarity summary

years

2026 28

clear filters

representative citing papers

A History-Aware Visually Grounded Critic for Computer Use Agents

cs.AI · 2026-06-09 · unverdicted · novelty 7.0

HiViG is a test-time critic that combines macro-action history summarization with visual grounding of execution coordinates to reduce short-sighted and visually erroneous actions in long-horizon GUI agents.

ScaleWoB: Guiding GUI Agents with Coding Agents via Large-Scale Environmental Synthesis

cs.AI · 2026-05-24 · unverdicted · novelty 7.0

ScaleWoB generates 100+ synthetic interactive GUI environments and 1000+ verifiable tasks as web pages, releasing a 120-task mobile benchmark where state-of-the-art agents achieve 27.92% success (17.82% on long-horizon tasks) versus 92.08% for humans, with synthetic results generalizing to real apps

Benchmarking and Improving GUI Agents in High-Dynamic Environments

cs.CV · 2026-04-28 · unverdicted · novelty 7.0 · 2 refs

DynamicUI improves GUI agent performance in high-dynamic environments by processing interaction videos with frame clustering, action-conditioned refinement, and reflection, outperforming prior approaches on the new DynamicGUIBench spanning ten applications.

Skill-Guided Continuation Distillation for GUI Agents

cs.AI · 2026-06-17 · unverdicted · novelty 6.0

SGCD generates supervision for off-trajectory states in GUI agents by mixing expert trajectories with continuations produced by a skill-guided policy after the base policy reaches those states.

Beyond the GUI Paradigm: Do Mobile Agents Need the Phone Screen?

cs.SE · 2026-06-16 · conditional · novelty 6.0

CLI-based coding agents outperform GUI baselines on AndroidWorld and MobileWorld, with oracles reaching 88.8% and 86.3% solvability and a new CLI-Advantage suite showing CLI superiority in bulk operations, filtering, aggregation, cross-app workflows, and hidden state.

PhoneWorld: Scaling Phone-Use Agent Environments

cs.CL · 2026-05-28 · unverdicted · novelty 6.0

PhoneWorld is a pipeline that converts real mobile trajectories into scalable controllable environments, yielding large gains on four benchmarks when used to supplement training data.

OpenComputer: Verifiable Software Worlds for Computer-Use Agents

cs.AI · 2026-05-19 · unverdicted · novelty 6.0

OpenComputer introduces a verifier-grounded framework with state verifiers, self-evolving layers, task synthesis, and auditable evaluation for 33 desktop apps and 1000 tasks to support computer-use AI agents.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.