Alpine Ridge

executed · arXiv blob/4490826

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents

cs.LG · 2026-06-01 · unverdicted · novelty 6.0

OpenWebRL trains a 4B visual web agent with online RL on live sites using 0.4K init trajectories and 2.2K RL tasks to reach 67% success on Online-Mind2Web and 64% on DeepShop, outperforming prior open agents.

citing papers explorer

Showing 1 of 1 citing paper after filters.

OpenWebRL: Demystifying Online Multi-turn Reinforcement Learning for Visual Web Agents cs.LG · 2026-06-01 · unverdicted · none · ref 66
OpenWebRL trains a 4B visual web agent with online RL on live sites using 0.4K init trajectories and 2.2K RL tasks to reach 67% success on Online-Mind2Web and 64% on DeepShop, outperforming prior open agents.

Alpine Ridge

fields

years

verdicts

representative citing papers

citing papers explorer