Webrl: Training llm web agents via self-evolving online curriculum reinforcement learning, 2025

Zehan Qi, Xiao Liu, Iat Long Iong, Hanyu Lai, Xueqiao Sun, Wenyi Zhao, Yu Yang, Xinyue Yang, Jiadai Sun, Shuntian Yao, Tianjie Zhang, Wei Xu, Jie Tang, Yuxiao Dong · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Gym-Anything: Turn any Software into an Agent Environment

cs.LG · 2026-04-07 · unverdicted · novelty 6.0

Gym-Anything turns arbitrary software into agent environments via multi-agent setup and auditing, creating CUA-World with 10K+ long-horizon tasks and showing that trajectory distillation plus test-time auditing improves small VLMs.

citing papers explorer

Showing 1 of 1 citing paper.

Gym-Anything: Turn any Software into an Agent Environment cs.LG · 2026-04-07 · unverdicted · none · ref 35
Gym-Anything turns arbitrary software into agent environments via multi-agent setup and auditing, creating CUA-World with 10K+ long-horizon tasks and showing that trajectory distillation plus test-time auditing improves small VLMs.

Webrl: Training llm web agents via self-evolving online curriculum reinforcement learning, 2025

fields

years

verdicts

representative citing papers

citing papers explorer