A reduction of imitation learning and structured prediction to no-regret online learning

Stephane Ross, Geoffrey Gordon, Drew Bagnell · 2011

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

RL Token: Bootstrapping Online RL with Vision-Language-Action Models

cs.LG · 2026-04-24 · unverdicted · novelty 6.0

RL Token enables sample-efficient online RL fine-tuning of large VLAs, delivering up to 3x speed gains and higher success rates on real-robot manipulation tasks within minutes to hours.

citing papers explorer

Showing 1 of 1 citing paper.

RL Token: Bootstrapping Online RL with Vision-Language-Action Models

cs.LG · 2026-04-24 · unverdicted · none · ref 44
