Convex methods for constrained linear bandits

· 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

On-Policy Distillation of Language Models for Autonomous Vehicle Motion Planning

cs.RO · 2026-04-09 · unverdicted · novelty 5.0

On-policy GKD trains 5x smaller student LLMs to nearly match large teacher performance in AV motion planning on nuScenes while beating a dense-feedback RL baseline.

citing papers explorer

Showing 1 of 1 citing paper.

On-Policy Distillation of Language Models for Autonomous Vehicle Motion Planning cs.RO · 2026-04-09 · unverdicted · none · ref 10
On-policy GKD trains 5x smaller student LLMs to nearly match large teacher performance in AV motion planning on nuScenes while beating a dense-feedback RL baseline.

Convex methods for constrained linear bandits

fields

years

verdicts

representative citing papers

citing papers explorer