pith. sign in

Q-transformer: Scalable offline reinforcement learning via autoregressive q-functions

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.RO 2

years

2026 2

representative citing papers

WorldVLN: Autoregressive World Action Model for Aerial Vision-Language Navigation

cs.RO · 2026-05-15 · unverdicted · novelty 7.0

WorldVLN proposes the first autoregressive world action model for aerial vision-language navigation that predicts short-horizon latent world states, decodes them to waypoints in closed loop, and uses two-stage training with Action-aware GRPO to achieve over 12% success-rate gains on benchmarks plus零

citing papers explorer

Showing 2 of 2 citing papers.