pith. sign in

Baseline reference

Impromptu vla: Open weights and open data for driving vision-language- action models

Baseline reference. 60% of citing Pith papers use this work as a benchmark or comparison.

14 Pith papers citing it
Baseline 60% of classified citations

citation-role summary

baseline 3 background 2

citation-polarity summary

fields

cs.CV 10 cs.RO 4

years

2026 13 2025 1

clear filters

representative citing papers

Grounding Driving VLA via Inverse Kinematics

cs.CV · 2026-05-20 · conditional · novelty 7.0

By adding future visual state prediction and a dedicated inverse kinematics diffusion network that uses only visual boundary conditions, a 0.5B driving VLA recovers visual grounding and matches 7-8B models on NAVSIM-v2 and nuScenes.

citing papers explorer

Showing 2 of 2 citing papers after filters.

  • Grounding Driving VLA via Inverse Kinematics cs.CV · 2026-05-20 · conditional · none · ref 2

    By adding future visual state prediction and a dedicated inverse kinematics diffusion network that uses only visual boundary conditions, a 0.5B driving VLA recovers visual grounding and matches 7-8B models on NAVSIM-v2 and nuScenes.

  • CLOVER: Closed-Loop Value Estimation and Ranking for End-to-End Autonomous Driving Planning cs.RO · 2026-05-14 · conditional · none · ref 6

    CLOVER is a closed-loop generator-scorer framework that expands proposal coverage with pseudo-expert trajectories and performs conservative self-distillation to achieve state-of-the-art planning scores on NAVSIM and nuScenes.