Training language models to follow instructions with human feedback

Long Ouyang et al. · 2022

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

cs.AI · 2026-05-17 · unverdicted · novelty 6.0

VLA driving models show 42.5% reasoning fidelity and 48.3% reasoning-action consistency, with 97.7% trajectory fragility under perturbations.

Is VLA Reasoning Faithful? Probing Safety of Chain-of-Causation in Autonomous Driving Models · cs.AI · 2026-05-17 · unverdicted · none · ref 9