Surprise Potential as a Measure of Interactivity in Driving Scenarios

Karen Leung; Marco Pavone; Sushant Veer; Wenhao Ding; Yulong Cao

arxiv: 2502.05677 · v1 · pith:P7NNSCZEnew · submitted 2025-02-08 · 💻 cs.RO · cs.LG

Wenhao Ding, Sushant Veer, Karen Leung, Yulong Cao, Marco Pavone

keywords: potential, surprise, scenarios, driving, interactive, logs, measure, design

Abstract

Validating the safety and performance of an autonomous vehicle (AV) requires benchmarking on real-world driving logs. However, typical driving logs contain mostly uneventful scenarios with minimal interactions between road users. Identifying interactive scenarios in real-world driving logs enables the curation of datasets that amplify critical signals and provide a more accurate assessment of an AV's performance. In this paper, we present a novel metric that identifies interactive scenarios by measuring an AV's surprise potential on others. First, we identify three dimensions of the design space to describe a family of surprise potential measures. Second, we exhaustively evaluate and compare different instantiations of the surprise potential measure within this design space on the nuScenes dataset. To determine how well a surprise potential measure correctly identifies an interactive scenario, we use a reward model learned from human preferences to assess alignment with human intuition. Our proposed surprise potential, arising from this exhaustive comparative study, achieves a correlation of more than 0.82 with the human-aligned reward function, outperforming existing approaches. Lastly, we validate motion planners on curated interactive scenarios to demonstrate downstream applications.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Zero-Label Driving Scenario Complexity Detection via Joint Embedding Predictive Architecture
cs.CV · 2026-06 · unverdicted · novelty 5.0

A self-supervised JEPA model on nuPlan data uses temporal prediction error to score driving scenario complexity without labels, assigning higher scores to turns and pedestrian interactions and achieving AP 0.512 in an...