Adaptive Stress Testing for Autonomous Vehicles

Mark Koren; Mykel J. Kochenderfer; Ritchie Lee; Saud Alsaif

arxiv: 1902.01909 · v1 · pith:U6FMIYV3new · submitted 2019-02-05 · 💻 cs.RO · cs.AI· cs.LG· stat.ML

Adaptive Stress Testing for Autonomous Vehicles

Mark Koren , Saud Alsaif , Ritchie Lee , Mykel J. Kochenderfer This is my paper

classification 💻 cs.RO cs.AIcs.LGstat.ML

keywords scenariosvehiclefindapproachautonomouscarlocollisiondecision

0 comments

read the original abstract

This paper presents a method for testing the decision making systems of autonomous vehicles. Our approach involves perturbing stochastic elements in the vehicle's environment until the vehicle is involved in a collision. Instead of applying direct Monte Carlo sampling to find collision scenarios, we formulate the problem as a Markov decision process and use reinforcement learning algorithms to find the most likely failure scenarios. This paper presents Monte Carlo Tree Search (MCTS) and Deep Reinforcement Learning (DRL) solutions that can scale to large environments. We show that DRL can find more likely failure scenarios than MCTS with fewer calls to the simulator. A simulation scenario involving a vehicle approaching a crosswalk is used to validate the framework. Our proposed approach is very general and can be easily applied to other scenarios given the appropriate models of the vehicle and the environment.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Planetary Exploration 3.0: A Roadmap for Software-Defined, Radically Adaptive Space Systems
astro-ph.IM 2026-04 unverdicted novelty 5.0

Planetary Exploration 3.0 proposes single adaptive missions that perform both initial exploration and follow-on science on unvisited worlds using software-defined space systems.
Adversarial Stress Testing of SPARK Humanoid Safety Filters
cs.RO 2026-05 unverdicted novelty 4.0

Replicates SPARK humanoid safety filters and stress-tests them under crowding, noise, and delays, showing trade-offs in goal tracking versus collision reduction.