pith. sign in

arxiv: 1902.01909 · v1 · pith:U6FMIYV3new · submitted 2019-02-05 · 💻 cs.RO · cs.AI· cs.LG· stat.ML

Adaptive Stress Testing for Autonomous Vehicles

classification 💻 cs.RO cs.AIcs.LGstat.ML
keywords scenariosvehiclefindapproachautonomouscarlocollisiondecision
0
0 comments X
read the original abstract

This paper presents a method for testing the decision making systems of autonomous vehicles. Our approach involves perturbing stochastic elements in the vehicle's environment until the vehicle is involved in a collision. Instead of applying direct Monte Carlo sampling to find collision scenarios, we formulate the problem as a Markov decision process and use reinforcement learning algorithms to find the most likely failure scenarios. This paper presents Monte Carlo Tree Search (MCTS) and Deep Reinforcement Learning (DRL) solutions that can scale to large environments. We show that DRL can find more likely failure scenarios than MCTS with fewer calls to the simulator. A simulation scenario involving a vehicle approaching a crosswalk is used to validate the framework. Our proposed approach is very general and can be easily applied to other scenarios given the appropriate models of the vehicle and the environment.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Planetary Exploration 3.0: A Roadmap for Software-Defined, Radically Adaptive Space Systems

    astro-ph.IM 2026-04 unverdicted novelty 5.0

    Planetary Exploration 3.0 proposes single adaptive missions that perform both initial exploration and follow-on science on unvisited worlds using software-defined space systems.

  2. Adversarial Stress Testing of SPARK Humanoid Safety Filters

    cs.RO 2026-05 unverdicted novelty 4.0

    Replicates SPARK humanoid safety filters and stress-tests them under crowding, noise, and delays, showing trade-offs in goal tracking versus collision reduction.