arxiv: 1811.07209 · v4 · pith:DUNAXIT3new · submitted 2018-11-17 · stat.ML · cs.LG

A Statistical Approach to Assessing Neural Network Robustness

Stefan Webb, Tom Rainforth, Yee Whye Teh, M. Pawan Kumar

keywords approach, formal, framework, network, networks, probability, property, provides

We present a new approach to assessing the robustness of neural networks based on estimating the proportion of inputs for which a property is violated. Specifically, we estimate the probability of the event that the property is violated under an input model. Our approach critically varies from the formal verification framework in that when the property can be violated, it provides an informative notion of how robust the network is, rather than just the conventional assertion that the network is not verifiable. Furthermore, it provides an ability to scale to larger networks than formal verification approaches. Though the framework still provides a formal guarantee of satisfiability whenever it successfully finds one or more violations, these advantages do come at the cost of only providing a statistical estimate of unsatisfiability whenever no violation is found. Key to the practical success of our approach is an adaptation of multi-level splitting, a Monte Carlo approach for estimating the probability of rare events, to our statistical robustness framework. We demonstrate that our approach is able to emulate formal verification procedures on benchmark problems, while scaling to larger networks and providing reliable additional information in the form of accurate estimates of the violation probability.
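
The abstract's central idea, estimating a rare violation probability with multi-level splitting, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `multilevel_splitting`, its parameters, the standard Gaussian input model, and the preconditioned Crank-Nicolson proposal used for the Metropolis-Hastings moves are all assumptions chosen so that the example stays short. Here `score(x) >= 0` is taken to mean "the property is violated at input x".

```python
import math
import numpy as np

def multilevel_splitting(score, dim, n=2000, rho=0.1, mh_steps=30,
                         step=0.3, seed=0, max_levels=200):
    """Estimate P(score(x) >= 0) for x ~ N(0, I_dim).

    Hypothetical sketch: all names and defaults are illustrative assumptions.
    score maps an (n, dim) array to an (n,) array of property scores.
    """
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((n, dim))      # samples from the input model
    s = score(x)
    log_p = 0.0
    for _ in range(max_levels):
        # Next level: the (1 - rho) quantile of the scores, capped at 0.
        level = min(float(np.quantile(s, 1.0 - rho)), 0.0)
        alive = s >= level
        log_p += math.log(alive.mean())    # conditional survival fraction
        if level >= 0.0:                   # reached the violation event
            return math.exp(log_p)
        # Resample the population from the survivors.
        idx = rng.choice(np.flatnonzero(alive), size=n)
        x, s = x[idx], s[idx]
        # Metropolis-Hastings moves targeting N(0, I) restricted to
        # {score >= level}. The proposal below is reversible with respect
        # to N(0, I), so a move is accepted iff it stays above the level.
        for _ in range(mh_steps):
            prop = math.sqrt(1.0 - step**2) * x + step * rng.standard_normal(x.shape)
            sp = score(prop)
            accept = sp >= level
            x[accept] = prop[accept]
            s[accept] = sp[accept]
    return 0.0  # event not reached within max_levels

if __name__ == "__main__":
    # Toy "property": violated when the first coordinate exceeds 4.
    toy_score = lambda x: x[:, 0] - 4.0
    exact = 0.5 * math.erfc(4.0 / math.sqrt(2.0))
    print("estimate:", multilevel_splitting(toy_score, dim=2))
    print("exact:   ", exact)
```

In this toy setting the exact probability is known in closed form, which makes it easy to compare against the estimate; plain Monte Carlo with the same number of samples would almost never observe a violation at this probability level. For a real network, `score` would be replaced by a function that is non-negative exactly when the property of interest is violated.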

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Local Linearity of LLMs Enables Activation Steering via Model-Based Linear Optimal Control
cs.LG 2026-04 conditional novelty 7.0

Local linearity of LLM layers enables LQR-based closed-loop activation steering with theoretical tracking guarantees.

Bounding the Black Box: A Statistical Certification Framework for AI Risk Regulation
cs.AI 2026-04 unverdicted novelty 6.0

The paper introduces RoMA and gRoMA as statistical tools that compute auditable upper bounds on the failure probability of any black-box AI system once regulators fix an acceptable risk threshold and input domain.