Efficient Preimage Approximation for Neural Network Certification

Anton Bj\"orklund; Marta Kwiatkowska; Mykola Zaitsev; Paolo Morettin

arxiv: 2505.22798 · v3 · submitted 2025-05-28 · 💻 cs.LG · cs.AI· cs.CR

Efficient Preimage Approximation for Neural Network Certification

Anton Bj\"orklund , Mykola Zaitsev , Paolo Morettin , Marta Kwiatkowska This is my paper

Pith reviewed 2026-05-19 12:07 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.CR

keywords neural network certificationpreimage approximationrobustness verificationconvolutional neural networksadversarial patch attacksMonte Carlo samplingbranching heuristics

0 comments

The pith

PREMAP2 extends preimage approximation to scale for convolutional networks and patch attacks in certification.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents PREMAP2 as a set of extensions to the existing PREMAP approach for approximating the preimage of neural network specifications. These extensions include improved branching heuristics, adaptive Monte Carlo sampling, and reverse bound propagation, plus added support for non-uniform priors and confidence intervals. The goal is to make preimage-based certification practical for larger and more complex models beyond moderate fully connected networks. If successful, this would let practitioners estimate the fraction of inputs that meet a given property, such as robustness to adversarial patches, instead of relying solely on worst-case output bounds. Demonstrations cover certifying reliability, robustness, interpretability, and fairness on vision and control tasks.

Core claim

PREMAP2 is a collection of algorithmic extensions to PREMAP that enhance its scalability and efficiency through improved branching heuristics, adaptive Monte Carlo sampling, and reverse bound propagation. The method also adds support for non-uniform priors and confidence intervals. These changes make it possible to apply preimage approximation to previously intractable cases, including real-world patch attacks against convolutional neural networks where parts of images are obscured by stickers or lighting.

What carries the argument

PREMAP2, the extended preimage approximation procedure that combines improved branching heuristics, adaptive Monte Carlo sampling, and reverse bound propagation to estimate the proportion of inputs satisfying a specification.

If this is right

Certification of reliability, robustness, interpretability, and fairness becomes feasible for convolutional architectures used in computer vision.
Real-world patch attacks, such as adversarial stickers or lighting changes on images, can be analyzed by estimating the fraction of affected inputs.
Non-uniform input distributions can be handled directly in the preimage estimation process.
Confidence intervals around the estimated proportions provide quantitative guarantees on the certification results.
The approach applies across domains from computer vision to control tasks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Pairing preimage proportion estimates with traditional output-bounding verifiers could give both worst-case and average-case style guarantees in one workflow.
The same extensions might transfer to other verification tasks that rely on sampling or branching over input spaces.
Wider use could support regulatory requirements for documenting the measurable robustness of deployed models.

Load-bearing premise

The specific algorithmic extensions of improved branching heuristics, adaptive Monte Carlo sampling, and reverse bound propagation will produce scalability and efficiency gains on convolutional networks and patch attacks while keeping the preimage proportion estimates correct.

What would settle it

Running PREMAP2 on a standard convolutional network subject to a realistic patch attack and finding that the estimated preimage proportion is statistically inconsistent with exhaustive enumeration on a smaller analogous problem would falsify the scalability claim.

read the original abstract

The growing reliance on artificial intelligence in safety- and security-critical applications is raising concerns about the robustness of neural networks to erroneous or adversarial input. Certification is a methodology for ensuring model trustworthiness by providing formal guarantees on model behaviour. While most verification methods focus on worst-case analysis by bounding the network output, an alternative approach based on approximating the preimage can complement such analysis by estimating the proportion of inputs that satisfy a given specification. However, existing preimage-based methods, such as the state-of-the-art PREMAP, are limited to fully connected neural networks of moderate dimensionality. In this paper, we introduce PREMAP2, a collection of algorithmic extensions to PREMAP that enhance its scalability and efficiency through improved branching heuristics, adaptive Monte Carlo sampling, and reverse bound propagation. We further endow PREMAP2 with additional functionality such as support for non-uniform priors and confidence intervals. These advances enable the application of PREMAP2 to previously intractable settings, including real-world patch attacks against convolutional neural networks, where adversarial stickers or lighting conditions obscure parts of images. We showcase the effectiveness of our approach across several use cases, including certifying reliability, robustness, interpretability, and fairness, on domains ranging from computer vision to control tasks. Our implementation is available as open-source software.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

PREMAP2 adds branching heuristics, adaptive sampling, and reverse propagation to the original PREMAP so it can reach CNNs and patch attacks, but the abstract gives no way to check whether those changes actually work at scale.

read the letter

The main point is that this paper introduces PREMAP2 as an extension of the existing PREMAP method, adding improved branching heuristics, adaptive Monte Carlo sampling, reverse bound propagation, non-uniform priors, and confidence intervals to make preimage approximation work on convolutional networks and patch attacks. It does well in identifying concrete applications like certifying against real-world adversarial stickers or lighting changes in images, and it covers a range of properties including reliability, robustness, interpretability, and fairness in both vision and control domains. The open-source release of the implementation stands out as a practical step that could help others reproduce or extend the work. The soft spots come down to the fact that only the abstract is available for review. This makes it impossible to verify if the algorithmic changes actually deliver the promised scalability and efficiency without introducing errors in the preimage proportion estimates. The key assumption about these extensions preserving correctness on CNNs and patch attacks remains untested from the given text, though nothing in the description suggests circularity or major inconsistencies. This kind of paper is for verification researchers interested in preimage-based certification as a complement to standard bounding techniques. Readers focused on practical safety analysis for deployed AI systems, especially in computer vision, would likely find the use cases relevant if the results hold up in the full version. I would recommend putting it through peer review to get a proper look at the experiments and any formal arguments.

Referee Report

1 major / 0 minor

Summary. The paper introduces PREMAP2, a set of algorithmic extensions to the existing PREMAP preimage approximation method for neural network certification. The extensions include improved branching heuristics, adaptive Monte Carlo sampling, and reverse bound propagation, plus support for non-uniform priors and confidence intervals. These are claimed to enable scalable application to convolutional neural networks and real-world patch attacks, with demonstrations across certification use cases for reliability, robustness, interpretability, and fairness in computer vision and control tasks.

Significance. If the extensions deliver the claimed scalability and correctness on CNNs, the work would provide a useful complement to output-bounding verification techniques by estimating input proportions satisfying specifications, potentially broadening formal certification to practical adversarial settings such as patch attacks.

major comments (1)

The central claim that the algorithmic extensions enable correct and efficient preimage estimation on previously intractable CNNs and patch attacks rests on the unverified assumption that improved branching heuristics, adaptive Monte Carlo sampling, and reverse bound propagation together preserve correctness while delivering scalability gains; this cannot be assessed from the abstract alone.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their review and for highlighting the need to substantiate the central claims of PREMAP2. We address the major comment below and are happy to provide additional clarifications or revisions as needed.

read point-by-point responses

Referee: The central claim that the algorithmic extensions enable correct and efficient preimage estimation on previously intractable CNNs and patch attacks rests on the unverified assumption that improved branching heuristics, adaptive Monte Carlo sampling, and reverse bound propagation together preserve correctness while delivering scalability gains; this cannot be assessed from the abstract alone.

Authors: We agree that the abstract alone cannot fully substantiate the correctness and scalability claims. The full manuscript provides a detailed presentation of the extensions, including formal arguments that the improved branching heuristics maintain the soundness of the preimage approximation, that adaptive Monte Carlo sampling converges to the correct measure under the stated assumptions, and that reverse bound propagation yields valid over-approximations. These are accompanied by extensive empirical evaluations on convolutional networks and patch attacks that demonstrate both accuracy (via comparison to ground-truth or tighter bounds) and scalability improvements over the original PREMAP. We would be pleased to expand the relevant sections or add a dedicated correctness subsection if the referee finds the current exposition insufficient. revision: no

Circularity Check

0 steps flagged

No significant circularity detected from available text

full rationale

The abstract describes PREMAP2 as a collection of algorithmic extensions (improved branching heuristics, adaptive Monte Carlo sampling, reverse bound propagation) to an existing method, plus added functionality for non-uniform priors and confidence intervals. No equations, derivations, or self-referential definitions are presented. Claims of enabling new applications rest on these extensions without reducing any result to a fitted parameter or prior self-citation by construction. With only the abstract available, no load-bearing step can be quoted that exhibits circularity, so the description remains self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review yields no identifiable free parameters, axioms, or invented entities; the work appears to build on standard neural network verification assumptions such as bounded input domains and network Lipschitz continuity, but these are not detailed.

pith-pipeline@v0.9.0 · 5735 in / 1051 out tokens · 46310 ms · 2026-05-19T12:07:41.911573+00:00 · methodology

Efficient Preimage Approximation for Neural Network Certification

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)