Faster Verified Explanations for Neural Networks
Pith reviewed 2026-05-17 03:27 UTC · model grok-4.3
The pith
FaVeX speeds up verified explanations for neural networks by reusing information across verifier queries and mixing batch with sequential feature processing.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
FaVeX accelerates the computation by dynamically combining batch and sequential processing of input features, and by reusing information from previous queries, both when proving invariances with respect to certain input features, and when searching for feature assignments altering the prediction. It further introduces verifier-optimal robust explanations, a hierarchical definition that explicitly factors the incompleteness of network verifiers within the explanation.
What carries the argument
The FaVeX algorithm, which reuses prior verifier results while alternating between batch and sequential handling of input features, paired with the verifier-optimal robust explanations definition that records verifier incompleteness.
If this is right
- Verified explanations become feasible for networks orders of magnitude larger than before.
- Explanations can be produced while explicitly stating the limits of the verifier used.
- Repeated verification tasks in explanation generation share computation instead of restarting from scratch.
- Formal guarantees on feature relevance can be obtained without requiring a complete verifier.
Where Pith is reading between the lines
- The reuse mechanism suggests similar incremental verification could apply to other repeated formal checks such as robustness certification.
- The hierarchical definition opens a path to refining an explanation once a stronger verifier becomes available without restarting the entire process.
Load-bearing premise
Reusing results across separate verifier queries preserves soundness and the new hierarchical definition accurately reflects verifier incompleteness without introducing undetected errors or overclaiming robustness.
What would settle it
A concrete falsifier would be a network and input where information reuse produces an explanation claiming a feature is irrelevant when a fresh verifier call shows it is relevant, or a timing experiment on a network with over one hundred thousand activations that shows no reduction in total verification time.
read the original abstract
Verified explanations are a principled way to explain the decisions taken by neural networks, which are otherwise black-box in nature. However, these techniques face significant scalability challenges, as they require multiple calls to neural network verifiers, each of them with an exponential worst-case complexity. We present FaVeX, a novel algorithm to compute verified explanations. FaVeX accelerates the computation by dynamically combining batch and sequential processing of input features, and by reusing information from previous queries, both when proving invariances with respect to certain input features, and when searching for feature assignments altering the prediction. Furthermore, we present a novel and hierarchical definition of verified explanations, termed verifieroptimal robust explanations, that explicitly factors the incompleteness of network verifiers within the explanation. Our comprehensive experimental evaluation demonstrates the superior scalability of both FaVeX, and of verifier-optimal robust explanations, which together can produce meaningful formal explanation on networks with hundreds of thousands of non-linear activations.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces FaVeX, an algorithm for faster computation of verified explanations for neural networks. It combines dynamic batch and sequential feature processing with reuse of prior verifier query results for proving feature invariances and searching for prediction-altering assignments. A new hierarchical definition of verifier-optimal robust explanations is proposed that explicitly incorporates verifier incompleteness. Experiments claim this enables meaningful formal explanations on networks with hundreds of thousands of non-linear activations.
Significance. If the reuse mechanism is shown to preserve soundness, FaVeX would meaningfully advance scalable verified explanations, a persistent bottleneck in formal methods for neural networks. The verifier-optimal definition's explicit factoring of incompleteness is a clear strength that avoids overclaiming robustness and provides a more realistic formal guarantee.
major comments (2)
- [Section 3 (FaVeX algorithm and reuse mechanism)] The central scalability claim rests on reusing cached bounds and partial assignments across verifier queries for both invariance proofs and counterexample search, yet no invariant, lemma, or inductive argument is supplied establishing that this reuse preserves soundness of the resulting verifier-optimal robust explanations (i.e., introduces neither false positives nor missed counterexamples). This is load-bearing for the reported results on networks with hundreds of thousands of activations.
- [Section 5 (Experimental evaluation)] The experimental evaluation reports superior scalability but provides insufficient detail on baseline verifiers, exact network sizes and architectures, number of runs, or statistical controls, making it difficult to assess whether the observed speedups are robust or attributable to the proposed reuse and batch-sequential strategy.
minor comments (1)
- [Section 2 (Preliminaries and definitions)] Notation for the hierarchical levels of verifier-optimal explanations could be clarified with an explicit example or small illustrative network to distinguish the new definition from prior robust explanation notions.
Simulated Author's Rebuttal
We thank the referee for their constructive and insightful comments on our manuscript. We address each major comment in detail below and describe the revisions we intend to make.
read point-by-point responses
-
Referee: [Section 3 (FaVeX algorithm and reuse mechanism)] The central scalability claim rests on reusing cached bounds and partial assignments across verifier queries for both invariance proofs and counterexample search, yet no invariant, lemma, or inductive argument is supplied establishing that this reuse preserves soundness of the resulting verifier-optimal robust explanations (i.e., introduces neither false positives nor missed counterexamples). This is load-bearing for the reported results on networks with hundreds of thousands of activations.
Authors: We agree that a formal argument establishing the soundness of the reuse mechanism is necessary to support the central claims. Section 3 describes the dynamic batch-sequential processing and the reuse of cached bounds and partial assignments for invariance proofs and counterexample search, but does not include an explicit lemma or inductive invariant. In the revised manuscript we will add a new lemma (with proof) in Section 3 that shows the reuse operations preserve the soundness of the verifier-optimal robust explanations, ensuring neither false positives nor missed counterexamples are introduced. revision: yes
-
Referee: [Section 5 (Experimental evaluation)] The experimental evaluation reports superior scalability but provides insufficient detail on baseline verifiers, exact network sizes and architectures, number of runs, or statistical controls, making it difficult to assess whether the observed speedups are robust or attributable to the proposed reuse and batch-sequential strategy.
Authors: We acknowledge that additional experimental details are required for reproducibility and to substantiate the reported speedups. In the revised Section 5 we will expand the description to specify the exact baseline verifiers and their configurations, provide precise network architectures (number of layers, neurons, and non-linear activations), state the number of independent runs performed for each experiment, and include statistical measures such as standard deviations or confidence intervals. revision: yes
Circularity Check
No circularity in algorithmic design or new definition
full rationale
The paper presents FaVeX as a novel algorithm combining batch and sequential feature processing with reuse of prior verifier query results, plus a hierarchical verifier-optimal robust explanations definition that factors in verifier incompleteness. These are constructive algorithmic and definitional contributions evaluated experimentally on large networks. No equations, derivations, or claims in the abstract or described content reduce any result to fitted parameters, self-definitions, or load-bearing self-citations by construction. The work is self-contained as a practical acceleration technique without tautological reductions.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Neural network verifiers are sound but incomplete
invented entities (1)
-
verifier-optimal robust explanations
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean (AbsoluteFloorWitness)reality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
verifier-optimal robust explanations... explicitly factors the incompleteness of network verifiers within the explanation
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[2]
2 Ross Anderson, Joey Huchette, Will Ma, Christian Tjandraatmadja, and Juan Pablo Vielma
URL: https://arxiv.org/abs/2109.10317. 2 Ross Anderson, Joey Huchette, Will Ma, Christian Tjandraatmadja, and Juan Pablo Vielma. Strong mixed-integer programming formulations for trained neural networks.Math. Program., 183(1):3–39,
-
[3]
On the reasons behind decisions
8 Adnan Darwiche and Auguste Hirth. On the reasons behind decisions. In Giuseppe De Giacomo, Alejandro Catalá, Bistra Dilkina, Michela Milano, Senén Barro, Alberto Bugarín, and Jérôme Lang, editors,ECAI 2020 - 24th European Conference on Artificial Intelligence, 29 August-8 September 2020, Santiago de Compostela, Spain, August 29 - September 8, 2020 - Inc...
work page 2020
-
[5]
URL: https://arxiv.org/abs/2104.06718,arXiv:2104.06718. A. De Palma and G. Dolcetti and C. Urban 23 12 Alessandro De Palma, Rudy Bunel, Krishnamurthy (Dj) Dvijotham, M. Pawan Kumar, Robert Stanforth, and Alessio Lomuscio. Expressive losses for verified robustness via convex combinations. InThe Twelfth International Conference on Learning Representations, ...
-
[6]
Towards A Rigorous Science of Interpretable Machine Learning
14 Finale Doshi-Velez and Been Kim. Towards a rigorous science of interpretable machine learning. arXiv preprint arXiv:1702.08608,
work page internal anchor Pith review Pith/arXiv arXiv
-
[7]
17 Timon Gehr, Matthew Mirman, Dana Drachsler-Cohen, Petar Tsankov, Swarat Chaudhuri, and Martin T. Vechev. AI2: safety and robustness certification of neural networks with abstract interpretation. In2018 IEEE Symposium on Security and Privacy, SP 2018, Proceedings, 21-23 May 2018, San Francisco, California, USA, pages 3–18. IEEE Computer Society,
work page 2018
-
[8]
Scalable verified training for provably robust image classification
18 Sven Gowal, Krishnamurthy Dvijotham, Robert Stanforth, Rudy Bunel, Chongli Qin, Jonathan Uesato, Relja Arandjelovic, Timothy Arthur Mann, and Pushmeet Kohli. Scalable verified training for provably robust image classification. In2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Korea (South), October 27 - November 2, 2019, pa...
work page 2019
-
[9]
A survey of methods for explaining black box models.ACM Comput
19 Riccardo Guidotti, Anna Monreale, Salvatore Ruggieri, Franco Turini, Fosca Giannotti, and Dino Pedreschi. A survey of methods for explaining black box models.ACM Comput. Surv., 51(5):93:1–93:42, 2019.doi:10.1145/3236009. 20 Gurobi Optimization, LLC. Gurobi Optimizer Reference Manual. URL: https://www.gurobi. com. 21 Faisal Hamman, Erfaun Noorani, Saumi...
-
[10]
Interpretation of Prediction Models Using the Input Gradient
URL: http://arxiv.org/abs/1611.07634. 23 Fred Hemery, Christophe Lecoutre, Lakhdar Sais, and Frédéric Boussemart. Extracting mucs from constraint networks. In Gerhard Brewka, Silvia Coradeschi, Anna Perini, and Paolo Traverso, editors,ECAI 2006, 17th European Conference on Artificial Intelligence, August 29 - September 1, 2006, Riva del Garda, Italy, Incl...
work page internal anchor Pith review Pith/arXiv arXiv 2006
-
[11]
24 Patrick Henriksen and Alessio Lomuscio. DEEPSPLIT: an efficient splitting method for neural network verification via indirect effect analysis. In Zhi-Hua Zhou, editor,Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI 2021, Virtual Event / Montreal, Canada, 19-27 August 2021, pages 2549–2555. ijcai.org,
work page 2021
-
[12]
Abduction-based explanations for machine learning models
26 Alexey Ignatiev, Nina Narodytska, and João Marques-Silva. Abduction-based explanations for machine learning models. InThe Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, The Thirty-First Innovative Applications of Artificial Intelligence Conference, IAAI 2019, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligen...
work page 2019
-
[13]
On relating explanations and adversarial examples
27 Alexey Ignatiev, Nina Narodytska, and João Marques-Silva. On relating explanations and adversarial examples. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d’Alché-Buc, Emily B. Fox, and Roman Garnett, editors,Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, NeurIP...
work page 2019
-
[14]
Distance-restricted explanations: Theoretical underpinnings & efficient imple- mentation
28 Yacine Izza, Xuanxiang Huang, António Morgado, Jordi Planes, Alexey Ignatiev, and João Marques-Silva. Distance-restricted explanations: Theoretical underpinnings & efficient imple- mentation. In Pierre Marquis, Magdalena Ortiz, and Maurice Pagnucco, editors,Proceedings of the 21st International Conference on Principles of Knowledge Representation and R...
work page 2024
-
[15]
30 Junqi Jiang, Francesco Leofante, Antonio Rago, and Francesca Toni
URL: https://proceedings.mlr.press/v222/jiang24a.html. 30 Junqi Jiang, Francesco Leofante, Antonio Rago, and Francesca Toni. Robust counterfactual explanations in machine learning: A survey.CoRR, abs/2402.01928,
-
[16]
30 Junqi Jiang, Francesco Leofante, Antonio Rago, and Francesca Toni
URL: https: //doi.org/10.48550/arXiv.2402.01928,doi:10.48550/ARXIV.2402.01928. 31 Ulrich Junker. QUICKXPLAIN: preferred explanations and relaxations for over-constrained problems. In Deborah L. McGuinness and George Ferguson, editors,Proceedings of the Nineteenth National Conference on Artificial Intelligence, Sixteenth Conference on Innovative Applicatio...
-
[17]
32 Guy Katz, Clark W. Barrett, David L. Dill, Kyle Julian, and Mykel J. Kochenderfer. Reluplex: An efficient SMT solver for verifying deep neural networks. In Rupak Majumdar and Viktor Kuncak, editors,Computer Aided Verification - 29th International Conference, CAV 2017, Heidelberg, Germany, July 24-28, 2017, Proceedings, Part I, volume 10426 ofLecture No...
-
[18]
Zbrzezny, Nicola Paoletti, and Marta Kwiatkowska
34 Emanuele La Malfa, Rhiannon Michelmore, Agnieszka M. Zbrzezny, Nicola Paoletti, and Marta Kwiatkowska. On guaranteed optimal robust explanations for NLP models. In Zhi-Hua Zhou, editor,Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI 2021, Virtual Event / Montreal, Canada, 19-27 August 2021, pages 2658–2665....
work page 2021
-
[19]
Promoting counterfactual robustness through diversity
36 Francesco Leofante and Nico Potyka. Promoting counterfactual robustness through diversity. InProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence, AAAI’24/IAAI’24/EAAI’24. A...
-
[20]
An approach to reachability analysis for feed-forward ReLU neural networks
38 Alessio Lomuscio and Lalit Maganti. An approach to reachability analysis for feed-forward relu neural networks.arXiv preprint arXiv:1706.07351,
work page internal anchor Pith review Pith/arXiv arXiv
-
[21]
A. De Palma and G. Dolcetti and C. Urban 25 39 Scott M. Lundberg and Su-In Lee. A unified approach to interpreting model predictions. InAdvances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, pages 4765–4774,
work page 2017
-
[22]
Logic-based explainability in machine learning
42 João Marques-Silva. Logic-based explainability in machine learning. In Leopoldo E. Bertossi and Guohui Xiao, editors,Reasoning Web. Causality, Explanations and Declarative Knowledge - 18th International Summer School 2022, Berlin, Germany, September 27-30, 2022, Tutorial Lectures, volume 13759 ofLecture Notes in Computer Science, pages 24–104. Springer,
work page 2022
-
[23]
Logic-based explainability: Past, present and future
43 João Marques-Silva. Logic-based explainability: Past, present and future. In Tiziana Margaria and Bernhard Steffen, editors,Leveraging Applications of Formal Methods, Verification and Validation. Software Engineering Methodologies - 12th International Symposium, ISoLA 2024, Crete, Greece, October 27-31, 2024, Proceedings, Part IV, volume 15222 ofLectur...
work page 2024
-
[24]
Delivering trustworthy AI through formal XAI
45João Marques-Silva and Alexey Ignatiev. Delivering trustworthy AI through formal XAI. In Thirty-Sixth AAAI Conference on Artificial Intelligence, AAAI 2022, Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence, IAAI 2022, The Twelveth Symposium on Educational Advances in Artificial Intelligence, EAAI 2022 Virtual Event, Februar...
work page 2022
-
[25]
49 Marco Túlio Ribeiro, Sameer Singh, and Carlos Guestrin. “Why should I trust you?”: Explaining the predictions of any classifier. InProceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016, pages 1135–1144. ACM,
work page 2016
-
[26]
A symbolic approach to explaining bayesian network classifiers
52 Andy Shih, Arthur Choi, and Adnan Darwiche. A symbolic approach to explaining bayesian network classifiers. In Jérôme Lang, editor,Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI 2018, July 13-19, 2018, Stockholm, Sweden, pages 5103–5111. ijcai.org,
work page 2018
-
[27]
Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps
URL: https://doi.org/10.24963/ijcai.2018/708, doi:10. 24963/IJCAI.2018/708. 26 Faster Verified Explanations for Neural Networks 53 Karen Simonyan, Andrea Vedaldi, and Andrew Zisserman. Deep inside convolutional networks: Visualising image classification models and saliency maps.arXiv preprint arXiv:1312.6034,
work page internal anchor Pith review Pith/arXiv arXiv doi:10.24963/ijcai.2018/708 2018
-
[28]
54 Gagandeep Singh, Rupanshu Ganvir, Markus Püschel, and Martin T. Vechev. Beyond the single neuron convex barrier for neural network certification. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d’Alché-Buc, Emily B. Fox, and Roman Garnett, editors,Advances in Neural Information Processing Systems 32: Annual Conference on Neural Inform...
work page 2019
-
[29]
55 Gagandeep Singh, Timon Gehr, Matthew Mirman, Markus Püschel, and Martin T. Vechev. Fast and effective robustness certification. In Samy Bengio, Hanna M. Wallach, Hugo Larochelle, Kristen Grauman, Nicolò Cesa-Bianchi, and Roman Garnett, editors,Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Syste...
work page 2018
-
[30]
Evaluating robustness of neural networks with mixed integer programming
58 Vincent Tjeng, Kai Yuanqing Xiao, and Russ Tedrake. Evaluating robustness of neural networks with mixed integer programming. In7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9,
work page 2019
-
[31]
59 Shubham Ugare, Debangshu Banerjee, Sasa Misailovic, and Gagandeep Singh
URL: https://openreview.net/forum?id=HyGIdiRqtm. 59 Shubham Ugare, Debangshu Banerjee, Sasa Misailovic, and Gagandeep Singh. Incremental verification of neural networks.Proc. ACM Program. Lang., 7(PLDI):1920–1945, 2023.doi: 10.1145/3591299. 60 Caterina Urban and Antoine Miné. A review of formal methods applied to machine learning. CoRR, abs/2104.02466,
-
[32]
61 Shiqi Wang, Kexin Pei, Justin Whitehouse, Junfeng Yang, and Suman Jana
URL: https://arxiv.org/abs/2104.02466. 61 Shiqi Wang, Kexin Pei, Justin Whitehouse, Junfeng Yang, and Suman Jana. Efficient formal safety analysis of neural networks. In Samy Bengio, Hanna M. Wallach, Hugo Larochelle, Kristen Grauman, Nicolò Cesa-Bianchi, and Roman Garnett, editors,Advances in Neural Information Processing Systems 31: Annual Conference on...
-
[33]
62 Shiqi Wang, Huan Zhang, Kaidi Xu, Xue Lin, Suman Jana, Cho-Jui Hsieh, and J. Zico Kolter. Beta-crown: Efficient bound propagation with per-neuron split constraints for neural network robustness verification. In Marc’Aurelio Ranzato, Alina Beygelzimer, Yann N. Dauphin, Percy Liang, and Jennifer Wortman Vaughan, editors,Advances in Neural Information Pro...
work page 2021
- [34]
-
[35]
65 Min Wu, Haoze Wu, and Clark W
arXiv:2409.03060,doi:10.48550/ARXIV.2409.03060. 65 Min Wu, Haoze Wu, and Clark W. Barrett. Verix: Towards verified explainability of deep neural networks. In Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, and Sergey Levine, editors,Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processi...
-
[36]
Fast and complete: Enabling complete neural network verification with rapid and massively A
66 Kaidi Xu, Huan Zhang, Shiqi Wang, Yihan Wang, Suman Jana, Xue Lin, and Cho-Jui Hsieh. Fast and complete: Enabling complete neural network verification with rapid and massively A. De Palma and G. Dolcetti and C. Urban 27 parallel incomplete verifiers. In9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7,
work page 2021
-
[37]
Efficient neural network robustness certification with general activation functions
68 Huan Zhang, Tsui-Wei Weng, Pin-Yu Chen, Cho-Jui Hsieh, and Luca Daniel. Efficient neural network robustness certification with general activation functions. In Samy Bengio, Hanna M. Wallach, Hugo Larochelle, Kristen Grauman, Nicolò Cesa-Bianchi, and Roman Garnett, editors,Advances in Neural Information Processing Systems 31: Annual Conference on Neural...
work page 2018
-
[38]
69 Duo Zhou, Christopher Brix, Grani A. Hanasusanto, and Huan Zhang. Scalable neural network verification with branch-and-bound inferred cutting planes. In Amir Globersons, Lester Mackey, Danielle Belgrave, Angela Fan, Ulrich Paquet, Jakub M. Tomczak, and Cheng Zhang, editors,Advances in Neural Information Processing Systems 38: Annual Conference on Neura...
work page 2024
-
[39]
Scalable neural network verification with branch-and-bound inferred cutting planes
70 Duo Zhou, Christopher Brix, Grani A Hanasusanto, and Huan Zhang. Scalable neural network verification with branch-and-bound inferred cutting planes. InNeural Information Processing Systems, pages 29324–29353, 2024
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.