pith. sign in

arxiv: 2606.05433 · v1 · pith:KGJNPEDGnew · submitted 2026-06-03 · 💻 cs.AI · cs.SY· eess.SY

Zero knowledge verification for frontier AI training is possible

Pith reviewed 2026-06-28 06:04 UTC · model grok-4.3

classification 💻 cs.AI cs.SYeess.SY
keywords zero-knowledge proofsAI training verificationfrontier AI governancezkVMMerkle commitmentsfloating-point verificationcompute attestationtraining compute
0
0 comments X

The pith

Zero-knowledge verification of frontier AI training runs is feasible with a zkVM architecture at single-digit overhead.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper claims that the absence of technical verification for AI training compute is not fundamental but stems from current paradigms, and offers a concrete architecture to fix it. The method combines a pre-committed training specification, inter-node network observations, on-the-fly Merkle commitments of intermediate results, and verification inside a zero-knowledge virtual machine equipped with native floating-point precompiles. This produces genesis proofs, step proofs during training, and policy attestations while keeping model architecture private. A sympathetic reader would care because governance of high-impact models currently rests on self-reported compute figures, and future binding agreements on AI would lack teeth without enforceable verification. The authors estimate a working proof of concept could appear in roughly three years rather than waiting for custom silicon.

Core claim

A verification architecture for frontier dense pre-training that combines a pre-committed training specification, inter-node network observations, and on-the-fly Merkle commitments of intermediate computation, verified through a zero-knowledge Virtual Machine with native BF16/FP32 precompiles. The proof checks the actual floating-point computation the GPU performed rather than a fixed-point approximation and preserves model-architecture confidentiality through a private training specification. The protocol produces a genesis proof at initialisation, in-training step proofs across the run, and ex-ante attestations enforcing policy-relevant claims as running invariants, turning the training re

What carries the argument

The zero-knowledge Virtual Machine (zkVM) with native BF16/FP32 precompiles that verifies actual floating-point GPU computations from Merkle commitments and network observations.

If this is right

  • Training runs become auditable artefacts that can enforce cumulative compute thresholds without self-reporting.
  • Model architecture remains confidential while still allowing verification of the computation performed.
  • International regulatory agreements on frontier AI can incorporate technical verification primitives instead of remaining declaratory.
  • A proof-of-concept system could be deployed in approximately 36 months rather than the six-to-ten-year timeline required for verification-grade custom silicon.
  • The training record can carry running invariants that serve as ex-ante policy attestations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the architecture works, it could reduce the information asymmetry between AI developers and regulators on actual training effort.
  • The same combination of network observation and Merkle commitments might extend to verifying other large-scale distributed computations beyond AI training.
  • Cataloguing thirteen open research problems creates an explicit invitation for external groups to close the remaining gaps.
  • Success would make cumulative training compute a more reliable basis for tiered governance rules.

Load-bearing premise

A zkVM supporting native BF16 and FP32 operations can be realized that verifies real GPU floating-point work at frontier scale while keeping total overhead in the single-digit percent range.

What would settle it

A working implementation showing that zkVM overhead for verifying BF16/FP32 matrix multiplies at frontier batch sizes exceeds single-digit percent or cannot attest inter-node network traffic and Merkle roots without revealing the model would falsify the claim.

Figures

Figures reproduced from arXiv: 2606.05433 by Ky Nguyen, Paul Wang, Pierre Peign\'e.

Figure 1
Figure 1. Figure 1: Overview of the verification protocol. Overview [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Proof generation from an in-training challenge. [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗
read the original abstract

Frontier AI governance frameworks increasingly use cumulative training compute as the primary criterion for designating high-impact models, but enforcement rests on self-reporting because no technical verification primitive for training exists. Any future international agreement on frontier AI faces the same problem at higher stakes: coordinated regulation of technologies with significant externalities has historically rested on technical verification, without which agreements are declaratory. Recent governance analyses judge zero-knowledge proofs a promising candidate but currently impractical at frontier scale [26, 4]. We argue the impracticality is paradigm-bound rather than fundamental, and propose a verification architecture for frontier dense pre-training combining a pre-committed training specification, inter-node network observations, and on-the-fly Merkle commitments of intermediate computation, verified through a zero-knowledge Virtual Machine (zkVM) with native BF16/FP32 precompiles. The proof checks the actual floating-point computation the GPU performed rather than a fixed-point approximation, and preserves model-architecture confidentiality through a private training specification. The protocol produces three proof types: a genesis proof at initialisation, in-training step proofs across the run, and ex-ante attestations enforcing policy-relevant claims as running invariants, turning the training record into a governance-enforceable artefact. We estimate a deployable proof of concept within approximately 36 months at single-digit-percent training-side overhead, against a six-to-ten-year cycle for verification-grade custom silicon. Thirteen open research and engineering problems are catalogued as a research agenda for external contribution

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper claims that zero-knowledge verification of frontier dense pre-training is feasible (rather than fundamentally impossible) via an architecture that combines a pre-committed training specification, inter-node network observations, on-the-fly Merkle commitments of intermediate states, and verification inside a zkVM equipped with native BF16/FP32 precompiles. The protocol emits three classes of proofs (genesis, in-training step proofs, and ex-ante policy invariants) that turn the training record into a governance-enforceable artefact while preserving model confidentiality; the authors estimate a deployable proof-of-concept within ~36 months at single-digit-percent training-side overhead and list thirteen open research problems.

Significance. If the proposed architecture can be realized at the claimed overhead, it would supply the first technical primitive capable of converting self-reported training compute into verifiable, policy-relevant artefacts, directly addressing the enforcement gap identified in current frontier-AI governance proposals. The explicit catalog of thirteen open problems is a constructive contribution that could usefully structure community follow-on work.

major comments (2)
  1. [Abstract] Abstract: the single-digit-percent training-side overhead estimate for verifying frontier-scale matmuls and layer norms inside a zkVM with native BF16/FP32 precompiles is asserted without any circuit-size bound, reduction to existing zkVM benchmarks, or even order-of-magnitude calculation of the dominant kernels; this figure is load-bearing for both the 36-month POC timeline and the claim that the approach is deployable rather than requiring custom silicon.
  2. [Abstract] Abstract / architecture description: the claim that inter-node network observations plus Merkle commitments suffice to attest the actual floating-point computation performed by GPUs (without custom silicon or loss of confidentiality) is presented at a high level; no concrete protocol sketch or security argument shows how these observations close the gap between the zkVM proof and the physical training run at the scale of current frontier clusters.
minor comments (1)
  1. [Abstract] The abstract would be clearer if it briefly contrasted the proposed zkVM-plus-Merkle approach with prior zkML or zk-training proposals referenced in [26,4] rather than only stating that prior work judged the problem impractical.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive report and the recognition of the paper's potential significance for AI governance. We address the two major comments below. Both concern the level of detail in the abstract; we agree that additional supporting material strengthens the manuscript and will revise accordingly.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the single-digit-percent training-side overhead estimate for verifying frontier-scale matmuls and layer norms inside a zkVM with native BF16/FP32 precompiles is asserted without any circuit-size bound, reduction to existing zkVM benchmarks, or even order-of-magnitude calculation of the dominant kernels; this figure is load-bearing for both the 36-month POC timeline and the claim that the approach is deployable rather than requiring custom silicon.

    Authors: The referee correctly notes that the single-digit overhead figure is presented as an estimate without an explicit circuit-size analysis or reduction to published zkVM benchmarks in the current text. The estimate is derived from scaling known zkVM costs for dense linear algebra under the assumption that native BF16/FP32 precompiles eliminate the dominant emulation overhead; however, we accept that this reasoning should be made explicit. We will add a short order-of-magnitude calculation section (drawing on existing RISC-V zkVM benchmarks for matrix kernels) to the revised manuscript and qualify the 36-month timeline as conditional on those precompiles being available. revision: yes

  2. Referee: [Abstract] Abstract / architecture description: the claim that inter-node network observations plus Merkle commitments suffice to attest the actual floating-point computation performed by GPUs (without custom silicon or loss of confidentiality) is presented at a high level; no concrete protocol sketch or security argument shows how these observations close the gap between the zkVM proof and the physical training run at the scale of current frontier clusters.

    Authors: The architecture is described at a conceptual level in the abstract and introduction. The full manuscript expands on the three proof classes and the role of committed specifications, but we agree that a concise protocol sketch showing how network observations and Merkle commitments bind the zkVM execution trace to the physical GPU run (while preserving confidentiality) would address the concern. We will insert a one-page protocol outline with a high-level security argument in the revised version. revision: yes

Circularity Check

0 steps flagged

No circularity: forward-looking proposal with explicit open problems

full rationale

The paper advances a high-level verification architecture combining pre-committed specs, network observations, Merkle commitments, and a zkVM with native floating-point precompiles. No equations, fitted parameters, or derivations appear in the provided text. The 36-month POC estimate and single-digit overhead are presented as engineering targets, not outputs of any internal fitting or self-referential reduction. Cited works [26,4] address prior impracticality judgments rather than supplying load-bearing uniqueness theorems or ansatzes from the same authors. Thirteen open research problems are explicitly catalogued, confirming the central claims remain non-self-contained proposals rather than closed loops.

Axiom & Free-Parameter Ledger

1 free parameters · 2 axioms · 0 invented entities

The proposal depends on several unverified performance and integration assumptions about zkVMs and network monitoring at frontier scale that are not demonstrated or derived in the abstract.

free parameters (1)
  • single-digit-percent training-side overhead
    Presented as an achievable target without supporting derivation, benchmarks, or data.
axioms (2)
  • domain assumption A zkVM with native BF16/FP32 precompiles can verify actual floating-point GPU computations at frontier scale with low overhead.
    Invoked as the core verification mechanism in the proposed architecture.
  • domain assumption Inter-node network observations combined with on-the-fly Merkle commitments suffice to attest to the full training process.
    Central premise enabling the three proof types without custom hardware.

pith-pipeline@v0.9.1-grok · 5794 in / 1493 out tokens · 49596 ms · 2026-06-28T06:04:56.085518+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

45 extracted references · 1 linked inside Pith

  1. [1]

    Ideals, macaulay bases, and pcps

    Prashanth Amireddy, Amik Raj Behera, Srikanth Srinivasan, Madhu Sudan, and Sophus Valentin Willumsgaard. Ideals, macaulay bases, and pcps. InSTOC 2026, 2026

  2. [2]

    Rothblum

    Noga Amit, Shafi Goldwasser, Orr Paradise, and Guy N. Rothblum. Models that prove their own correctness. InICML 2024 Workshop on Theoretical Foundations of Foundation Models, 2024

  3. [3]

    Proof verification and the hardness of approximation problems.J

    Sanjeev Arora, Carsten Lund, Rajeev Motwani, Madhu Sudan, and Mario Szegedy. Proof verification and the hardness of approximation problems.J. ACM, 45(3):501–555, 1998

  4. [4]

    Verifying international agreements on AI: Six layers of verification for rules on large-scale AI development and deployment

    Mauricio Baker, Gabriel Kulp, Oliver Marks, Miles Brundage, and Lennart Heim. Verifying international agreements on AI: Six layers of verification for rules on large-scale AI development and deployment. 2025

  5. [5]

    Scalable, transparent, and post-quantum secure computational integrity.Cryptology ePrint Archive, Report 2018/046, 2018

    Eli Ben-Sasson, Iddo Bentov, Yinon Horesh, and Michael Riabzev. Scalable, transparent, and post-quantum secure computational integrity.Cryptology ePrint Archive, Report 2018/046, 2018

  6. [6]

    Interactive oracle proofs

    Eli Ben-Sasson, Alessandro Chiesa, and Nicholas Spooner. Interactive oracle proofs. InTheory of Cryptography Conference (TCC 2016-B), 2016

  7. [7]

    Sum-check protocol for approximate computations.Cryptology ePrint Archive, Report 2025/2152, 2025

    Dor Bitan, Zachary DeStefano, Shafi Goldwasser, Yuval Ishai, Yael Tauman Kalai, and Justin Thaler. Sum-check protocol for approximate computations.Cryptology ePrint Archive, Report 2025/2152, 2025

  8. [8]

    ZKML: An optimizing system for ML inference in zero-knowledge proofs

    Bing-Jyue Chen, Suppakit Waiwitlikhit, Ion Stoica, and Daniel Kang. ZKML: An optimizing system for ML inference in zero-knowledge proofs. InProceedings of the Nineteenth European Conference on Computer Systems (EuroSys), 2024

  9. [9]

    The rising costs of training frontier AI models.arXiv preprint arXiv:2405.21015, 2024

    Ben Cottier, Robi Rahman, Loredana Fattorini, Nestor Maslej, Tamay Besiroglu, and David Owen. The rising costs of training frontier AI models.arXiv preprint arXiv:2405.21015, 2024. Published by Epoch AI

  10. [10]

    Silent data corruptions at scale.arXiv preprint arXiv:2102.11245, 2021

    Harish Dattatraya Dixit, Sneha Pendharkar, Matt Beadon, Chris Mason, Tejasvi Chakravarthy, Bharath Muthiah, and Sriram Sankar. Silent data corruptions at scale.arXiv preprint arXiv:2102.11245, 2021

  11. [11]

    The Llama 3 herd of models.arXiv preprint arXiv:2407.21783, 2024

    Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, et al. The Llama 3 herd of models.arXiv preprint arXiv:2407.21783, 2024

  12. [12]

    Article 51: Classification of General-Purpose AI Models as General-Purpose AI Models with Systemic Risk

    EU Council. Article 51: Classification of General-Purpose AI Models as General-Purpose AI Models with Systemic Risk. EU Artificial Intelligence Act Online Resource, https: //artificialintelligenceact.eu/article/51/, 2024. Regulation (EU) 2024/1689, Official Journal version of 13 June 2024. Accessed: 2026-04-22

  13. [13]

    Experimenting with zero-knowledge proofs of training.arXiv preprint arXiv:2310.02421, 2023

    Sanjam Garg, Aarushi Guo, Omer Reingold, and Ron Roth. Experimenting with zero-knowledge proofs of training.arXiv preprint arXiv:2310.02421, 2023. 10

  14. [14]

    Cambridge University Press, 2001

    Oded Goldreich.Foundations of Cryptography: Basic Tools. Cambridge University Press, 2001

  15. [15]

    Rothblum, Jonathan Shafer, and Amir Yehudayoff

    Shafi Goldwasser, Guy N. Rothblum, Jonathan Shafer, and Amir Yehudayoff. Interactive Proofs for Verifying Machine Learning. In James R. Lee, editor,12th Innovations in Theoretical Computer Science Conference (ITCS 2021), volume 185 ofLeibniz International Proceedings in Informatics (LIPIcs), pages 41:1–41:19, Dagstuhl, Germany, 2021. Schloss Dagstuhl – Le...

  16. [16]

    On the size of pairing-based non-interactive arguments

    Jens Groth. On the size of pairing-based non-interactive arguments. InAdvances in Cryptology – EUROCRYPT 2016, 2016

  17. [17]

    Hochschild, Paul Turner, Jeffrey C

    Peter H. Hochschild, Paul Turner, Jeffrey C. Mogul, Rama Govindaraju, Parthasarathy Ran- ganathan, David E. Culler, and Amin Vahdat. Cores that don’t count. InProceedings of the Workshop on Hot Topics in Operating Systems (HotOS), 2021

  18. [18]

    Scaling up trustless DNN inference with zero-knowledge proofs

    Daniel Kang, Tatsunori Hashimoto, Ion Stoica, and Yi Sun. Scaling up trustless DNN inference with zero-knowledge proofs. InarXiv preprint arXiv:2210.08674, 2022

  19. [19]

    VeriLoRA: Fine-tuning large language models with verifiable security via zero-knowledge proofs

    Guofu Liao, Taotao Wang, Shengli Zhang, Jiqun Zhang, Long Shi, and Dacheng Tao. VeriLoRA: Fine-tuning large language models with verifiable security via zero-knowledge proofs. In Network and Distributed System Security Symposium (NDSS), 2026

  20. [20]

    An efficient transform from sigma protocols to nizk with a crs and non- programmable random oracle

    Yehuda Lindell. An efficient transform from sigma protocols to nizk with a crs and non- programmable random oracle. InAdvances in Cryptology – EUROCRYPT 2015, 2015

  21. [21]

    Llama 3 training economics and cost breakdown, 2024

    Dylan Patel and SemiAnalysis. Llama 3 training economics and cost breakdown, 2024. Industry analysis; seehttps://www.semianalysis.com

  22. [22]

    Guaranteeable memory: An HBM-based chiplet for verifiable AI workloads

    James Petrie. Guaranteeable memory: An HBM-based chiplet for verifiable AI workloads. In Workshop on Technical AI Governance (TAIG) at ICML 2025, Vancouver, Canada, 2025

  23. [23]

    Flexible hardware-enabled guarantees: Part II: Technical options for flexible hardware-enabled guarantees

    James Petrie and Onni Aarne. Flexible hardware-enabled guarantees: Part II: Technical options for flexible hardware-enabled guarantees. Technical report, ARIA, 2025. arXiv:2506.03409

  24. [24]

    DASH: Deterministic attention scheduling for high-throughput reproducible LLM training

    Xinwei Qiang, Hongmin Chen, Shixuan Sun, Jingwen Leng, Xin Liu, and Minyi Guo. DASH: Deterministic attention scheduling for high-throughput reproducible LLM training. InInterna- tional Conference on Learning Representations (ICLR), 2026

  25. [25]

    An international agreement to prevent the premature creation of artificial superintelligence

    Aaron Scher, David Abecassis, Peter Barnett, and Brian Abeyta. An international agreement to prevent the premature creation of artificial superintelligence. 2025

  26. [26]

    What does it take to catch a Chinchilla? Verifying rules on large-scale neural network training via compute monitoring

    Yonadav Shavit. What does it take to catch a Chinchilla? Verifying rules on large-scale neural network training via compute monitoring. 2023

  27. [27]

    Verifiable evaluations of machine learning models using zkSNARKs.arXiv preprint arXiv:2402.02675, 2024

    Tobin South, Alexander Camuto, Shrey Jain, Shayla Nguyen, Robert Mahari, Christian Paquin, Jason Morton, and Alex Pentland. Verifiable evaluations of machine learning models using zkSNARKs.arXiv preprint arXiv:2402.02675, 2024

  28. [28]

    zkLLM: Zero knowledge proofs for large language models

    Haochen Sun, Jason Li, and Hongyang Zhang. zkLLM: Zero knowledge proofs for large language models. InProceedings of the 2024 ACM SIGSAC Conference on Computer and Communications Security (CCS), 2024

  29. [29]

    zkdl: Efficient zero-knowledge proofs of deep learning training.arXiv preprint arXiv:2307.16273, 2024

    Haochen Sun and Hongyang Zhang. zkdl: Efficient zero-knowledge proofs of deep learning training.arXiv preprint arXiv:2307.16273, 2024

  30. [30]

    US Exec. Ord. Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence. Executive Order No. 14110, 88 Fed. Reg. 75191, https://www.federalregister.gov/documents/2023/11/01/2023-24283/ safe-secure-and-trustworthy-development-and-use-of-artificial-intelligence ,

  31. [31]

    Revoked by Executive Order No

    Signed October 30, 2023; published November 1, 2023. Revoked by Executive Order No. 14148 (January 20, 2025). Accessed: 2026-04-22. 11 A Open problems This appendix catalogues the open research and engineering problems that remain after the ar- chitecture of Section 3. The list complements and expands the public roadmap maintained at gpaipolicylab.org/ver...

  32. [32]

    The cleanest path, compatible with open-source verification, but requires ongoing mainte- nance as NCCL evolves

    Patched NCCL with a documented canonical format.A forked NCCL exposes a version- locked on-wire layout and enumerates which logical tensor bytes map to which wire offsets. The cleanest path, compatible with open-source verification, but requires ongoing mainte- nance as NCCL evolves

  33. [33]

    Trades a small amount of additional memory traffic for independence from NCCL’s internals

    Canonicalisation shim at the NCCL-to-transport boundary.A signed, auditable library interposed between NCCL and the NIC emits a canonicalised copy of each outgoing payload for the anchor to hash. Trades a small amount of additional memory traffic for independence from NCCL’s internals

  34. [34]

    verifiable NCCL

    Vendor-supported “verifiable NCCL” mode.A first-class configuration flag with docu- mented wire semantics. The only path that does not require the verification ecosystem to maintain its own fork or shim; depends on vendor cooperation. 13 All three yield equivalent verification guarantees and differ in deployment effort, maintenance burden, and institution...

  35. [35]

    Network anchor.Catches any deviation that changes wire-bound content; deterministic (not statistical) detection on any sampled wire-bound tensor

  36. [36]

    Any error in any intermediate that propagates to the layer output changes the committed root; the proof catches this at the layer boundary

    On-the-fly Merkle commitments.Each layer’s output is committed during training. Any error in any intermediate that propagates to the layer output changes the committed root; the proof catches this at the layer boundary

  37. [37]

    many cheap, few expensive

    Interactive sampling.Adds 10−20 statistical assurance on intermediate GEMMs for deviations affecting≥1%of entries. Each layer is necessary: (1) alone cannot see intra-node computation; (2) alone does not rule out an adversary who produces internally consistent fake commitments; (3) alone has a detection floor at the deviation fraction. Together they provi...

  38. [38]

    A direct measurement is part of the planned empirical validation work

    Proving throughput (∼106 constraints/s/GPU)is approximate and may be optimistic for constraint-heavy FP precompiles. A direct measurement is part of the planned empirical validation work. Relative comparisons between approaches are robust to this assumption

  39. [39]

    Absolute proof times scale linearly with this; the relative structure of the breakdown does not

    Per-MAC constraint count (∼90)has a range of 50–150 depending on implementation. Absolute proof times scale linearly with this; the relative structure of the breakdown does not

  40. [40]

    Generation and validation of these tables is routine engineering work

    Non-linear lookup tables(BF16 GELU, SiLU) must match the GPU kernel output at every input. Generation and validation of these tables is routine engineering work

  41. [41]

    Formal sensitivity analysis (OP-1) would close this

    Sampling security at f <1% is a defense-in-depth argument, not a hard theorem. Formal sensitivity analysis (OP-1) would close this

  42. [42]

    GPU-concurrent Merkle hashing overhead (∼1–3%)needs empirical validation on H100 with realistic training workloads

  43. [43]

    A training stack that fuses FFN end-to-end (e.g

    Fusion assumptions(Flash Attention truly fused, FFN GEMMs separable with lazy com- mitment) are kernel-dependent. A training stack that fuses FFN end-to-end (e.g. a memory- bound variant) materially increases per-sample FFN verification cost, and the sample count for fused blocks must be reduced correspondingly

  44. [44]

    The order-of-magnitude framing (K× per-expert-dense) holds across reasonable choices; concrete per-architecture figures require OP-10 resolution

    MoE cost numbersdepend on sampling-scheme design decisions open under OP-10. The order-of-magnitude framing (K× per-expert-dense) holds across reasonable choices; concrete per-architecture figures require OP-10 resolution

  45. [45]

    FlashAttention-3-deterministic

    INT8 training with INT32 accumulation(supported by H100/B200 Tensor Cores) would eliminate float non-associativity and make Freivalds directly applicable. No frontier model currently uses full INT8 pretraining, but the industry trend toward FP8 and FP4 is adjacent; verification cost at low-precision regimes is a worthwhile future study. C Determinism enfo...