PoLO: Proof-of-Learning and Proof-of-Ownership at Once with Chained Watermarking
Pith reviewed 2026-05-22 14:46 UTC · model grok-4.3
The pith
PoLO uses chained watermarking to prove both that a model was trained on particular data and that the prover owns it, all in one privacy-preserving step.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
PoLO achieves proof-of-learning and proof-of-ownership at once by embedding a chain of watermarks during the training process, where each watermark links to the previous one to record both the learning trajectory and the owner's identity. The method allows verification to confirm the chain's integrity and the owner's claim using only the final model and public parameters, without access to private training data. Experiments demonstrate that this yields 99 percent watermark detection accuracy, reduces verification cost to 1.5-10 percent of traditional approaches, and forces forgery to consume 1.1-4 times more resources, while the proof retains over 90 percent accuracy under attacks.
What carries the argument
Chained watermarking, a sequential embedding process that links successive watermarks to jointly encode training history and ownership identity for efficient verification.
If this is right
- Verification of both training effort and ownership becomes possible at a fraction of previous computational cost.
- Model owners can demonstrate legitimate training without revealing private datasets.
- Forging a convincing proof now requires noticeably higher resources than honest creation.
- Detection accuracy remains high even after common post-training modifications.
- The same mechanism supports repeated verification without repeated access to training records.
Where Pith is reading between the lines
- In distributed training settings this could let contributors prove their share of work without exposing local data.
- Integration with public ledgers might create tamper-evident records of model provenance.
- The approach could extend to other sequential processes like incremental fine-tuning where each stage needs separate proof.
- Lower verification overhead might encourage more frequent ownership checks in model marketplaces.
Load-bearing premise
The chained watermarks stay intact and detectable even when an attacker tries to remove or replace them without having run the original training process.
What would settle it
A successful attack that forges a valid PoLO proof using fewer than 1.1 times the honest generation cost while keeping watermark detection accuracy above 90 percent after standard removal attempts would disprove the core efficiency and robustness claims.
Figures
read the original abstract
Our evaluation shows that PoLO achieves \textbf{99\%} watermark detection accuracy for ownership verification, while preserving data privacy and cutting verification costs to just \textbf{1.5--10\%} of traditional methods. Forging PoLO demands \textbf{1.1--4$\times$} more resources than honest proof generation, with the original proof retaining over \textbf{90\%} detection accuracy even after attacks.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces PoLO, a scheme that simultaneously realizes proof-of-learning and proof-of-ownership for machine-learning models via chained watermarking. It reports 99% watermark detection accuracy for ownership verification, verification costs reduced to 1.5–10% of traditional methods, a 1.1–4× resource overhead for forging, and retention of >90% detection accuracy after attacks, all while preserving training-data privacy.
Significance. If the robustness and cost-asymmetry claims hold under realistic attack models, PoLO would provide a practical primitive for verifiable ML training and ownership in privacy-sensitive settings. The integration of two proofs into one chained construction and the reported efficiency gains are potentially valuable, but the absence of formal bounds or exhaustive adaptive-attack coverage weakens the grounding of the forgery-cost claim.
major comments (2)
- [§5] §5 (Evaluation): the reported >90% post-attack detection accuracy and 1.1–4× forgery overhead rest on experiments that appear limited to standard removal attacks; no results are shown for adaptive adversaries (e.g., fine-tuning or extraction optimized to erase sequential watermarks while preserving utility), which is load-bearing for the central cost-asymmetry claim.
- [§3.2] §3.2 (Construction): the chained-watermarking definition supplies no security reduction or explicit assumptions on the base watermarking primitive that would provably prevent an adversary from breaking the chain without the original training data or model internals; the empirical robustness therefore lacks a formal anchor.
minor comments (2)
- The abstract and §5 state concrete accuracy and cost figures without accompanying error bars, confidence intervals, or dataset/model details; adding these would improve reproducibility.
- [Table 3] Table 3 (or equivalent cost table): the baseline for the 1.5–10% verification-cost reduction should be clarified; it is unclear whether the comparison uses the most efficient existing PoO schemes or a naïve full-model verification.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive feedback. We address each major comment below, acknowledging the need for stronger empirical coverage of adaptive attacks and clearer grounding of the construction. Revisions will focus on adding targeted experiments and explicit assumptions while remaining honest about the empirical nature of the current work.
read point-by-point responses
-
Referee: [§5] §5 (Evaluation): the reported >90% post-attack detection accuracy and 1.1–4× forgery overhead rest on experiments that appear limited to standard removal attacks; no results are shown for adaptive adversaries (e.g., fine-tuning or extraction optimized to erase sequential watermarks while preserving utility), which is load-bearing for the central cost-asymmetry claim.
Authors: We agree that the cost-asymmetry claim would be more robust with evaluation against adaptive adversaries. In the revised manuscript we will add experiments using fine-tuning and model-extraction attacks specifically tuned to remove sequential watermarks while preserving utility. These results will be reported in an expanded Section 5 with the same metrics (detection accuracy and resource overhead) to directly support the 1.1–4× forgery overhead claim. revision: yes
-
Referee: [§3.2] §3.2 (Construction): the chained-watermarking definition supplies no security reduction or explicit assumptions on the base watermarking primitive that would provably prevent an adversary from breaking the chain without the original training data or model internals; the empirical robustness therefore lacks a formal anchor.
Authors: The current manuscript presents a practical chained-watermarking construction evaluated empirically rather than via formal reduction. We will revise §3.2 to state explicit assumptions on the base watermarking primitive (e.g., that it resists removal when the training data and model internals remain private) and to clarify that security is argued empirically. A full security reduction is outside the scope of this work and would require additional theoretical development. revision: partial
- A complete formal security reduction for the chained-watermarking construction against adaptive adversaries.
Circularity Check
No significant circularity detected in PoLO construction or claims
full rationale
The paper proposes a novel chained-watermarking construction for simultaneous proof-of-learning and proof-of-ownership. Performance figures (99% detection accuracy, 1.5-10% verification cost, 1.1-4x forgery overhead, >90% post-attack retention) are stated as evaluation outcomes rather than predictions derived from fitted parameters or self-referential equations. No self-definitional loops, fitted-input-as-prediction patterns, or load-bearing self-citations that reduce the central claims to their own inputs appear in the provided text. The robustness argument rests on the explicit construction and reported tests against standard attacks; these do not collapse into ansatzes or uniqueness theorems imported from the same authors. The derivation chain is therefore self-contained as an original scheme whose security and efficiency properties are asserted via design and measurement, not by constructional equivalence to the inputs.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Forging PoLO demands 1.1–4× more resources than honest proof generation
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Samuel Idowu, Daniel Strüber, and Thorsten Berger. Asset management in ma- chine learning: State-of-research and state-of-practice.ACM Computing Surveys, 55(7):1–35, 2022
work page 2022
-
[2]
Robert Gorwa and Michael Veale. Moderating model marketplaces: platform gov- ernance puzzles for ai intermediaries.Law, Innovation and Technology, 16(2):341– 391, 2024
work page 2024
-
[3]
Ironforge: An open, secure, fair, decentralized federated learning.TNNLS, 36(1):354–368, 2025
Guangsheng Yu, Xu Wang, Caijun Sun, Qin Wang, Ping Yu, Wei Ni, and Ren Ping Liu. Ironforge: An open, secure, fair, decentralized federated learning.TNNLS, 36(1):354–368, 2025
work page 2025
-
[4]
Jiasi Weng, Jian Weng, Chengjun Cai, Hongwei Huang, and Cong Wang. Golden grain: Building a secure and decentralized model marketplace for mlaas.TDSC, 19(5):3149–3167, 2022
work page 2022
-
[5]
Guangsheng Yu, Yanna Jiang, Qin Wang, Xu Wang, Baihe Ma, Caijun Sun, Wei Ni, and Ren Ping Liu. Split unlearning.CCS, 2025
work page 2025
-
[6]
Dataset ownership verification in contrastive pre-trained models.ICLR, 2025
Yuechen Xie, Jie Song, Mengqi Xue, Haofei Zhang, Xingen Wang, Bingde Hu, Genlang Chen, and Mingli Song. Dataset ownership verification in contrastive pre-trained models.ICLR, 2025
work page 2025
-
[7]
European Union. Artificial intelligence act, article 53 — transparency obligations for general-purpose ai models, 2024. Consolidated text
work page 2024
-
[8]
U.S. Copyright Office. Section 512 of title 17: Limitations on liability relating to material online (dmca safe harbor), 2024. Official DMCA §512 guidance
work page 2024
-
[9]
European Commission. Eu rules for general-purpose ai models start to apply, bringing more transparency, safety and accountability, 2024
work page 2024
-
[10]
Proof-of-learning: Definitions and practice
Hengrui Jia, Mohammad Yaghini, et al. Proof-of-learning: Definitions and practice. SP, 2021
work page 2021
-
[11]
Rui Zhang, Jian Liu, Yuan Ding, Zhibo Wang, Qingbiao Wu, and Kui Ren. “Ad- versarial examples” for proof-of-learning.SP, 2022
work page 2022
-
[12]
On the necessity of auditable algorithmic definitions for machine unlearning
Anvith Thudi, Hengrui Jia, Ilia Shumailov, and Nicolas Papernot. On the necessity of auditable algorithmic definitions for machine unlearning. InUSENIX Security, 2022
work page 2022
-
[13]
Choquette-Choo, Natalie Dullerud, Varun Chandrasekaran, and Nicolas Papernot
Congyu Fang, Hengrui Jia, Anvith Thudi, Mohammad Yaghini, Christopher A. Choquette-Choo, Natalie Dullerud, Varun Chandrasekaran, and Nicolas Papernot. Proof-of-learning is currently more broken than you think. InEuroSP, 2023
work page 2023
-
[14]
Rethinking White-Box water- marks on deep learning models under neural structural obfuscation
Yifan Yan, Xudong Pan, Mi Zhang, and Min Yang. Rethinking White-Box water- marks on deep learning models under neural structural obfuscation. InUSENIX Security, pages 2347–2364, 2023
work page 2023
-
[15]
Yijia Chang, Hanrui Jiang, Chao Lin, Xinyi Huang, and Jian Weng. Towards under- standing and enhancing security of proof-of-training for dnn model ownership verification. InUSENIX Security, 2025
work page 2025
-
[16]
Proof-of-Learning with Incentive Security
Zishuo Zhao, Zhixuan Fang, Xuechao Wang, Xi Chen, and Yuan Zhou. Proof-of- learning with incentive security.arXiv preprint arXiv:2404.09005, 2024
work page internal anchor Pith review Pith/arXiv arXiv 2024
-
[17]
Zero-knowledge proofs of training for deep neural networks
Kasra Abbaszadeh, Christodoulos Pappas, Jonathan Katz, and Dimitrios Pa- padopoulos. Zero-knowledge proofs of training for deep neural networks. In CCS, page 4316–4330, 2024
work page 2024
-
[18]
Bitcoin: A peer-to-peer electronic cash system.Satoshi Nakamoto, 2008
Satoshi Nakamoto. Bitcoin: A peer-to-peer electronic cash system.Satoshi Nakamoto, 2008
work page 2008
-
[19]
Peter Gaži, Aggelos Kiayias, and Dionysis Zindros. Proof-of-stake sidechains. In SP, pages 139–156, 2019
work page 2019
-
[20]
Multi-certificate attacks against proof-of-elapsed-time and their countermeasures
Huibo Wang, Guoxing Chen, Yinqian Zhang, and Zhiqiang Lin. Multi-certificate attacks against proof-of-elapsed-time and their countermeasures. InNDSS, 2022. 13
work page 2022
-
[21]
Simple proofs of space-time and rational proofs of storage
Tal Moran and Ilan Orlov. Simple proofs of space-time and rational proofs of storage. InCRYPTO, pages 381–409, 2019
work page 2019
-
[22]
Exploring unfairness on proof of authority: Order manipulation attacks and remedies
Qin Wang, Rujia Li, Qi Wang, Shiping Chen, and Yang Xiang. Exploring unfairness on proof of authority: Order manipulation attacks and remedies. InAsiaCCS, pages 123–137, 2022
work page 2022
-
[23]
Confidential-profitt: confidential proof of fair training of trees.ICLR, 2022
Ali Shahin Shamsabadi et al. Confidential-profitt: confidential proof of fair training of trees.ICLR, 2022
work page 2022
-
[24]
Veriml: Enabling integrity assurances and fair payments for machine learning as a service
Lingchen Zhao, Qian Wang, Cong Wang, Qi Li, Chao Shen, and Bo Feng. Veriml: Enabling integrity assurances and fair payments for machine learning as a service. TPDS, 2021
work page 2021
-
[25]
Dilum Bandara, and Shiping Chen
Qin Wang, Guangsheng Yu, Yilin Sai, H.M.N. Dilum Bandara, and Shiping Chen. Is your AI truly yours? leveraging blockchain for copyrights, provenance, and lineage.IEEE TSC, 2025
work page 2025
-
[26]
Shuo Shao, Yiming Li, Hongwei Yao, et al. Explanation as a watermark: Towards harmless and multi-bit model ownership verification via watermarking feature attribution.NDSS, 2025
work page 2025
-
[27]
Deepeclipse: How to break white-box dnn-watermarking schemes.USENIX Security, 2024
Alessandro Pegoraro and others Segna. Deepeclipse: How to break white-box dnn-watermarking schemes.USENIX Security, 2024
work page 2024
-
[28]
Xinyun Chen et al. REFIT: A unified watermark removal framework for deep learning systems with limited data.AsiaCCS, 2021
work page 2021
-
[29]
Watermark removal scheme based on neural network model pruning
Wenwen Gu. Watermark removal scheme based on neural network model pruning. InMLNLP, page 377–382, 2023
work page 2023
-
[30]
Unmarker: A universal attack on defensive image watermarking
Andre Kassis and Urs Hengartner. Unmarker: A universal attack on defensive image watermarking. InSP, volume 2, page 8, 2025
work page 2025
-
[31]
Robust model watermarking for image processing networks via structure consistency.TPAMI, 2024
Jie Zhang et al. Robust model watermarking for image processing networks via structure consistency.TPAMI, 2024
work page 2024
-
[32]
Promptcare: Prompt copyright protection by watermark injection and verification.SP, 2024
Hongwei Yao, Jian Lou, et al. Promptcare: Prompt copyright protection by watermark injection and verification.SP, 2024
work page 2024
-
[33]
Detecting voice cloning attacks via timbre water- marking
Chang Liu, Jie Zhang, et al. Detecting voice cloning attacks via timbre water- marking. InNDSS, 2024
work page 2024
-
[34]
Embedding watermarks into deep neural networks.ICML, 2017
Yusuke Uchida, Yuki Nagai, et al. Embedding watermarks into deep neural networks.ICML, 2017
work page 2017
-
[35]
A robustness-assured white-box watermark in neural networks.TDSC, 2023
Peizhuo Lv, Pan Li, Shengzhi Zhang, Kai Chen, Ruigang Liang, Hualong Ma, Yue Zhao, and Yingjiu Li. A robustness-assured white-box watermark in neural networks.TDSC, 2023
work page 2023
-
[36]
Riga: Covert and robust white-box watermarking of deep neural networks
Tianhao Wang and Florian Kerschbaum. Riga: Covert and robust white-box watermarking of deep neural networks. InWWW, 2021
work page 2021
-
[37]
FedIPR: Ownership verification for federated deep neural network models.TPAMI, 2022
Bowen Li, Lixin Fan, et al. FedIPR: Ownership verification for federated deep neural network models.TPAMI, 2022
work page 2022
-
[38]
Shuo Shao, Wenyuan Yang, Hanlin Gu, Zhan Qin, Lixin Fan, and Qiang Yang. Fedtracker: Furnishing ownership verification and traceability for federated learning model.TDSC, 2024
work page 2024
-
[39]
Wei Zong, Yang-Wai Chow, Willy Susilo, et al. Ipremover: A generative model inversion attack against deep neural network fingerprinting and watermarking. AAAI, 2024
work page 2024
-
[40]
Collusion resistant watermarking for deep learning models protection
Sayoko Kakikura, Hyunho Kang, et al. Collusion resistant watermarking for deep learning models protection. InICACT, 2022
work page 2022
-
[41]
Fangqi Li and Shilin Wang. Secure watermark for deep neural networks with multi-task learning.arXiv preprint arXiv:2103.10021, 2021
-
[42]
Cynthia Dwork, Aaron Roth, et al. The algorithmic foundations of differential privacy.Foundations and trends®in theoretical computer science, 9(3–4):211–407, 2014. APPENDIX 1 Notation (Tab.6) Note on chain integrity (Algorithm 1, Else branch).When the watermark detection rate has not yet reached 𝜂𝐺, the algorithm continues training from the current weight...
work page 2014
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.