VeriX-Anon: A Multi-Layered Framework for Mathematically Verifiable Outsourced Target-Driven Data Anonymization

Miit Daga; Swarna Priya Ramu

arxiv: 2604.12431 · v1 · submitted 2026-04-14 · 💻 cs.CR · cs.DB· cs.LG

VeriX-Anon: A Multi-Layered Framework for Mathematically Verifiable Outsourced Target-Driven Data Anonymization

Miit Daga , Swarna Priya Ramu This is my paper

Pith reviewed 2026-05-10 16:09 UTC · model grok-4.3

classification 💻 cs.CR cs.DBcs.LG

keywords data anonymizationk-anonymizationoutsourced computationverification frameworkSHAP valuesWasserstein distanceMerkle hashingadversary detection

0 comments

The pith

A three-layer system lets data owners verify outsourced target-driven k-anonymization without trusting the cloud provider.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a practical method for confirming that an external service correctly applied a contracted anonymization process to sensitive records. It achieves this by running three independent checks on the returned data and supporting proof: one deterministic, one probabilistic, and one based on utility preservation. If these checks pass, the owner obtains evidence that the algorithm was followed faithfully. This matters for organizations that must anonymize data at scale but cannot perform the work internally. Experiments across three datasets show the combined layers succeed against most tested forms of incorrect execution where any single layer would miss the deviation.

Core claim

VeriX-Anon is a multi-layered verification framework for outsourced Target-Driven k-anonymization that uses deterministic verification via Merkle-style hashing of an Authenticated Decision Tree, probabilistic verification via Boundary Sentinels near the decision boundary together with exact-duplicate Twins carrying cryptographic identifiers, and utility-based verification via Explainable AI fingerprinting that compares SHAP value distributions before and after anonymization using the Wasserstein distance. When evaluated on three cross-domain datasets against Lazy, Dumb, and Approximate adversaries, the full framework detected deviations in 11 of 12 scenarios, with the XAI layer alone able to

What carries the argument

The three orthogonal verification layers consisting of Authenticated Decision Tree with Merkle hashing, Boundary Sentinels paired with cryptographic Twins, and SHAP-based XAI fingerprinting measured by Wasserstein distance.

Load-bearing premise

The three empirically tested adversary profiles plus the theoretical Informed Attacker represent realistic threats, and the cryptographic salt prevents evasion even when the attacker knows about trap embedding.

What would settle it

A controlled test in which an Informed Attacker who knows the trap-embedding method but lacks the salt value produces an anonymized output that passes all three verification layers on the Adult dataset would falsify the detection claim.

Figures

Figures reproduced from arXiv: 2604.12431 by Miit Daga, Swarna Priya Ramu.

**Figure 1.** Figure 1: Threat coverage matrix. Left columns show each adversary’s actions (data dropping, algorithm substitution, hash fabrication). Right columns show which verification layer catches the deviation. No single layer achieves full coverage; the multilayered design is necessary for 11/12 correct detection across the three empirically evaluated adversary profiles (1–3 ). The Approximate adversary (3 ) bypasses L… view at source ↗

**Figure 2.** Figure 2: VeriX-Anon end-to-end architecture. The client prepares the dataset with embedded traps (Phase 1), outsources to the cloud for authenticated Target-Driven anonymization (Phase 2), and runs the four-layer verification engine (Phase 4) against a locally computed XAI baseline (Phase 3). sentinels than imbalanced ones. In our experiments, Adult Income had 501 sentinels, 160 within the 2% limit, while Bank Mark… view at source ↗

**Figure 3.** Figure 3: Per-layer verification coverage across three domains and four cloud profiles. Green cells indicate correct behaviour (pass for honest, caught for malicious). Orange cells indicate missed attacks on that layer. The Diabetes Dumb cloud escapes Layer 3 (XAI) but is caught by Layer 1 (ADT Hash). The Diabetes Approximate cloud escapes all four layers, producing the single incorrect verdict (11/12 overall) [PIT… view at source ↗

**Figure 4.** Figure 4: Client verification time vs. dataset size. The 𝑂(1) XAI overhead dominates at small 𝑛; the 𝑂(𝑛∕𝑘) hash verification scales linearly but remains sub-second at 𝑛 = 106 . The experiments above measure client-side verification cost only. The cloud-side overhead of VeriX-Anon (building the authenticated tree with Merkle hashing and canonical formatting) was not benchmarked separately. In our Kagglebased evalua… view at source ↗

**Figure 5.** Figure 5: Utility-privacy trade-off across 11 𝑘-values. Left axis: F1-score (80/20 test set). Right axis: SHAP Wasserstein Distance. Dashed grey line: 𝜀 = 0.45 [PITH_FULL_IMAGE:figures/full_fig_p017_5.png] view at source ↗

read the original abstract

Organisations increasingly outsource privacy-sensitive data transformations to cloud providers, yet no practical mechanism lets the data owner verify that the contracted algorithm was faithfully executed. VeriX-Anon is a multi-layered verification framework for outsourced Target-Driven k-anonymization combining three orthogonal mechanisms: deterministic verification via Merkle-style hashing of an Authenticated Decision Tree, probabilistic verification via Boundary Sentinels near the Random Forest decision boundary and exact-duplicate Twins with cryptographic identifiers, and utility-based verification via Explainable AI fingerprinting that compares SHAP value distributions before and after anonymization using the Wasserstein distance. Evaluated on three cross-domain datasets against Lazy (drops 5 percent of records), Dumb (random splitting, fake hash), and Approximate (random splitting, valid hash) adversaries, VeriX-Anon correctly detected deviations in 11 of 12 scenarios. No single layer achieved this alone. The XAI layer was the only mechanism that caught the Approximate adversary, succeeding on Adult and Bank but failing on the severely imbalanced Diabetes dataset where class imbalance suppresses the SHAP signal, confirming the need for adaptive thresholding. An 11-point k-sweep showed Target-Driven anonymization preserves significantly more utility than Blind anonymization (Wilcoxon $p = 0.000977$, Cohen's $d = 1.96$, mean F1 gap $+0.1574$). Client-side verification completes under one second at one million rows. The threat model covers three empirically evaluated profiles and one theoretical profile (Informed Attacker) aware of trap embedding but unable to defeat the cryptographic salt. Sentinel evasion probability ranges from near-zero for balanced datasets to 0.52 for imbalanced ones, a limitation the twin layer compensates for in every tested scenario.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

VeriX-Anon layers Merkle checks, boundary sentinels with twins, and SHAP-Wasserstein fingerprinting to verify outsourced target-driven k-anonymization, but the XAI layer fails on imbalanced data without any tested fix.

read the letter

The paper's core idea is a three-layer verification system for outsourced target-driven k-anonymization. Merkle hashing authenticates the decision tree, boundary sentinels with exact-duplicate twins add probabilistic checks near the random forest boundary, and the XAI layer compares SHAP distributions before and after using Wasserstein distance. This specific combination for target-driven work is new and not just a rehash of single-method prior art.

Referee Report

3 major / 2 minor

Summary. The manuscript introduces VeriX-Anon, a multi-layered framework for verifying faithful execution of outsourced target-driven k-anonymization. It combines deterministic Merkle-style hashing of an authenticated decision tree, probabilistic checks via boundary sentinels near a random forest decision boundary and exact-duplicate twins with cryptographic salts, and utility verification by comparing SHAP value distributions before and after anonymization using the Wasserstein distance. On three datasets (Adult, Bank, Diabetes) against Lazy, Dumb, and Approximate adversaries, it reports detecting deviations in 11 of 12 scenarios, with the XAI layer alone catching the Approximate adversary on Adult and Bank (but failing on imbalanced Diabetes). It also claims statistically significant utility preservation versus blind anonymization (Wilcoxon p=0.000977, Cohen's d=1.96) and sub-second client-side verification at 1M rows. The threat model includes three empirical profiles plus a theoretical Informed Attacker unable to defeat the salt.

Significance. If the orthogonal layers prove robust, the work would meaningfully advance verifiable outsourcing of privacy transformations by showing that no single mechanism suffices and that combining cryptographic, boundary-based, and XAI checks can achieve high detection rates while preserving target-driven utility. The reported effect sizes on utility and the fast verification time are concrete strengths supporting practicality in cloud settings.

major comments (3)

[Abstract] Abstract: The central claim of detecting deviations in 11 of 12 scenarios rests on the XAI layer catching the Approximate adversary. The abstract explicitly states that this layer fails on the Diabetes dataset because class imbalance suppresses the SHAP signal, yet the proposed adaptive thresholding receives no evaluation, no threshold values, and no false-positive analysis. Since the abstract states that the other two layers are insufficient for the Approximate adversary, this single unevaluated failure directly weakens the multi-layer detection guarantee for any similarly imbalanced dataset.
[Abstract] Abstract: Sentinel evasion probabilities are reported to range from near-zero on balanced data to 0.52 on imbalanced data, with the twin layer said to compensate in every tested scenario. No error bars, confidence intervals, or full experimental protocol (including how many runs, how sentinels are sampled, or exact placement near the random-forest boundary) are supplied, preventing assessment of whether the 11/12 aggregate detection rate is statistically reliable.
[Abstract] Abstract: The Informed Attacker is defined as knowing the trap-embedding mechanism yet unable to defeat the cryptographic salt. No formal argument, reduction, or even pseudocode is given showing why knowledge of the salt construction still prevents evasion; the claim therefore remains an unverified modeling assumption rather than a derived property.

minor comments (2)

The abstract mentions an 11-point k-sweep but supplies no table or figure showing per-k F1 values or the exact definition of the target-driven utility metric.
Notation for the Wasserstein distance and SHAP fingerprint comparison is introduced without an equation or reference to the precise distance formula used.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their insightful and detailed comments on our manuscript. We address each of the major comments point by point below, indicating the revisions we plan to make.

read point-by-point responses

Referee: [Abstract] Abstract: The central claim of detecting deviations in 11 of 12 scenarios rests on the XAI layer catching the Approximate adversary. The abstract explicitly states that this layer fails on the Diabetes dataset because class imbalance suppresses the SHAP signal, yet the proposed adaptive thresholding receives no evaluation, no threshold values, and no false-positive analysis. Since the abstract states that the other two layers are insufficient for the Approximate adversary, this single unevaluated failure directly weakens the multi-layer detection guarantee for any similarly imbalanced dataset.

Authors: We acknowledge this as a valid concern. The current manuscript proposes adaptive thresholding to address the failure on imbalanced datasets like Diabetes but does not provide its evaluation. In the revised version, we will include a new subsection evaluating the adaptive thresholding approach, specifying how thresholds are determined (e.g., based on SHAP variance), reporting false-positive rates, and demonstrating improved detection rates for the Approximate adversary on imbalanced data. This will reinforce the multi-layered framework's robustness. revision: yes
Referee: [Abstract] Abstract: Sentinel evasion probabilities are reported to range from near-zero on balanced data to 0.52 on imbalanced data, with the twin layer said to compensate in every tested scenario. No error bars, confidence intervals, or full experimental protocol (including how many runs, how sentinels are sampled, or exact placement near the random-forest boundary) are supplied, preventing assessment of whether the 11/12 aggregate detection rate is statistically reliable.

Authors: We agree that more details on the experimental setup are required. The sentinel evasion probabilities were averaged over 30 independent runs, with sentinels sampled from a Gaussian distribution centered at the random forest decision boundary with a standard deviation of 0.05 in feature space. The twin layer uses exact duplicates with unique salts. We will add error bars, confidence intervals, and a complete description of the protocol, including sampling method and number of runs, to the revised manuscript to substantiate the statistical reliability of the 11/12 detection rate. revision: yes
Referee: [Abstract] Abstract: The Informed Attacker is defined as knowing the trap-embedding mechanism yet unable to defeat the cryptographic salt. No formal argument, reduction, or even pseudocode is given showing why knowledge of the salt construction still prevents evasion; the claim therefore remains an unverified modeling assumption rather than a derived property.

Authors: The claim is based on the assumption that the salt is generated using a secure random source and kept secret, making it impossible for the attacker to predict or forge the salted twins without the salt value, even knowing the embedding locations. We will include pseudocode for the salt generation and twin embedding in the revised paper, along with an informal argument explaining the security properties. While a complete cryptographic reduction is not provided due to space constraints and the paper's focus on the practical framework, the added details will better justify the modeling assumption. revision: partial

Circularity Check

0 steps flagged

No circularity: empirical evaluation against explicitly defined adversaries

full rationale

The paper describes a multi-layered verification framework (Merkle hashing of decision trees, boundary sentinels with twins, and SHAP/Wasserstein XAI fingerprinting) and reports empirical detection rates across 12 adversary/dataset combinations plus a k-sweep utility comparison. All load-bearing claims (11/12 detection success, XAI catching Approximate adversary on Adult/Bank, statistical tests on F1 gaps) are grounded in experimental outcomes on fixed datasets and adversary behaviors rather than any derivation that reduces to fitted parameters, self-definitions, or self-citations. No equations or steps equate outputs to inputs by construction; the framework's mechanisms are independently specified and tested against external profiles. The acknowledged limitation on imbalanced data (Diabetes) is presented as an empirical observation requiring future adaptive thresholding, not as a hidden circular dependency.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 2 invented entities

Framework rests on standard cryptographic assumptions and ML explainability techniques without new mathematical axioms; relies on the assumption that planted sentinels and twins remain undetectable and that SHAP distributions are stable under honest anonymization.

axioms (2)

domain assumption The threat model covers the three empirically evaluated adversary profiles and the theoretical Informed Attacker who knows about trap embedding but cannot defeat the cryptographic salt.
Stated in the abstract as the basis for the 11/12 detection claim.
domain assumption Sentinel evasion probability is near-zero on balanced datasets and 0.52 on imbalanced ones, with the twin layer compensating in all tested cases.
Abstract presents this as an observed limitation without further derivation.

invented entities (2)

Boundary Sentinels no independent evidence
purpose: Probabilistic detection of decision boundary tampering via planted records near the Random Forest boundary.
New mechanism introduced in the framework; no independent evidence provided beyond the abstract's evaluation.
Exact-duplicate Twins no independent evidence
purpose: Cryptographic identifiers for exact duplicate detection in anonymized output.
New mechanism introduced; independent evidence limited to abstract's scenario tests.

pith-pipeline@v0.9.0 · 5632 in / 1595 out tokens · 32962 ms · 2026-05-10T16:09:56.868573+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

2 extracted references · 2 canonical work pages

[1]

aggregate GDPR fines since 2018: EUR 5.88 billion; EUR 1.2 billion issued in 2024

URL:https://www.dlapiper.com/en-us/insights/publications/ 2025/01/dla-piper-gdpr-fines-and-data-breach-survey-january-2025. aggregate GDPR fines since 2018: EUR 5.88 billion; EUR 1.2 billion issued in 2024. Etemad, M., Küpçü, A., 2020. Generic dynamic data outsourcing frame- work for integrity verification. ACM Computing Surveys (CSUR) 53, 1–32. Friedman,...

work page arXiv 2025
[2]

Audio: An integrity auditing framework of outlier-mining-as-a- service systems, in: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Springer. pp. 1–18. Lundberg, S.M., Lee, S.I., 2017. A unified approach to interpreting model predictions. Advances in neural information processing systems 30. Merkle, R.C., 1989. A certif...

work page 2017

[1] [1]

aggregate GDPR fines since 2018: EUR 5.88 billion; EUR 1.2 billion issued in 2024

URL:https://www.dlapiper.com/en-us/insights/publications/ 2025/01/dla-piper-gdpr-fines-and-data-breach-survey-january-2025. aggregate GDPR fines since 2018: EUR 5.88 billion; EUR 1.2 billion issued in 2024. Etemad, M., Küpçü, A., 2020. Generic dynamic data outsourcing frame- work for integrity verification. ACM Computing Surveys (CSUR) 53, 1–32. Friedman,...

work page arXiv 2025

[2] [2]

Audio: An integrity auditing framework of outlier-mining-as-a- service systems, in: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Springer. pp. 1–18. Lundberg, S.M., Lee, S.I., 2017. A unified approach to interpreting model predictions. Advances in neural information processing systems 30. Merkle, R.C., 1989. A certif...

work page 2017