Efficient Techniques for Data Reconstruction, with Finite-Width Recovery Guarantees
Pith reviewed 2026-05-08 12:31 UTC · model grok-4.3
The pith
A unified optimization formulation provably reconstructs training data with high probability in random feature models of sufficient width.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
In the random feature model the unified optimization formulation recovers the exact training data with high probability once the network width is sufficiently large, with the required width bounded using PAC-style arguments; when the data lie in a low-dimensional subspace the width requirement relaxes to a function of the subspace dimension, and an efficient reconstruction algorithm approximates this subspace from first-layer weight changes to enable practical recovery in general models using only last-layer weights.
What carries the argument
The unified optimization formulation that matches initial and trained network parameters to candidate data points, together with the subspace approximation obtained from the change in first-layer weights.
If this is right
- Exact recovery of training data becomes possible in the random feature model once width exceeds the PAC-derived threshold.
- The subspace relaxation permits successful reconstruction at smaller widths when data intrinsic dimension is low.
- The efficient algorithm lowers the effective search-space dimension and the minimal width needed for high-quality results.
- Numerical tests on CIFAR-10 show improved reconstruction quality relative to non-subspace methods.
Where Pith is reading between the lines
- Defenses could target the leakage of subspace orientation through first-layer weight changes.
- The PAC bounds offer a concrete way to set minimum widths for privacy-preserving random-feature training.
- Similar finite-width analysis might extend to other architectures if the subspace structure is preserved.
- Testing the algorithm on datasets with known sensitive labels would quantify real-world privacy leakage.
Load-bearing premise
The optimization formulation must exactly encode the relationship between parameter changes and the training data, and the random feature model with fixed first-layer weights must accurately describe the network.
What would settle it
An experiment on a random feature network whose width meets the derived PAC bound yet the optimization fails to recover the training data with probability higher than the allowed failure rate.
Figures
read the original abstract
Data reconstruction attacks on trained neural networks aim to recover the data on which the network has been trained and pose a significant threat to privacy, especially if the training dataset contains sensitive information. Here, we propose a unified optimization formulation of the data reconstruction problem based on initial and trained parameter values, incorporating state-of-the-art proposals. We show that in the random feature model, this formulation provably leads to training data reconstruction with high probability, provided the network width is sufficiently large; this unprecedented finite-width result uses PAC-style bounds. Furthermore, when the data lies in a low-dimensional subspace, we show that the network width requirement for successful reconstruction can be relaxed, with bounds depending on the subspace dimension rather than the ambient dimension. For general neural network models and unknown data orientations, we propose an efficient reconstruction algorithm that approximates the low-dimensional data subspace through the change in the first-layer weights during training and uses only the last-layer weights for reconstruction, thus reducing the search space dimension and the required network width for high-quality reconstructions. Our numerical experiments on synthetic datasets and CIFAR-10 confirm that our subspace-aware reconstruction approach outperforms standard full-space techniques.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a unified optimization formulation for data reconstruction attacks that incorporates initial and trained network parameters. In the random feature model it derives finite-width recovery guarantees via PAC-style bounds, showing high-probability exact reconstruction when width is sufficiently large. When data lies in a low-dimensional subspace the width requirement is relaxed to depend on subspace dimension rather than ambient dimension. For general networks an efficient algorithm recovers an approximate subspace from first-layer weight changes and performs reconstruction using only last-layer weights. Experiments on synthetic data and CIFAR-10 demonstrate that the subspace-aware method outperforms full-space baselines.
Significance. If the finite-width PAC bounds hold under the stated assumptions, the work supplies the first explicit high-probability reconstruction guarantees for finite-width random-feature models, a concrete advance over prior asymptotic or infinite-width analyses. The subspace relaxation and the practical algorithm that reduces search-space dimension are directly useful for privacy auditing. The combination of a clean optimization formulation, PAC bounds, and reproducible numerical confirmation on CIFAR-10 strengthens the contribution.
major comments (2)
- [§3, Theorem 1] §3 (Random Feature Model, Theorem 1): the PAC bound statement must explicitly list the boundedness assumptions on the random features and the precise form of the reconstruction objective; without these the high-probability claim cannot be verified from the given derivation sketch.
- [§4.2] §4.2 (Subspace Relaxation): the claim that the subspace orientation is recoverable from first-layer weight changes is load-bearing for the relaxed width bound, yet the argument appears to rely on an unstated concentration result; a self-contained lemma showing that the estimated subspace converges to the true one with high probability is required.
minor comments (3)
- [Abstract] The abstract and introduction should clarify that all theoretical results are stated for the random-feature model with fixed first-layer weights; the transition to general networks is only algorithmic.
- [Experiments] Table 1 and Figure 3: axis labels and legend entries should be enlarged for readability; the current font size makes quantitative comparison of reconstruction error difficult.
- [§5] The CIFAR-10 experiment description should specify the exact architecture (depth, width, activation) and the precise baseline methods against which the subspace-aware algorithm is compared.
Simulated Author's Rebuttal
We thank the referee for the positive assessment and constructive comments on our manuscript. We address each major comment below and will incorporate the requested clarifications in the revised version.
read point-by-point responses
-
Referee: [§3, Theorem 1] §3 (Random Feature Model, Theorem 1): the PAC bound statement must explicitly list the boundedness assumptions on the random features and the precise form of the reconstruction objective; without these the high-probability claim cannot be verified from the given derivation sketch.
Authors: We agree that the theorem statement should be self-contained. The derivation implicitly relies on the random features satisfying ||φ(x)||_2 ≤ 1 almost surely and on the reconstruction objective being the convex program min_w ||W_1 w - y||_2^2 subject to the trained last-layer weights matching the observed output. In the revision we will restate Theorem 1 with these assumptions listed explicitly at the outset, followed by the precise objective, so that the PAC bound and its high-probability claim can be verified directly from the text. revision: yes
-
Referee: [§4.2] §4.2 (Subspace Relaxation): the claim that the subspace orientation is recoverable from first-layer weight changes is load-bearing for the relaxed width bound, yet the argument appears to rely on an unstated concentration result; a self-contained lemma showing that the estimated subspace converges to the true one with high probability is required.
Authors: We acknowledge the need for an explicit supporting lemma. The current argument invokes a matrix concentration bound on the first-layer weight change ΔW ≈ (1/n) X X^T projected onto the random features, but does not isolate the result. We will add a new self-contained Lemma 4.1 that proves: under the assumption that the data lie in a d-dimensional subspace, the principal angles between the estimated subspace (top-d singular vectors of ΔW) and the true subspace converge to zero at rate O(√(d log m / n)) with probability 1-δ, using a matrix Bernstein inequality. This lemma will be placed immediately before the relaxed-width theorem and will make the dependence on subspace dimension fully rigorous. revision: yes
Circularity Check
No significant circularity; derivation is self-contained under model assumptions
full rationale
The paper proposes an optimization formulation for data reconstruction from initial and trained parameters, then derives high-probability finite-width recovery guarantees in the random feature model via PAC-style bounds. These bounds are obtained by applying standard concentration arguments to the random feature model (fixed random first layer, trainable second layer) and the optimization objective; they are not defined in terms of the target reconstruction success probability. The subspace relaxation similarly derives width requirements from the subspace dimension via the same PAC machinery rather than by construction. No equations reduce a prediction to a fitted parameter, no load-bearing self-citations are invoked for uniqueness or ansatz, and the central claim does not rename a known empirical pattern. The argument structure is therefore internally consistent and independent of its own outputs.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Random feature model: first-layer weights are fixed random, only second layer is trained.
- standard math PAC-style concentration bounds apply to the reconstruction optimization.
Reference graph
Works this paper leans on
-
[1]
Reconstructing training data from model gradient, provably
Zihan Wang, Jason Lee, and Qi Lei. Reconstructing training data from model gradient, provably. In International Conference on Artificial Intelligence and Statistics, pages 6595–6612. PMLR, 2023
work page 2023
-
[2]
Reconstructing training data with informed adver- saries
Borja Balle, Giovanni Cherubin, and Jamie Hayes. Reconstructing training data with informed adver- saries. In2022 IEEE Symposium on Security and Privacy (SP), pages 1138–1156. IEEE, 2022
work page 2022
-
[3]
Reconstructing training data from trained neural networks
Niv Haim, Gal Vardi, Gilad Yehudai, Ohad Shamir, and Michal Irani. Reconstructing training data from trained neural networks. InAdvances in neural information processing systems, volume 35, pages 22911–22924. Curran Associates, Inc., 2022
work page 2022
-
[4]
arXiv preprint arXiv:1906.05890 , year=
Kaifeng Lyu and Jian Li. Gradient descent maximizes the margin of homogeneous neural networks. arXiv preprint arXiv:1906.05890, 2019
-
[5]
Directional convergence and alignment in deep learning
Ziwei Ji and Matus Telgarsky. Directional convergence and alignment in deep learning. InAdvances in Neural Information Processing Systems, volume 33, pages 17176–17186. Curran Associates, Inc., 2020
work page 2020
-
[6]
Gon Buzaglo, Niv Haim, Gilad Yehudai, Gal Vardi, Yakir Oz, Yaniv Nikankin, and Michal Irani. Deconstructing data reconstruction: Multiclass, weight decay and general losses.Advances in Neural Information Processing Systems, 36:51515–51535, 2023
work page 2023
-
[7]
Noel Loo, Ramin Hasani, Mathias Lechner, Alexander Amini, and Daniela Rus. Understand- ing reconstruction attacks with the neural tangent kernel and dataset distillation.arXiv preprint arXiv:2302.01428, 2023
-
[8]
arXiv preprint arXiv:2509.22214 , year=
Leonardo Iurada, Simone Bombari, Tatiana Tommasi, and Marco Mondelli. A Law of Data Reconstruc- tion for Random Features (and Beyond), September 2025. arXiv:2509.22214 [cs]
-
[9]
Arthur Jacot, Franck Gabriel, and Cl´ ement Hongler. Neural tangent kernel: Convergence and general- ization in neural networks.Advances in neural information processing systems, 31, 2018
work page 2018
-
[10]
Random Features for Large-Scale Kernel Machines
Ali Rahimi and Benjamin Recht. Random Features for Large-Scale Kernel Machines. InAdvances in Neural Information Processing Systems, volume 20. Curran Associates, Inc., 2007
work page 2007
-
[11]
Ali Rahimi and Benjamin Recht. Weighted sums of random kitchen sinks: Replacing minimization with randomization in learning.Advances in neural information processing systems, 21, 2008
work page 2008
-
[12]
The manifold tangent classifier.Advances in neural information processing systems, 24, 2011
Salah Rifai, Yann N Dauphin, Pascal Vincent, Yoshua Bengio, and Xavier Muller. The manifold tangent classifier.Advances in neural information processing systems, 24, 2011
work page 2011
-
[13]
John A Lee, Michel Verleysen, et al.Nonlinear dimensionality reduction, volume 1. Springer, 2007
work page 2007
-
[14]
Generative adversarial networks.Communications of the ACM, 63(11):139–144, 2020
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial networks.Communications of the ACM, 63(11):139–144, 2020
work page 2020
-
[15]
Odelia Melamed, Gilad Yehudai, and Gal Vardi. Adversarial examples exist in two-layer relu networks for low dimensional linear subspaces.Advances in Neural Information Processing Systems, 36:5028–5049, 2023
work page 2023
-
[16]
Adversarial vulnerability for any classifier, 2018
Alhussein Fawzi, Hamza Fawzi, and Omar Fawzi. Adversarial vulnerability for any classifier, 2018
work page 2018
-
[17]
arXiv preprint arXiv:1810.04374 , year=
Yitong Sun, Anna Gilbert, and Ambuj Tewari. On the Approximation Properties of Random ReLU Features, August 2019. arXiv:1810.04374 [stat]
-
[18]
Deep learning: a statistical viewpoint
Peter L Bartlett, Andrea Montanari, and Alexander Rakhlin. Deep learning: a statistical viewpoint. Acta numerica, 30:87–201, 2021
work page 2021
-
[19]
A kernel two-sample test.The journal of machine learning research, 13(1):723–773, 2012
Arthur Gretton, Karsten M Borgwardt, Malte J Rasch, Bernhard Sch¨ olkopf, and Alexander Smola. A kernel two-sample test.The journal of machine learning research, 13(1):723–773, 2012. 11
work page 2012
-
[20]
Bharath K Sriperumbudur, Kenji Fukumizu, and Gert RG Lanckriet. Universality, characteristic kernels and rkhs embedding of measures.Journal of Machine Learning Research, 12(7), 2011
work page 2011
-
[21]
Titouan Vayer and R´ emi Gribonval. Controlling wasserstein distances by kernel norms with application to compressive statistical learning.Journal of Machine Learning Research, 24(149):1–51, 2023
work page 2023
- [22]
-
[23]
Wassily Hoeffding. Probability Inequalities for Sums of Bounded Random Variables.Journal of the American Statistical Association, 58(301):13–30, March 1963
work page 1963
-
[24]
Roman Vershynin.High-Dimensional Probability: An Introduction with Applications in Data Science. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, Cambridge, 2018. 12 A Proof of theoretical results We recall Hoeffding’s inequality, which will be useful to bound, with high probability, the deviation of the finite-wi...
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.