Universal, sample-optimal algorithms for recovery of anisotropic functions from i.i.d. samples
Pith reviewed 2026-05-10 16:50 UTC · model grok-4.3
The pith
A compressed sensing algorithm recovers periodic anisotropic functions optimally from i.i.d. samples up to polylog factors.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We present a universal algorithm that uses compressed sensing to recover the Fourier coefficients of periodic functions belonging to anisotropic Sobolev spaces or anisotropic spaces of dominating mixed smoothness from i.i.d. samples. The algorithm is nonadaptive and achieves approximation rates optimal up to a dimension-independent polylogarithmic factor, as established by a lower bound for the adaptive m-width of the unit balls of these classes. We further show that any universal linear algorithm is at best suboptimal by a dimension-dependent polylogarithmic factor, establishing the necessity of nonlinear methods for achieving universal sample-optimal recovery.
What carries the argument
The nonadaptive compressed sensing recovery of Fourier coefficients that serves as a universal approximator across anisotropic smoothness classes.
If this is right
- The same algorithm applies to both anisotropic Sobolev spaces and dominating mixed smoothness spaces without modification.
- Rates are near-optimal independently of the specific anisotropy parameters.
- Linear methods incur an extra dimension-dependent penalty in the convergence rate for universal recovery.
- Nonlinear sparse recovery is essential to avoid additional losses in high-dimensional settings.
Where Pith is reading between the lines
- Similar compressed sensing techniques might extend to non-periodic domains by substituting other orthogonal bases for the Fourier expansion.
- The results indicate that nonlinearity helps overcome extra rate penalties when the smoothness structure is hidden in high dimensions.
- Practical tests on data sets with varying unknown anisotropy could check whether the polylog overhead remains small in applications.
Load-bearing premise
The target functions are periodic so that they admit a Fourier series, and they lie in one of the specified anisotropic smoothness classes with i.i.d. samples available.
What would settle it
A concrete function in an anisotropic Sobolev space where the compressed sensing recovery produces approximation error larger than the claimed near-optimal rate by more than the polylog factor.
read the original abstract
A key problem in approximation theory is the recovery of high-dimensional functions from samples. In many cases, the functions of interest exhibit anisotropic smoothness, and, in many practical settings, the nature of this anisotropy may be unknown a priori. Therefore, an important question involves the development of universal algorithms, namely, algorithms that simultaneously achieve optimal or near-optimal rates of convergence across a range of different anisotropic smoothness classes. In this work, we consider universal approximation of periodic functions that belong to anisotropic Sobolev spaces and anisotropic dominating mixed smoothness Sobolev spaces. Our first result is the construction of a universal algorithm. This recasts function recovery as a sparse recovery problem for Fourier coefficients and then exploits compressed sensing to yield the desired approximation rates. Note that this algorithm is nonadaptive, as it does not seek to learn the anisotropic smoothness of the target function. We then demonstrate optimality of this algorithm up to a dimension-independent polylogarithmic factor. We do this by presenting a lower bound for the adaptive $m$-width for the unit balls of such function classes. Finally, we demonstrate the necessity of nonlinear algorithms. We show that universal linear algorithms can achieve rates that are at best suboptimal by a dimension-dependent polylogarithmic factor. In other words, they suffer from a curse of dimensionality in the rate -- a phenomenon which justifies the necessity of nonlinear algorithms for universal recovery.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript constructs a nonadaptive universal algorithm for recovering periodic functions belonging to anisotropic Sobolev spaces and anisotropic dominating mixed smoothness Sobolev spaces from i.i.d. samples. The approach recasts recovery as a sparse Fourier-coefficient recovery problem and applies compressed-sensing techniques to achieve the desired approximation rates without adapting to the unknown anisotropy. Optimality (up to a dimension-independent polylogarithmic factor) is established via a lower bound on the adaptive m-widths of the corresponding unit balls. The paper additionally shows that any universal linear algorithm is at best suboptimal by a dimension-dependent polylogarithmic factor, thereby justifying the use of nonlinear methods.
Significance. If the proofs hold, the work supplies a concrete, nonadaptive construction that is nearly sample-optimal across an entire family of anisotropic classes and supplies an independent lower bound that certifies the rate. The explicit separation between linear and nonlinear universal methods, together with the dimension-independent polylog gap, is a notable strength and directly addresses a central question in high-dimensional approximation theory.
major comments (2)
- [Lower-bound argument (after the algorithm construction)] The lower bound on adaptive m-widths is load-bearing for the near-optimality claim; the manuscript should make explicit how the polylogarithmic factor arises and confirm that its dependence on dimension is only polylogarithmic (rather than polynomial) in the relevant theorem statement and proof.
- [Section on linear versus nonlinear universal algorithms] The argument that linear methods incur a dimension-dependent polylogarithmic penalty is central to the necessity-of-nonlinearity conclusion; the derivation of this penalty should be stated with explicit constants or exponents so that the dimension dependence is unambiguous.
minor comments (2)
- The abstract states the rates are optimal 'up to a dimension-independent polylogarithmic factor' but does not record the precise sample complexity m in terms of the smoothness indices and dimension; adding this relation would improve readability.
- Notation for the anisotropy parameters (e.g., the vector of smoothness indices) should be introduced once and used consistently when stating both the upper and lower bounds.
Simulated Author's Rebuttal
We thank the referee for the careful reading, the positive evaluation, and the constructive suggestions that will improve the clarity of the manuscript. We address the two major comments point by point below and will incorporate the requested clarifications in the revised version.
read point-by-point responses
-
Referee: The lower bound on adaptive m-widths is load-bearing for the near-optimality claim; the manuscript should make explicit how the polylogarithmic factor arises and confirm that its dependence on dimension is only polylogarithmic (rather than polynomial) in the relevant theorem statement and proof.
Authors: We agree that the origin and dimension dependence of the polylogarithmic factor in the lower bound on adaptive m-widths should be stated more explicitly. In the proof of the lower bound (currently Theorem 3.3), the factor arises from standard volume-ratio estimates on the entropy numbers of the anisotropic Sobolev unit ball; the exponent is independent of dimension d and depends only on the smoothness parameters. We will revise the theorem statement to read explicitly that the lower bound holds up to a factor of (log m)^C with C independent of d, and we will add a short paragraph immediately after the theorem tracing the appearance of this factor in the entropy-number argument. This change will be made in the next revision. revision: yes
-
Referee: The argument that linear methods incur a dimension-dependent polylogarithmic penalty is central to the necessity-of-nonlinearity conclusion; the derivation of this penalty should be stated with explicit constants or exponents so that the dimension dependence is unambiguous.
Authors: We agree that the dimension dependence in the linear-method lower bound should be made fully explicit. The current proof of Theorem 4.2 derives the penalty via a combinatorial argument (a variant of the Sauer-Shelah lemma applied to the sign patterns of linear functionals) that produces a factor of the form (log d)^c for an explicit positive constant c depending only on the smoothness parameters. We will update the theorem statement to include the explicit exponent (or the precise functional form of the polylog) and will insert one additional sentence in the proof that isolates the combinatorial step responsible for the d-dependence. These modifications will appear in the revised manuscript. revision: yes
Circularity Check
No significant circularity detected
full rationale
The derivation proceeds by explicitly constructing a nonadaptive algorithm that recasts periodic function recovery as sparse recovery of Fourier coefficients and applies standard compressed sensing techniques to achieve the stated rates across anisotropic Sobolev and dominating mixed smoothness classes. Optimality is established via a separate lower bound on adaptive m-widths, and the suboptimality of linear methods is shown by a distinct argument on dimension-dependent polylog factors. None of these steps reduce by construction to fitted inputs, self-definitions, or self-citation chains; all rely on external results from approximation theory and compressed sensing under the explicit assumptions of periodicity and i.i.d. samples. The paper is therefore self-contained with independent content in its central claims.
Axiom & Free-Parameter Ledger
axioms (2)
- standard math Periodic functions admit Fourier expansions and anisotropic Sobolev norms control coefficient decay
- standard math Compressed sensing yields stable recovery of sparse vectors from random measurements
Reference graph
Works this paper leans on
-
[1]
Adcock.Modified Fourier expansions: theory, construction and applications
B. Adcock.Modified Fourier expansions: theory, construction and applications. PhD thesis, University of Cambridge, 2010
work page 2010
- [2]
- [3]
- [4]
- [5]
-
[6]
B. Adcock, N. Dexter, and S. Moraga. Optimal deep learning of holomorphic operators between Banach spaces. In A. Globerson, L. Mackey, D. Belgrave, A. Fan, U. Paquet, J. Tomczak, and C. Zhang, editors, Advances in Neural Information Processing Systems, volume 37, pages 27725–27789. Curran Associates, Inc., 2024
work page 2024
- [7]
-
[8]
URL:https://arxiv.org/abs/2410.23440,arXiv:2410.23440
B. Adcock, M. Griebel, and G. Maier. The sample complexity of learning Lipschitz operators with respect to Gaussian measures.arXiv:2410.23440, 2025
-
[9]
F. Bartel and P. Schr¨ oter. Learning and leveraging anisotropy parameters in ANOVA approximation. arXiv:2511.00251, 2025
-
[10]
A. Belloni, V. Chernozhukov, and L. Wang. Square-root lasso: pivotal recovery of sparse signals via conic programming.Biometrika, 98(4):791–806, 2011
work page 2011
- [11]
- [12]
-
[13]
S. Brugiapaglia, S. Dirksen, H. C. Jung, and H. Rauhut. Sparse recovery in bounded Riesz systems with applications to numerical methods for PDEs.Appl. Comput. Harmon. Anal., 53:231–269, 2021
work page 2021
-
[14]
G. Byrenheid and T. Ullrich. Optimal sampling recovery of mixed order Sobolev embeddings via discrete Littlewood–Paley type characterizations.Anal. Math., 43(2):133–191, 2017
work page 2017
-
[15]
J. Chen and H. Wang. Preasymptotics and asymptotics of approximation numbers of anisotropic Sobolev embeddings.J. Complexity, 39:94–110, 2017
work page 2017
- [16]
- [17]
- [18]
- [19]
- [20]
- [21]
- [22]
-
[23]
M. Dolbeault, D. Krieg, and M. Ullrich. A sharp upper bound for sampling numbers in L2.Appl. Comput. Harmon. Anal., 63:113–134, 2023. 36
work page 2023
-
[24]
M. Griebel and J. Hamaekers. Fast discrete Fourier transform on generalized sparse grids. In J. Garcke and D. Pfl¨ uger, editors,Sparse Grids and Applications – Munich 2012, pages 75–107. Springer, Cham, 2014
work page 2012
- [25]
-
[26]
T. Jahn, T. Ullrich, and F. Voigtlaender. Sampling numbers of smoothness classes via ℓ1-minimization. J. Complexity, 79:101786, 2023
work page 2023
-
[27]
Y. Kolomoitsev, T. Lomako, and S. Tikhonov. Sparse grid approximation in weighted Wiener spaces.J. Fourier Anal. Appl., 29(2):19, 2023
work page 2023
-
[28]
E. D. Kosov and V. N. Temlyakov. Sampling recovery of functions with mixed smoothness.Adv. Oper. Theory, 10(2):49, 2025
work page 2025
-
[29]
D. Krieg. Tractability of sampling recovery on unweighted function classes.Proc. Amer. Math. Soc. Ser. B, 11(12):115–125, 2024
work page 2024
- [30]
- [31]
-
[32]
D. Krieg and M. Sonnleitner. Random points are optimal for the approximation of Sobolev functions. IMA J. Numer. Anal., 44(3):1346–1371, 2024
work page 2024
-
[33]
D. Krieg and M. Sonnleitner. Function recovery on manifolds using scattered data.J. Approx. Theory, 305:106098, 2025
work page 2025
-
[34]
D. Krieg and M. Ullrich. Function values are enough for L2-approximation.Found. Comput. Math., 21(4):1141–1151, 2021
work page 2021
- [35]
- [36]
-
[37]
G. Migliorati.Polynomial approximation by means of the random discrete L2 projection and application to inverse problems for PDEs with stochastic data. PhD thesis, Politecnico di Milano, 2013
work page 2013
-
[38]
B. S. Mitjagin. Approximation of functions of Lp and C spaces on the torus.Mat. Sb., 58(100):397–414, 1962
work page 1962
-
[39]
M. Moeller. Gelfand numbers and best m-term trigonometric approximation for weighted mixed Wiener classes inL 2. Master’s thesis, TU Chemnitz, Germany, 2023
work page 2023
-
[40]
M. Moeller, K. Pozharska, and T. Ullrich. Instance optimal function recovery – samples, decoders and asymptotic performance.arXiv:2503.16209, 2025
-
[41]
M. Moeller, K. Pozharska, and T. Ullrich. Sampling designs for function recovery – theoretical guarantees, comparison and optimality.MCQMC 2024 Proceedings, 2025. to appear
work page 2024
-
[42]
M. Moeller, S. Stasyuk, and T. Ullrich. High-dimensional sparse trigonometric approximation in the uniform norm and consequences for sampling recovery.arXiv:2407.15965, 2024
-
[43]
M. Moeller, S. Stasyuk, and T. Ullrich. Best m-term trigonometric approximation in weighted Wiener spaces and applications.Adv. Oper. Theory, 11(2):18, 2026
work page 2026
-
[44]
V. D. Nguyen, V. K. Nguyen, and W. Sickel. s-numbers of embeddings of weighted Wiener algebras.J. Approx. Theory, 279:105745, 2022. 37
work page 2022
-
[45]
E. Novak and H. Wo´ zniakowski.Tractability of Multivariate Problems, Volume I: Linear Information. Number 6 in EMS Tracts in Mathematics. European Mathematical Society Publishing House, Z¨ urich, Switzerland, 2008
work page 2008
-
[46]
H. Rauhut and R. Ward. Sparse Legendre expansions via ℓ1-minimization.J. Approx. Theory, 164(5):517– 533, 2012
work page 2012
-
[47]
H. Rauhut and R. Ward. Interpolation via weighted ℓ1 minimization.Appl. Comput. Harmon. Anal., 40(2):321–351, 2016
work page 2016
-
[48]
M. Sonnleitner and M. Ullrich. On the power of iid information for linear approximation. arXiv:2310.12740, 2023
-
[49]
M. I. Stesin. Aleksandrov diameters of finite-dimensional sets and classes of smooth functions.Dokl. Akad. Nauk SSSR, 220(6):1278–1281, 1975. In Russian
work page 1975
-
[50]
S. A. Telyakovskii. Some bounds for trigonometric series with quasi-convex coefficients.Mat. Sb., 105(3):426–444, 1964
work page 1964
-
[51]
Temlyakov.Multivariate Approximation
V. Temlyakov.Multivariate Approximation. Cambridge University Press, 2018
work page 2018
-
[52]
V. N. Temlyakov. Approximation by elements of a finite-dimensional subspace of functions from various Sobolev or Nikol’skii spaces.Math. Notes, 43(6):444–454, 1988
work page 1988
-
[53]
X. Wang. Volumes of generalized unit balls.Math. Mag., 78(5):390–395, 2005. 38
work page 2005
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.