Approximation of Functions: Optimal Sampling and Complexity
Pith reviewed 2026-05-16 08:41 UTC · model grok-4.3
The pith
Finite function evaluations impose strict information-theoretic limits on approximation accuracy.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that any finite collection of function values creates an information-theoretic barrier to how well a function can be recovered, and that certain sampling rules and recovery procedures come close to this barrier across linear and nonlinear, adaptive and non-adaptive, deterministic and randomized measurements.
What carries the argument
The information-theoretic limit on worst-case or average-case error that is determined solely by the number and type of allowed measurements.
If this is right
- For many standard smoothness classes the minimal error decays at a specific rate determined only by the number of evaluations.
- Adaptive and non-adaptive sampling achieve the same rates in linear settings but may differ for nonlinear measurements.
- Randomized sampling can match the performance of the best deterministic choice in expectation.
- The same limits apply whether measurements are point evaluations or other linear functionals.
Where Pith is reading between the lines
- These bounds give a benchmark that any practical numerical method or machine-learning surrogate must eventually respect.
- The relations between measurement types suggest unified design principles for choosing data in high-dimensional or infinite-dimensional problems.
- Direct extensions could include deriving explicit constants for concrete function spaces used in PDE solvers or signal processing.
Load-bearing premise
The results presuppose standard function classes and error measures from optimal recovery without always spelling out the precise norms or smoothness assumptions.
What would settle it
A concrete function class and norm for which no choice of n sampling points (even adaptive or random) yields approximation error matching the claimed lower bound would disprove the stated limits.
read the original abstract
We consider approximation or recovery of functions based on a finite number of function evaluations. This is a well-studied problem in optimal recovery, machine learning, and numerical analysis in general, but many fundamental insights were obtained only recently. We discuss different aspects of the information-theoretic limit that appears because of the limited amount of data available, as well as algorithms and sampling strategies that come as close to it as possible. We also discuss (optimal) sampling in a broader sense, allowing other types of measurements that may be nonlinear, adaptive and random, and present several relations between the different settings in the spirit of information-based complexity. We hope that this article provides both, a basic introduction to the subject and a contemporary summary of the current state of research.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript is a survey on the approximation and recovery of functions from a finite number of evaluations. It reviews information-theoretic limits arising from limited data, near-optimal sampling strategies (including nonlinear, adaptive, and random measurements), and relations between these settings in the spirit of information-based complexity. The article positions itself as both a basic introduction and a contemporary summary of recent insights in optimal recovery, machine learning, and numerical analysis.
Significance. If the survey accurately consolidates recent results on information-theoretic limits and sampling strategies, it could serve as a useful overview for researchers working at the intersection of approximation theory, optimal recovery, and information-based complexity, particularly by highlighting connections across linear/nonlinear and adaptive/non-adaptive regimes.
major comments (2)
- [Introduction] Introduction and abstract: The text repeatedly invokes 'standard settings from optimal recovery and information-based complexity' without restating the precise function classes (e.g., Sobolev W^{k,p}, analytic functions) or the underlying norms and radii of the unit balls for the cited minimax rates and sampling results. This omission is load-bearing because the information-theoretic limits, role of adaptivity, and linear-vs-nonlinear gaps are class-dependent.
- [Relations between settings] Section discussing relations between measurement settings: The claimed relations between linear, nonlinear, adaptive, and random measurements are presented at a high level, but without anchoring each relation to a concrete function space and norm, it is impossible to assess whether the summarized 'near-optimal' strategies hold uniformly or only in specific regimes assumed by the referenced literature.
minor comments (2)
- [Abstract] The abstract states that 'many fundamental insights were obtained only recently' but does not provide even one or two concrete examples with citations in the opening paragraphs; adding such signposts would improve readability for non-specialists.
- [Notation] Notation for sampling operators and recovery maps is introduced without an early consolidated table or list of symbols, which would help readers track the distinctions between different measurement types.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments. We address each major comment below and will incorporate clarifications in the revised manuscript.
read point-by-point responses
-
Referee: [Introduction] Introduction and abstract: The text repeatedly invokes 'standard settings from optimal recovery and information-based complexity' without restating the precise function classes (e.g., Sobolev W^{k,p}, analytic functions) or the underlying norms and radii of the unit balls for the cited minimax rates and sampling results. This omission is load-bearing because the information-theoretic limits, role of adaptivity, and linear-vs-nonlinear gaps are class-dependent.
Authors: We agree that explicitly restating the function classes and norms strengthens the presentation. In the revision we will expand the introduction to briefly recall the standard settings (e.g., Sobolev balls in W^{k,p} with the corresponding norm and radius, and analytic functions on the unit disk or strip) together with the associated minimax rates that appear in the cited literature. This will anchor the information-theoretic limits without altering the survey character of the article. revision: yes
-
Referee: [Relations between settings] Section discussing relations between measurement settings: The claimed relations between linear, nonlinear, adaptive, and random measurements are presented at a high level, but without anchoring each relation to a concrete function space and norm, it is impossible to assess whether the summarized 'near-optimal' strategies hold uniformly or only in specific regimes assumed by the referenced literature.
Authors: We accept the point. The revision will add short, concrete illustrations for each relation, using well-known cases such as Sobolev balls (where linear vs. nonlinear gaps are characterized) and analytic classes (where adaptivity yields no improvement). Each example will cite the precise norm and radius so that the reader can verify the regime in which the stated near-optimality holds. revision: yes
Circularity Check
Survey paper summarizes established results without self-referential derivations or fitted predictions
full rationale
This is a survey article that reviews information-theoretic limits, sampling strategies, and relations between measurement settings in optimal recovery and information-based complexity. No original derivations, equations, or predictions are introduced that could reduce to the paper's own inputs by construction. All cited results are drawn from prior external literature, with no load-bearing self-citations, self-definitional steps, or renaming of known results as new insights. The discussion presupposes standard settings but does not attempt to derive or predict quantities from its own fitted parameters or ansatzes.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We consider approximation or recovery of functions based on a finite number of function evaluations... sampling numbers g_n(F,Y) ... least squares algorithms ... relations between different settings in the spirit of information-based complexity.
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The sampling numbers are well-studied for many (classical) classes of functions... bounds by best sparse approximation, entropy numbers, Hilbert numbers.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.