Approximate Replicability in Learning
Pith reviewed 2026-05-18 04:09 UTC · model grok-4.3
The pith
Three relaxed versions of replicability allow near-optimal agnostic PAC learners.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
In all three cases, for constant replicability we obtain close to sample-optimal agnostic PAC learners: pointwise and approximate replicability are achievable using O(d/α² + 1/α⁴) samples, while semi-replicability requires Θ(d²/α²) labeled samples.
What carries the argument
The three relaxations of replicability (pointwise consistency on fixed inputs, approximate consistency on most of the distribution, and semi-replicability with shared unlabeled data) that weaken full stability while preserving learning guarantees.
If this is right
- Pointwise replicability admits agnostic PAC learners with O(d/α² + 1/α⁴) samples.
- Approximate replicability achieves the same sample bound.
- Semi-replicability solves the task using Θ(d²/α²) labeled samples together with shared unlabeled data.
- All three bounds are close to the information-theoretic optimum for agnostic PAC learning without replicability.
Where Pith is reading between the lines
- These relaxations could enable stable algorithms for tasks such as threshold learning that are known to be impossible under strict replicability.
- The gap between the O(d/α²) and Θ(d²/α²) bounds highlights how access to unlabeled data can reduce labeled-sample demand while retaining replicability.
- The same relaxations might be tested for compatibility with other stability requirements such as differential privacy.
Load-bearing premise
The three proposed relaxations are natural and practically meaningful approximations to full replicability.
What would settle it
A distribution and hypothesis class where every constant-replicable pointwise learner requires asymptotically more than O(d/α² + 1/α⁴) samples would refute the near-optimality result.
read the original abstract
Replicability, introduced by (Impagliazzo et al. STOC '22), is the notion that algorithms should remain stable under a resampling of their inputs (given access to shared randomness). While a strong and interesting notion of stability, the cost of replicability can be prohibitive: there is no replicable algorithm, for instance, for tasks as simple as threshold learning (Bun et al. STOC '23). Given such strong impossibility results we ask: under what approximate notions of replicability is learning possible? In this work, we propose three natural relaxations of replicability in the context of PAC learning: (1) Pointwise: the learner must be consistent on any fixed input, but not across all inputs simultaneously, (2) Approximate: the learner must output hypotheses that classify most of the distribution consistently, (3) Semi: the algorithm is fully replicable, but may additionally use shared unlabeled samples. In all three cases, for constant replicability we obtain close to sample-optimal agnostic PAC learners: 1) and 2) are achievable using $O(d/\alpha^2 + 1/\alpha^{4})$ samples, while 3) requires $\Theta(d^2/\alpha^2)$ labeled samples.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces three relaxations of replicability (pointwise, approximate, and semi-replicability) in the PAC learning setting. It shows that, for constant replicability parameters, these allow agnostic PAC learners with sample complexities O(d/α² + 1/α⁴) for pointwise and approximate replicability, and Θ(d²/α²) labeled samples for semi-replicability, building on prior impossibility results for full replicability.
Significance. If the results hold, this work is significant as it identifies practical approximations to replicability that restore learnability for tasks where full replicability is impossible (e.g., threshold learning). The formal definitions and derivations from uniform convergence plus stabilization yield near-optimal bounds, extending Impagliazzo et al. (STOC '22) and Bun et al. (STOC '23) with new stability conditions that suffice for PAC guarantees.
major comments (1)
- §3: The stability conditions for the three relaxations are defined precisely enough to support the PAC claims in Theorems 4.1, 4.3 and 5.2, but the manuscript should explicitly verify that these conditions are strictly weaker than full replicability (i.e., recover the original definition only in the limit of the approximation parameters).
minor comments (3)
- Abstract: The phrase 'close to sample-optimal' would be strengthened by a one-sentence comparison to the standard agnostic PAC bound O(d/α² + 1/α²), clarifying the precise overhead introduced by each relaxation.
- §4.1 and §4.3: The replicability-stabilization step is invoked to obtain the stated sample bounds; a short high-level description of how the overhead is derived (beyond 'standard uniform convergence') would improve accessibility.
- Theorem 5.2: The Θ(d²/α²) lower bound for semi-replicability is stated for labeled samples; confirming whether the quadratic d dependence persists when unlimited shared unlabeled samples are available would clarify the result.
Simulated Author's Rebuttal
We thank the referee for the positive assessment and constructive feedback. The single major comment is addressed below; we will incorporate the requested clarification as a minor revision.
read point-by-point responses
-
Referee: §3: The stability conditions for the three relaxations are defined precisely enough to support the PAC claims in Theorems 4.1, 4.3 and 5.2, but the manuscript should explicitly verify that these conditions are strictly weaker than full replicability (i.e., recover the original definition only in the limit of the approximation parameters).
Authors: We agree that an explicit verification would improve clarity. In the revised manuscript we will add a short remark (or subsection) in §3 showing that each of the three definitions recovers the original replicability notion of Impagliazzo et al. (STOC '22) in the appropriate limit: for pointwise replicability as the per-point failure probability tends to zero; for approximate replicability as the measure of the disagreeing set tends to zero; and for semi-replicability as the number of shared unlabeled samples tends to infinity. The argument is by direct substitution into the respective definitions and does not alter any of the existing theorems or proofs. revision: yes
Circularity Check
Minor self-citation to replicability definition; new relaxations and bounds derived independently
full rationale
The paper formally introduces three new relaxations of replicability (pointwise, approximate, and semi) in Section 3 with precise stability conditions that directly enable the PAC guarantees. Sample bounds in Theorems 4.1, 4.3, and 5.2 are obtained via standard uniform convergence combined with a replicability-stabilization overhead, without any reduction to fitted parameters, self-definitions, or ansatzes. The citation to Impagliazzo et al. (STOC '22) supplies the baseline full-replicability notion and known impossibilities rather than serving as a load-bearing justification for the approximate variants; the central claims remain self-contained with independent content from classical learning theory.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Standard agnostic PAC learning setting: i.i.d. samples drawn from an unknown distribution over a domain whose hypothesis class has VC dimension d
invented entities (3)
-
Pointwise replicability
no independent evidence
-
Approximate replicability
no independent evidence
-
Semi-replicability
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We propose three natural relaxations of replicability... pointwise... approximate... semi... Theorems 1.4, 1.6, 1.10 give sample bounds via boosting and hypothesis selection.
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.leanLogicNat recovery unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Lower bounds via reduction to replicable bias estimation on shattered sets (Theorem 1.5, 1.9).
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.