Instance-Adaptive Online Multicalibration

Aaron Roth; Claire Jie Zhang; Jamie Morgenstern; Zhiming Huang

arxiv: 2605.09273 · v2 · pith:JOIDNAFSnew · submitted 2026-05-10 · 💻 cs.LG

Pith reviewed 2026-05-22 09:37 UTC · model grok-4.3

classification 💻 cs.LG

keywords online multicalibration · adaptive algorithms · dyadic grid · threshold complexity · online learning · calibration · piecewise stationary processes

The pith

A single efficient algorithm for online multicalibration adapts its error to the complexity of the predictable mean process via dyadic grid refinement.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces one algorithm that maintains online multicalibration by dynamically refining a dyadic grid over possible prediction values. Its total error is bounded by a function of the number of leaves that appear in the final refinement tree. This bound automatically recovers the known worst-case rate of order T to the two-thirds power, yet improves to square-root rates when the underlying means are stationary or change only a few times. The rate further depends on a threshold-complexity measure of the mean process relative to the given collection of groups. A reader would care because the same procedure works without prior knowledge of whether the data will be hard or easy.

Core claim

There exists a single efficient algorithm whose multicalibration error is controlled by the number of leaves in an adaptively refined dyadic grid, recovering the Õ(T^{2/3}) worst-case rate while automatically achieving Õ(√T) in the marginal stochastic setting and Õ(√(JT)) for piecewise-stationary means with J segments; the dependence on a threshold-complexity measure of the predictable mean process is tight up to logarithmic factors.

What carries the argument

Adaptively refined dyadic grid of prediction values, whose leaf count directly controls the multicalibration error bound.

If this is right

The algorithm matches the known optimal worst-case rate of Õ(T^{2/3}) without any special tuning.
In fully stochastic settings the same procedure automatically improves to Õ(√T).
When the mean process changes only J times the error scales as Õ(√(JT)).
The dependence on threshold complexity is information-theoretically tight up to logarithmic factors.
The algorithm remains efficient even while producing these instance-adaptive guarantees.
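A back-of-envelope calculation shows how the three rates can fit together. It is a heuristic sketch, not the paper's proof, and it rests on two assumptions that this summary does not state: that the error scales like $\sqrt{KT}$ up to logarithmic factors, where $K$ is the number of leaves in the final refinement tree, and that in the hardest case mass is spread evenly over $[0,1]$. The split threshold $N(I) \ge L/w_I^2$ is the one quoted in the Figure 1 caption; $L$ is treated as a constant or logarithmic factor.

```latex
\begin{align*}
\text{even spread:}\quad
  & N(I) \approx T\,w_I,\ \text{refinement stops once } T\,w_I < L/w_I^2
    \;\Rightarrow\; w_I \approx (L/T)^{1/3},\quad K \approx 1/w_I \approx (T/L)^{1/3}, \\
\text{worst case:}\quad
  & K \approx T^{1/3} \;\Rightarrow\; \sqrt{KT} = \sqrt{T^{4/3}} = T^{2/3}, \\
\text{marginal stochastic:}\quad
  & K = O(1) \;\Rightarrow\; \sqrt{KT} = O(\sqrt{T}), \\
\text{piecewise stationary, } J \text{ segments:}\quad
  & K = O(J) \;\Rightarrow\; \sqrt{KT} = O(\sqrt{JT}).
\end{align*}
```

Under these assumptions the leaf count is the only quantity that changes between regimes, which matches the summary's claim that a single bound in terms of leaves covers all three cases.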

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same adaptive-refinement idea could be applied to other online calibration or regret problems where instance difficulty varies.
One could test the method on real data streams that mix stationary periods with occasional shifts to check whether observed error tracks leaf count.
The threshold-complexity measure might serve as a new way to quantify predictability in sequential decision problems.
Extensions to continuous outcome spaces would require replacing the dyadic grid with a different adaptive partition.

Load-bearing premise

The multicalibration error incurred by the algorithm is bounded by a function of the number of leaves that appear in the refinement tree.

What would settle it

A concrete counter-example sequence whose predictable means have low threshold complexity yet produce multicalibration error strictly larger than the claimed bound on the number of leaves.
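Checking a candidate counter-example requires measuring multicalibration error on a concrete sequence. The sketch below uses one common ℓ1-style definition: for each group, sum over predicted values the absolute signed residual, then take the maximum over groups. The paper's exact definition is not given in this summary, so the definition, the function name `multicalibration_error`, and the input layout are all assumptions made for illustration.

```python
from collections import defaultdict


def multicalibration_error(predictions, outcomes, memberships):
    """Max over groups of the summed absolute bias per predicted value.

    predictions[t] : value predicted in round t (assumed to come from a finite grid)
    outcomes[t]    : realized outcome in round t
    memberships[t] : iterable of group ids that contain round t's context
    This is an assumed, illustrative definition, not necessarily the paper's.
    """
    # (group, predicted value) -> signed sum of residuals y_t - p_t
    bias = defaultdict(float)
    for p, y, groups in zip(predictions, outcomes, memberships):
        for g in groups:
            bias[(g, p)] += y - p

    # Aggregate absolute bias across predicted values within each group.
    per_group = defaultdict(float)
    for (g, _value), b in bias.items():
        per_group[g] += abs(b)

    return max(per_group.values(), default=0.0)
```

A counter-example in the sense above would be a sequence on which this quantity (or the paper's own error measure) exceeds the leaf-count bound even though the mean process has low threshold complexity.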

Figures

Figures reproduced from arXiv: 2605.09273 by Aaron Roth, Claire Jie Zhang, Jamie Morgenstern, Zhiming Huang.

**Figure 1.** A typical state of the dynamic-bin data structure. The learner predicts using the midpoints of the current leaves, and an interval is refined only after the total mass assigned to it reaches the threshold $L/w_I^2$. Why this split threshold? The threshold $N(I) \ge L/w_I^2$ is the scale at which further refinement becomes worthwhile. An interval of width $w_I$ contributes deterministic discretization error on the or…
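The dynamic-bin structure described in the Figure 1 caption can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's algorithm: the names `Leaf` and `DyadicBins` are hypothetical, children are assumed to start with zero mass after a split, and the rule for choosing which leaf to predict from is left to the caller because the summary does not describe it. Only the midpoint predictions and the split rule $N(I) \ge L/w_I^2$ come from the caption.

```python
from dataclasses import dataclass


@dataclass
class Leaf:
    lo: float
    hi: float
    mass: float = 0.0  # N(I): total mass assigned to this interval so far

    @property
    def width(self) -> float:
        return self.hi - self.lo

    @property
    def midpoint(self) -> float:
        return (self.lo + self.hi) / 2.0


class DyadicBins:
    """Leaves of a dyadic refinement of [0, 1], kept sorted left to right."""

    def __init__(self, L: float):
        self.L = L  # threshold constant from the caption's rule N(I) >= L / w_I^2
        self.leaves = [Leaf(0.0, 1.0)]

    def predictions(self):
        # The learner predicts with midpoints of the current leaves.
        return [leaf.midpoint for leaf in self.leaves]

    def record(self, index: int, mass: float = 1.0) -> None:
        """Add mass to leaf `index` and split it once the threshold is reached."""
        leaf = self.leaves[index]
        leaf.mass += mass
        if leaf.mass >= self.L / leaf.width ** 2:
            mid = leaf.midpoint
            # Assumption: both children restart with zero mass.
            self.leaves[index:index + 1] = [Leaf(leaf.lo, mid), Leaf(mid, leaf.hi)]
```

Because the threshold grows as $1/w_I^2$, each halving of an interval quadruples the mass needed before the next split, so narrow leaves appear only where predictions concentrate.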

read the original abstract

We study online multicalibration beyond the worst-case. We give a single, efficient algorithm which dynamically interpolates between benign and worst-case sequences by adaptively refining a dyadic grid of prediction values. Its error is controlled by the number of leaves in the refinement tree. Our analysis recovers the known $\widetilde O(T^{2/3})$ worst-case-optimal rate for online multicalibration, while simultaneously automatically adapting to easier instances: in the marginal stochastic setting it obtains a rate of $\widetilde O(\sqrt T)$, and for piecewise-stationary means with $J$ segments its rate is $\widetilde O(\sqrt{JT})$. More generally, the rate depends on a threshold-complexity measure of the predictable mean process relative to the group family. We show that this dependence is tight up to logarithmic factors.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives one algorithm for online multicalibration that adapts its rate to instance complexity through dyadic grid refinement and unifies several regimes under matching bounds.

Hi, the main thing to know is that this paper supplies a single efficient algorithm for online multicalibration whose error is governed by the number of leaves in an adaptively refined dyadic grid. This lets the same procedure recover the known worst-case rate while automatically improving on easier data without any regime-specific tuning. The rates are Õ(T^{2/3}) in the worst case, Õ(√T) under marginal stochastic assumptions, and Õ(√(JT)) for piecewise-stationary means with J segments. The authors also tie the bound to a threshold-complexity measure of the predictable mean process relative to the group family and show that the dependence is tight up to logs. The adaptive refinement is the central new device that makes the interpolation work.

The construction looks clean and the argument is internally consistent, with no obvious circularity in how the leaf count is controlled by the complexity measure. The paper does a good job of organizing previously separate analyses into one mechanism, and the efficiency claim is plausible given the dyadic structure.

On the softer side, the full proofs will need to confirm that the adaptive decisions do not introduce hidden logarithmic overhead or depend on post-hoc information in a way that weakens the rates. The complexity measure itself is presented as an external property of the mean process, which is fine in theory but may be hard to verify or estimate in applications. These are details rather than fatal gaps.

The work is aimed at people who care about sequential fairness, online calibration with groups, or instance-adaptive methods in learning theory. A reader already familiar with the worst-case multicalibration literature will see the most value in how the new rates sit between the known extremes. It deserves a serious referee because the technical idea is coherent, the bounds are sharp, and the unification is useful even if some practical aspects of the complexity measure need more discussion in revision.

Referee Report

0 major / 2 minor

Summary. The paper claims to provide a single efficient algorithm for online multicalibration that uses adaptive dyadic grid refinement to control the error by the number of leaves in the refinement tree. This allows it to recover the Õ(T^{2/3}) worst-case rate while achieving Õ(√T) in marginal stochastic settings and Õ(√(JT)) for piecewise-stationary means with J segments. The rate depends on a threshold-complexity measure of the predictable mean process relative to the group family, and this dependence is shown to be tight up to logarithmic factors.

Significance. If the results hold, this work is significant for introducing a unified adaptive algorithm that automatically interpolates between worst-case and easier regimes in online multicalibration via dyadic refinement and a threshold-complexity measure of the mean process. The matching lower bounds up to logs and the explicit interpolation between Õ(T^{2/3}), Õ(√T), and Õ(√(JT)) regimes strengthen the contribution to instance-adaptive online learning.

minor comments (2)

The abstract states the rates and tightness but provides no derivation steps, error-bar discussion, or explicit handling of post-hoc choices; a one-sentence outline of the proof strategy in the introduction would improve accessibility without altering the technical content.
The threshold-complexity measure is presented as an external property of the mean process; an early informal definition or example in Section 1 would help readers connect it to the group family before the formal analysis.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive assessment of our work and for recommending minor revision. The referee's summary correctly captures the main contributions of the paper, including the single efficient algorithm that uses adaptive dyadic grid refinement to achieve instance-adaptive rates for online multicalibration.

Circularity Check

0 steps flagged

No significant circularity; derivation self-contained

The paper defines an adaptive dyadic refinement algorithm whose multicalibration error is bounded by a function of the number of leaves in the refinement tree. This leaf count is then shown to be controlled by an independently defined threshold-complexity measure of the predictable mean process relative to the given group family. The complexity measure is an external property of the input sequence, not constructed from the algorithm's outputs or fitted parameters. The resulting rates (worst-case Õ(T^{2/3}), stochastic Õ(√T), piecewise-stationary Õ(√(JT))) follow directly from this instance-dependent bound without reducing to self-definition or self-citation chains. Matching lower bounds are provided separately, confirming the analysis does not rely on circular reductions.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no explicit free parameters, axioms, or invented entities are stated. The central claim appears to rest on an implicit modeling assumption that the multicalibration error is governed by leaf count in a dyadic refinement tree and on the existence of a threshold-complexity measure for the mean process.

pith-pipeline@v0.9.0 · 5661 in / 1330 out tokens · 26311 ms · 2026-05-22T09:37:52.359558+00:00 · methodology

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Adaptive Calibration in Non-Stationary Environments
cs.LG 2026-05 unverdicted novelty 7.0

Online algorithms achieve adaptive calibration bounds of Õ(√T + (TC)^{1/3}) for ℓ1 error and Õ((1+C)^{1/3}) for ℓ2 and pseudo-KL error, matching stationary and adversarial extremes via epoch scheduling and non-uniform...

Adaptive Calibration in Non-Stationary Environments
cs.LG 2026-05 unverdicted novelty 7.0

Algorithms achieve adaptive calibration bounds of order min{sqrt(T) + (T C)^{1/3}, sqrt(K T)} for l1 error and min{(1+C)^{1/3}, K} for l2 and pseudo-KL error, where K and C are unknown non-stationarity measures.


Statistical decision functions , author=. Breakthroughs in Statistics: Foundations and Basic Theory , pages=. 1950 , publisher=

work page 1950

[42] [42]

2019 , eprint=

Conformalized Quantile Regression , author=. 2019 , eprint=

work page 2019

[43] [43]

2020 , eprint=

Classification with Valid and Adaptive Coverage , author=. 2020 , eprint=

work page 2020

[44] [44]

2022 , eprint=

Uncertainty Sets for Image Classifiers using Conformal Prediction , author=. 2022 , eprint=

work page 2022

[45] [45]

2025 , eprint=

Conformal Risk Control , author=. 2025 , eprint=

work page 2025

[46] [46]

2023 , eprint=

Safe Planning in Dynamic Environments using Conformal Prediction , author=. 2023 , eprint=

work page 2023

[47] [47]

2024 , eprint=

Conformal Decision Theory: Safe Autonomous Decisions from Imperfect Predictions , author=. 2024 , eprint=

work page 2024

[48] [48]

Lecture Notes , volume=

Uncertain: Modern topics in uncertainty estimation , author=. Lecture Notes , volume=

work page

[49] [49]

2025 , eprint=

Decision Theoretic Foundations for Conformal Prediction: Optimal Uncertainty Quantification for Risk-Averse Agents , author=. 2025 , eprint=

work page 2025

[50] [50]

2024 , eprint=

Calibrated Selective Classification , author=. 2024 , eprint=

work page 2024

[51] [51]

2018 , eprint=

Predict Responsibly: Improving Fairness and Accuracy by Learning to Defer , author=. 2018 , eprint=

work page 2018

[52] [52]

2021 , eprint=

Consistent Estimators for Learning to Defer to an Expert , author=. 2021 , eprint=

work page 2021

[53] [53]

Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems , articleno =

Bansal, Gagan and Wu, Tongshuang and Zhou, Joyce and Fok, Raymond and Nushi, Besmira and Kamar, Ece and Ribeiro, Marco Tulio and Weld, Daniel , title =. Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems , articleno =. 2021 , isbn =. doi:10.1145/3411764.3445717 , abstract =

work page doi:10.1145/3411764.3445717 2021

[54] [54]

Towards Human-AI Complementarity with Prediction Sets , url =

De Toni, Giovanni and Okati, Nastaran and Thejaswi, Suhas and Straitouri, Eleni and Gomez-Rodriguez, Manuel , booktitle =. Towards Human-AI Complementarity with Prediction Sets , url =

work page

[55] [55]

Proceedings of the 40th International Conference on Machine Learning , pages =

Improving Expert Predictions with Conformal Prediction , author =. Proceedings of the 40th International Conference on Machine Learning , pages =. 2023 , editor =

work page 2023

[56] [56]

Controlling Counterfactual Harm in Decision Support Systems Based on Prediction Sets , url =

Straitouri, Eleni and Thejaswi, Suhas and Rodriguez, Manuel Gomez , booktitle =. Controlling Counterfactual Harm in Decision Support Systems Based on Prediction Sets , url =

work page

[57] [57]

2021 , eprint=

Adaptive Conformal Inference Under Distribution Shift , author=. 2021 , eprint=

work page 2021

[58] [58]

2023 , eprint=

Conformal PID Control for Time Series Prediction , author=. 2023 , eprint=

work page 2023

[59] [59]

Journal of Econometrics , volume=

Identification Problems and Decisions under Ambiguity , author=. Journal of Econometrics , volume=

work page

[60] [60]

Econometrica , volume=

Statistical Treatment Rules for Heterogeneous Populations , author=. Econometrica , volume=

work page

[61] [61]

Econometrica , volume=

Admissible Treatment Rules for a Risk-Averse Planner , author=. Econometrica , volume=

work page

[62] [62]

Annual Review of Economics , volume=

Choosing Treatment Policies under Ambiguity , author=. Annual Review of Economics , volume=

work page

[63] [63]

Progress in Artificial Intelligence , volume=

Event labeling combining ensemble detectors and background knowledge , author=. Progress in Artificial Intelligence , volume=. 2014 , publisher=

work page 2014

[64] [64]

Statistics & Probability Letters , volume=

Sparse spatial autoregressions , author=. Statistics & Probability Letters , volume=. 1997 , publisher=

work page 1997

[65] [65]

The Annals of Probability , volume=

Distribution function inequalities for martingales , author=. The Annals of Probability , volume=. 1973 , publisher=

work page 1973

[66] [66]

High-Dimensional Prediction for Sequential Decision Making , author=. Proc. International Conference on Machine Learning (ICML) , year=

work page

[67] [67]

Supersimulators

Supersimulators , author=. arXiv preprint arXiv:2509.17994 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[68] [68]

Breaking the

Dagan, Yuval and Daskalakis, Constantinos and Fishelson, Maxwell and Golowich, Noah and Kleinberg, Robert and Okoroafor, Princewill , booktitle=. Breaking the

work page

[69] [69]

Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society , pages=

Multiaccuracy: Black-box post-processing for fairness in classification , author=. Proceedings of the 2019 AAAI/ACM Conference on AI, Ethics, and Society , pages=

work page 2019

[70] [70]

Advances in Neural Information Processing Systems , volume=

Truthfulness of calibration measures , author=. Advances in Neural Information Processing Systems , volume=

work page

[71] [71]

The Thirty Eighth Annual Conference on Learning Theory , pages=

Truthfulness of Decision-Theoretic Calibration Measures , author=. The Thirty Eighth Annual Conference on Learning Theory , pages=. 2025 , organization=

work page 2025

[72] [72]

A Perfectly Truthful Calibration Measure

A Perfectly Truthful Calibration Measure , author=. arXiv preprint arXiv:2508.13100 , year=

work page internal anchor Pith review Pith/arXiv arXiv

[73] [73]

2019 , publisher=

Probability: Theory and Examples , author=. 2019 , publisher=

work page 2019

[74] [74]

2013 , howpublished=

Brownian Motion and Stochastic Calculus , author=. 2013 , howpublished=

work page 2013

[75] [75]

Conference on Learning Theory , pages=

Low-degree multicalibration , author=. Conference on Learning Theory , pages=. 2022 , organization=

work page 2022

[76] [76]

2016 , eprint =

Denisov, Denis and Sakhanenko, Alexander and Wachtel, Vitali , title =. 2016 , eprint =

work page 2016

[77] [77]

Electronic Journal of Probability , volume=

The First Hitting Time of a Single Point for Random Walks , author=. Electronic Journal of Probability , volume=. 2011 , publisher=

work page 2011

[78] [78]

Stronger calibration lower bounds via sidestepping , author=. Proc. Annual ACM Symposium on Theory of Computing (STOC) , pages=

work page

[79] [79]

Innovations in Theoretical Computer Science Conference (ITCS) , volume=

Advancing Subgroup Fairness via Sleeping Experts , author=. Innovations in Theoretical Computer Science Conference (ITCS) , volume=

work page

[80] [80]

The Twelfth International Conference on Learning Representations , year=

Oracle Efficient Algorithms for Groupwise Regret , author=. The Twelfth International Conference on Learning Representations , year=

work page