Calibeating for general proper losses: A Bregman divergence approach
Pith reviewed 2026-05-20 13:59 UTC · model grok-4.3
The pith
A Bregman divergence framework enables calibeating for a wide family of proper losses including Tsallis losses.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By viewing proper losses through their Bregman divergence representation and using online updating formulas for generalized variance, the authors establish a new regret equality for Be The Regularized Leader that holds for general proper losses; for the family of Tsallis losses this yields U-calibration, meaning logarithmic regret bounds that apply simultaneously to every loss in the family while depending more weakly on dimension.
What carries the argument
Bregman divergence representation of proper losses, used to derive a regret equality for Be The Regularized Leader via generalized-variance online updates.
If this is right
- Calibeating extends to the full family of alpha-Tsallis losses and their unscaled versions that recover log loss.
- Logarithmic regret is obtained simultaneously for every loss in the family.
- Dimension dependence of the regret bound is weaker than in previous results.
- The same framework covers Lipschitz proper losses.
- The new regret equality applies to any proper loss whose Bregman representation admits the generalized-variance updates.
Where Pith is reading between the lines
- The same regret-equality technique might be applied to other parametric families of proper losses beyond Tsallis and Lipschitz.
- Algorithms built on this Bregman view could reduce the need for loss-specific tuning in online calibrated prediction.
- The generalized-variance perspective may connect to variance-based analyses in other online-learning settings.
- Practitioners could substitute one loss from the family for another while retaining the same logarithmic-regret guarantee.
Load-bearing premise
The Bregman divergence representation of proper losses together with the online updating formulas for generalized variance suffice to make the new regret equality for Be The Regularized Leader hold for the considered family.
What would settle it
A direct calculation or numerical check showing that the claimed regret equality for Be The Regularized Leader fails to hold for some alpha-Tsallis loss with alpha in [1,2].
read the original abstract
This work introduces a general framework for calibeating based on regret minimization. As compared to Foster and Hart's seminal calibeating work which had specialized treatments of Brier score (squared loss) and log loss, we consider a large family of proper losses that includes $\alpha$-Tsallis losses (for $\alpha \in [1, 2]$) and Lipschitz losses. Our results for Tsallis losses also hold for an unscaled version of Tsallis loss that recovers log loss. Our analysis is oriented around the Bregman divergence view of a proper loss. Technically, our results for the family of Tsallis losses that we consider are U-calibration results, simultaneously obtaining logarithmic regret for all losses in this family while having a weaker dependence on the dimension compared to previous results. Of potential independent interest, we also show a new regret equality for the regret of Be The Regularized Leader. This regret equality holds for general proper losses and itself is based on two results related to online updating formulas for the generalized variance, the latter being a previously introduced generalization of variance based on Bregman divergences.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces a Bregman divergence framework for calibeating with general proper losses, including the family of α-Tsallis losses for α ∈ [1,2] (and the unscaled version recovering log loss) as well as Lipschitz losses. It claims U-calibration results that simultaneously achieve logarithmic regret across the family with weaker dimension dependence than prior work, supported by a new regret equality for Be The Regularized Leader derived from online updating formulas for generalized variance.
Significance. If the central regret equality holds exactly (without dimension- or α-dependent residuals in the variance updates), the results would meaningfully generalize Foster and Hart's calibeating to a larger class of proper losses while improving dimension scaling; the regret equality itself may be of independent interest for online learning with Bregman divergences.
major comments (2)
- [derivation of the regret equality for BTRL] The new regret equality for BTRL (presented as holding for general proper losses via the Bregman view and generalized variance updates) is load-bearing for both the U-calibration claim and the improved dimension dependence. The derivation must explicitly verify closure of the variance update formulas under the specific Bregman divergences for Tsallis losses with α ∈ [1,2], including the unscaled log-loss case; any residual terms would undermine the uniformity and dimension improvement.
- [U-calibration results for Tsallis losses] The U-calibration results for the Tsallis family (stated to obtain logarithmic regret simultaneously with weaker dimension dependence) require an explicit comparison to prior dimension bounds; without this, it is unclear whether the improvement is uniform across α or holds only for specific parameter choices.
minor comments (2)
- [preliminaries on Bregman divergences] Define the generalized variance and its online update rule at the first point of use, with a clear statement of how it reduces to ordinary variance for squared loss.
- [results for Lipschitz losses] Clarify whether the Lipschitz losses results are U-calibration or only regret bounds, and state any additional assumptions needed beyond properness.
Simulated Author's Rebuttal
We thank the referee for their thoughtful and constructive review of our manuscript. We appreciate the recognition of the potential value of our Bregman divergence framework for general proper losses and the new regret equality for Be The Regularized Leader. We address each major comment below, providing clarifications and committing to revisions that strengthen the exposition without altering the core claims.
read point-by-point responses
-
Referee: [derivation of the regret equality for BTRL] The new regret equality for BTRL (presented as holding for general proper losses via the Bregman view and generalized variance updates) is load-bearing for both the U-calibration claim and the improved dimension dependence. The derivation must explicitly verify closure of the variance update formulas under the specific Bregman divergences for Tsallis losses with α ∈ [1,2], including the unscaled log-loss case; any residual terms would undermine the uniformity and dimension improvement.
Authors: We agree that explicit verification of closure strengthens the presentation. The manuscript derives the regret equality for general proper losses using the Bregman divergence view and the online updating formulas for generalized variance; these formulas are shown to close exactly (with no residuals) for the class of losses under consideration. For α-Tsallis losses with α ∈ [1,2], the associated Bregman divergences ensure the variance updates remain closed within the family, including the unscaled case that recovers log loss, without introducing α- or dimension-dependent residuals. This supports both the U-calibration and the improved dimension scaling. To address the referee's request directly, we will add an explicit verification subsection (or appendix paragraph) detailing the closure properties for these specific divergences. revision: yes
-
Referee: [U-calibration results for Tsallis losses] The U-calibration results for the Tsallis family (stated to obtain logarithmic regret simultaneously with weaker dimension dependence) require an explicit comparison to prior dimension bounds; without this, it is unclear whether the improvement is uniform across α or holds only for specific parameter choices.
Authors: We thank the referee for this suggestion to enhance clarity. Our results establish U-calibration for the Tsallis family (α ∈ [1,2], including unscaled log loss) with logarithmic regret and weaker dimension dependence than prior specialized analyses, thanks to the unified Bregman framework. The improvement is uniform across the parameter range because the approach avoids α-dependent factors present in earlier work. We will incorporate an explicit comparison—such as a table or paragraph in the introduction or results section—contrasting our dimension bounds with those from previous calibeating results for specific losses. revision: yes
Circularity Check
Derivation is self-contained; no load-bearing reductions to inputs or self-citations.
full rationale
The paper derives a new regret equality for Be The Regularized Leader from Bregman divergence representations of proper losses and online updating formulas for generalized variance, presenting both as technical contributions that enable U-calibration results for the Tsallis family with logarithmic regret and improved dimension dependence. No equations or claims reduce the central equality to a fitted parameter renamed as prediction, a self-definition, or an unverified self-citation chain; the generalized variance is referenced as previously introduced but the equality is claimed as newly shown for general proper losses. The analysis remains independent of the target U-calibration guarantee and does not collapse by construction.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Proper losses admit a Bregman divergence representation that supports the regret analysis
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel echoes?
echoesECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.
new regret equality for the regret of Be The Regularized Leader... based on two results related to online updating formulas for the generalized variance... Bregman divergences
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Journal of Machine Learning Research , volume=
Information, Divergence and Risk for Binary Experiments , author=. Journal of Machine Learning Research , volume=
-
[2]
Journal of the American Statistical Association , volume=
Elicitation of personal probabilities and expectations , author=. Journal of the American Statistical Association , volume=. 1971 , publisher=
work page 1971
-
[3]
Journal of the American statistical Association , volume=
Strictly proper scoring rules, prediction, and estimation , author=. Journal of the American statistical Association , volume=. 2007 , publisher=
work page 2007
-
[4]
The annals of statistics , volume=
A general method for comparing probability assessors , author=. The annals of statistics , volume=. 1989 , publisher=
work page 1989
-
[5]
Admissible probability measurement procedures , author=. Psychometrika , volume=. 1966 , publisher=
work page 1966
-
[6]
Proceedings of the National Academy of Sciences , volume=
Measures of the value of information , author=. Proceedings of the National Academy of Sciences , volume=
-
[7]
Monthly weather review , volume=
Verification of forecasts expressed in terms of probability , author=. Monthly weather review , volume=. 1950 , publisher=
work page 1950
-
[8]
Four Essays on Robustification of Portfolio Models , school =
Schanbacher, Peter , year=. Four Essays on Robustification of Portfolio Models , school =
-
[9]
Convergence analysis of a proximal-like minimization algorithm using
Chen, Gong and Teboulle, Marc , journal=. Convergence analysis of a proximal-like minimization algorithm using. 1993 , publisher=
work page 1993
-
[10]
Advances in Neural Information Processing Systems , volume=
Online minimax multiobjective optimization: Multicalibeating and other applications , author=. Advances in Neural Information Processing Systems , volume=
-
[11]
Foster, Dean P and Hart, Sergiu , journal=. ``. 2023 , publisher=
work page 2023
-
[12]
Note on a method for calculating corrected sums of squares and products , author=. Technometrics , volume=. 1962 , publisher=
work page 1962
-
[13]
Annals of the Institute of Statistical Mathematics , volume=
The geometry of proper scoring rules , author=. Annals of the Institute of Statistical Mathematics , volume=. 2007 , publisher=
work page 2007
-
[14]
Robust confidence distributions from proper scoring rules , author=. Statistics , volume=. 2022 , publisher=
work page 2022
-
[15]
A Modern Introduction to Online Learning
A modern introduction to online learning , author=. arXiv preprint arXiv:1912.13213 , year=
work page internal anchor Pith review Pith/arXiv arXiv 1912
- [16]
-
[17]
The Thirty Sixth Annual Conference on Learning Theory , pages=
U-calibration: Forecasting for an unknown agent , author=. The Thirty Sixth Annual Conference on Learning Theory , pages=. 2023 , organization=
work page 2023
-
[18]
Luo, Haipeng and Senapati, Spandan and Sharan, Vatsal , booktitle =. Optimal Multiclass
-
[19]
Understanding the bias-variance tradeoff of
Ben Adlam and Neha Gupta and Zelda Mariet and Jamie Smith , year=. Understanding the bias-variance tradeoff of. 2202.04167 , archivePrefix=
-
[20]
Transactions on Machine Learning Research , issn=
Ensembles of Classifiers: a Bias-Variance Perspective , author=. Transactions on Machine Learning Research , issn=. 2022 , url=
work page 2022
-
[21]
A generalized bias-variance decomposition for
Pfau, David , journal=. A generalized bias-variance decomposition for. 2025 , note=
work page 2025
-
[22]
arXiv preprint arXiv:2501.18581 , year=
Bias-variance decompositions: The exclusive privilege of Bregman divergences , author=. arXiv preprint arXiv:2501.18581 , year=
-
[23]
Working draft, November , volume=
Loss functions for binary class probability estimation and classification: Structure and applications , author=. Working draft, November , volume=
-
[24]
The Thirty Seventh Annual Conference on Learning Theory , pages=
Online structured prediction with Fenchel--Young losses and improved surrogate regret for online multiclass classification with logistic loss , author=. The Thirty Seventh Annual Conference on Learning Theory , pages=. 2024 , organization=
work page 2024
-
[25]
The Thirty Sixth Annual Conference on Learning Theory , pages=
Proper losses, moduli of convexity, and surrogate regret bounds , author=. The Thirty Sixth Annual Conference on Learning Theory , pages=. 2023 , organization=
work page 2023
-
[26]
Journal of Mathematical Physics , volume=
Fundamental properties of Tsallis relative entropy , author=. Journal of Mathematical Physics , volume=. 2004 , publisher=
work page 2004
-
[27]
A. P. Dawid , title =. Journal of the American Statistical Association , volume =. 1982 , publisher =
work page 1982
-
[28]
Scandinavian Journal of Statistics , volume=
Minimum scoring rule inference , author=. Scandinavian Journal of Statistics , volume=. 2016 , publisher=
work page 2016
-
[29]
The Annals of Statistics , number =
Peter D Gr. The Annals of Statistics , number =. 2004 , doi =
work page 2004
-
[30]
The fundamental nature of the log loss function , author=. Fields of logic and computation II: Essays dedicated To Yuri Gurevich on the Occasion of His 75th Birthday , pages=. 2015 , publisher=
work page 2015
-
[31]
Reliability, sufficiency, and the decomposition of proper scores , author=. Quarterly Journal of the Royal Meteorological Society: A journal of the atmospheric sciences, applied meteorology and physical oceanography , volume=. 2009 , publisher=
work page 2009
-
[32]
Journal of the Royal Statistical Society: Series D (The Statistician) , volume=
The comparison and evaluation of forecasters , author=. Journal of the Royal Statistical Society: Series D (The Statistician) , volume=. 1983 , publisher=
work page 1983
-
[33]
Asymptotic calibration , author=. Biometrika , volume=. 1998 , publisher=
work page 1998
-
[34]
Proceedings of the AAAI Conference on Artificial Intelligence , author=
Estimating Uncertainty Online Against an Adversary , volume=. Proceedings of the AAAI Conference on Artificial Intelligence , author=. 2017 , month=. doi:10.1609/aaai.v31i1.10949 , number=
-
[35]
Transactions on Machine Learning Research , issn=
Calibrated Probabilistic Forecasts for Arbitrary Sequences , author=. Transactions on Machine Learning Research , issn=. 2025 , url=
work page 2025
-
[36]
Platt, John C. , biburl =. Probabilistic Outputs for Support Vector Machines and Comparisons to Regularized Likelihood Methods , username =. Advances in Large Margin Classifiers , citeseerurl =
-
[37]
International Conference on Machine Learning , pages=
Online Platt scaling with calibeating , author=. International Conference on Machine Learning , pages=. 2023 , organization=
work page 2023
-
[38]
IEEE Transactions on Information Theory , volume=
Bregman divergence bounds and universality properties of the logarithmic loss , author=. IEEE Transactions on Information Theory , volume=. 2019 , publisher=
work page 2019
-
[39]
IEEE Transactions on Information Theory , volume=
Universal prediction , author=. IEEE Transactions on Information Theory , volume=. 1998 , publisher=
work page 1998
-
[40]
Conference on Learning Theory , number=
Competing in the dark: An efficient algorithm for bandit linear optimization , author=. Conference on Learning Theory , number=
-
[41]
USSR computational mathematics and mathematical physics , volume=
The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programming , author=. USSR computational mathematics and mathematical physics , volume=. 1967 , publisher=
work page 1967
-
[42]
Contributions to the Theory of Games , volume=
Approximation to Bayes risk in repeated play , author=. Contributions to the Theory of Games , volume=
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.