Journal of machine learning research , volume=

Stability, generalization , author=

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

browse 5 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Structure from Strategic Interaction & Uncertainty: Risk Sensitive Games for Robust Preference Learning

cs.GT · 2026-05-11 · unverdicted · novelty 6.0 · 2 refs

Risk-sensitive preference games using convex risk measures produce policies that are robust across data strata and match or exceed standard Nash learning performance without added cost.

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

cs.LG · 2024-01-02 · unverdicted · novelty 6.0

SPIN lets weak LLMs become strong by self-generating training data from previous model versions and training to prefer human-annotated responses over its own outputs, outperforming DPO even with extra GPT-4 data on benchmarks.

A Survey on Data-Dependent Worst-Case Generalization Bounds

stat.ML · 2026-05-13 · unverdicted · novelty 4.0

The survey unifies extensions of PAC-Bayesian theory to data-dependent sets, geometric and topological complexity measures of optimization trajectories, and stability replacements for information terms into one template inequality with comparative evaluation.

A Unified Theory of Conditional Coverage in Conformal Prediction with Applications

stat.ME · 2026-05-12

Statistical Consistency and Generalization of Contrastive Representation Learning

cs.LG · 2026-05-04 · 2 refs

citing papers explorer

Showing 5 of 5 citing papers.

Structure from Strategic Interaction & Uncertainty: Risk Sensitive Games for Robust Preference Learning cs.GT · 2026-05-11 · unverdicted · none · ref 1 · 2 links
Risk-sensitive preference games using convex risk measures produce policies that are robust across data strata and match or exceed standard Nash learning performance without added cost.
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models cs.LG · 2024-01-02 · unverdicted · none · ref 118
SPIN lets weak LLMs become strong by self-generating training data from previous model versions and training to prefer human-annotated responses over its own outputs, outperforming DPO even with extra GPT-4 data on benchmarks.
A Survey on Data-Dependent Worst-Case Generalization Bounds stat.ML · 2026-05-13 · unverdicted · none · ref 3
The survey unifies extensions of PAC-Bayesian theory to data-dependent sets, geometric and topological complexity measures of optimization trajectories, and stability replacements for information terms into one template inequality with comparative evaluation.
A Unified Theory of Conditional Coverage in Conformal Prediction with Applications stat.ME · 2026-05-12 · unreviewed · ref 44
Statistical Consistency and Generalization of Contrastive Representation Learning cs.LG · 2026-05-04 · unreviewed · ref 28 · 2 links

Journal of machine learning research , volume=

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer