and Ravikumar, Pradeep and Wainwright, Martin J

Negahban, Sahand N · 2012 · DOI 10.1214/12-sts400

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Preference-Based Self-Distillation: Beyond KL Matching via Reward Regularization

cs.LG · 2026-05-06 · unverdicted · novelty 7.0

PBSD derives a reward-reweighted teacher distribution as the analytic optimum of a reward-regularized objective, yielding better stability and performance than KL-based self-distillation on math reasoning and tool-use tasks.

Adaptive Estimation and Optimal Control in Offline Contextual MDPs without Stationarity

stat.ML · 2026-05-05 · unverdicted · novelty 7.0

A T-estimation-based procedure for adaptive density estimation and optimal control in offline contextual MDPs without stationarity, providing oracle risk bounds under two loss functions and finite-sample cost guarantees.

Group-Aware Matrix Estimation and Latent Subspace Recovery

stat.ML · 2026-05-19 · unverdicted · novelty 6.0

GAME is a convex estimator using overlapping nuclear-norm penalties on subgroup submatrices for low-rank matrix completion with known overlapping groups, providing finite-sample guarantees on reconstruction error and subgroup subspace recovery.

citing papers explorer

Showing 3 of 3 citing papers.

Preference-Based Self-Distillation: Beyond KL Matching via Reward Regularization cs.LG · 2026-05-06 · unverdicted · none · ref 12
PBSD derives a reward-reweighted teacher distribution as the analytic optimum of a reward-regularized objective, yielding better stability and performance than KL-based self-distillation on math reasoning and tool-use tasks.
Adaptive Estimation and Optimal Control in Offline Contextual MDPs without Stationarity stat.ML · 2026-05-05 · unverdicted · none · ref 193
A T-estimation-based procedure for adaptive density estimation and optimal control in offline contextual MDPs without stationarity, providing oracle risk bounds under two loss functions and finite-sample cost guarantees.
Group-Aware Matrix Estimation and Latent Subspace Recovery stat.ML · 2026-05-19 · unverdicted · none · ref 73
GAME is a convex estimator using overlapping nuclear-norm penalties on subgroup submatrices for low-rank matrix completion with known overlapping groups, providing finite-sample guarantees on reconstruction error and subgroup subspace recovery.

and Ravikumar, Pradeep and Wainwright, Martin J

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer