The choice of closeness measure in diffusion reward alignment determines the computational primitives and tractable reward classes, with linear exponential tilts sufficing for KL with convex rewards and proximal oracles for Wasserstein with concave or low-dimensional Lipschitz rewards.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
years
2026 2verdicts
UNVERDICTED 2representative citing papers
A generalized Tweedie identity and moment-generating-function representation enable nonparametric recovery of full posteriors for heteroscedastic normal means with unknown variances without specifying a prior.
citing papers explorer
-
The tractability landscape of diffusion alignment: regularization, rewards, and computational primitives
The choice of closeness measure in diffusion reward alignment determines the computational primitives and tractable reward classes, with linear exponential tilts sufficing for KL with convex rewards and proximal oracles for Wasserstein with concave or low-dimensional Lipschitz rewards.
-
Nonparametric f-Modeling for Empirical Bayes Inference with Unequal and Unknown Variances
A generalized Tweedie identity and moment-generating-function representation enable nonparametric recovery of full posteriors for heteroscedastic normal means with unknown variances without specifying a prior.