The choice of closeness measure in diffusion reward alignment determines the computational primitives and tractable reward classes, with linear exponential tilts sufficing for KL with convex rewards and proximal oracles for Wasserstein with concave or low-dimensional Lipschitz rewards.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
A generalized Tweedie identity and moment-generating-function representation enable nonparametric recovery of full posteriors for heteroscedastic normal means with unknown variances without specifying a prior.
citing papers explorer
-
Nonparametric f-Modeling for Empirical Bayes Inference with Unequal and Unknown Variances
A generalized Tweedie identity and moment-generating-function representation enable nonparametric recovery of full posteriors for heteroscedastic normal means with unknown variances without specifying a prior.