pith. sign in

arxiv: 2510.10020 · v4 · pith:3CLMCF7Fnew · submitted 2025-10-11 · 📊 stat.ML · cs.LG· q-bio.BM

Calibrating Generative Models to Distributional Constraints

classification 📊 stat.ML cs.LGq-bio.BM
keywords calibrationconstraintsmodelsconstraintfine-tuninggenerativelossmiscalibration
0
0 comments X
read the original abstract

Generative models frequently suffer miscalibration, wherein statistics of the sampling distribution, such as the fraction of generations in a given class, deviate from desired values. We frame calibration as a constrained optimization problem and seek the closest model in Kullback-Leibler divergence satisfying a calibration constraint. To address the intractability of imposing these constraints exactly, we introduce two surrogate objectives for fine-tuning: (1) the relax loss, which replaces the constraint with a miscalibration penalty, and (2) the reward loss, which converts calibration into a reward fine-tuning problem. We demonstrate that these approaches substantially reduce calibration error across hundreds of simultaneous constraints and models with up to nine billion parameters, spanning applications in protein design, image generation, and language modeling.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. The Reward Was in Your Data All Along: Correcting Flow Matching with Discriminator-Guided RL

    cs.LG 2026-06 unverdicted novelty 7.0

    DRL trains a discriminator on data versus base-model samples in pretrained representation space and uses its logit as reward in KL-regularized RL, cutting guidance-free FID from 9.38 to 2.62 on SiT and similar gains o...

  2. Can We Predict The Human Preference For Text-to-Image Content Prior To Generation And Is It Even Useful To Do So?

    cs.CV 2026-06 unverdicted novelty 3.0

    Exploration of pre-generation prediction of human preference metrics (HPM) from noise seeds in diffusion models to improve output quality with negligible added cost.