pith. machine review for the scientific record. sign in

arxiv: 2602.06021 · v2 · submitted 2026-02-05 · 📊 stat.ML · cs.LG· cs.NA· math.NA· math.PR

Recognition: unknown

Diffusion Model's Generalization Can Be Characterized by Inductive Biases toward a Data-Dependent Ridge Manifold

Authors on Pith no claims yet
classification 📊 stat.ML cs.LGcs.NAmath.NAmath.PR
keywords ridgeerrortrainingcomponentcontrolleddatadata-dependentdiffusion
0
0 comments X
read the original abstract

We study a data-dependent notion of diffusion-model generalization: when a model does not memorize the training set, where do its generated samples go relative to the geometry induced by the data? To answer this, we introduce a time-dependent family of log-density ridge manifolds constructed from the smoothed empirical distribution, and use it to characterize reverse-time inference. Our main result shows that generated samples evolve by a reach-align-slide mechanism: they first enter a neighborhood of the ridge, then their distance to the ridge is controlled by the normal component of training error, and finally their motion along the ridge is controlled by the tangential component. We further connect this geometric picture to training dynamics through directional decompositions of the learned error, and make this link explicit for random feature models, where architectural bias and optimization error can be separated quantitatively. Experiments on synthetic multimodal data and MNIST latent diffusion support the predicted geometric behavior in both low and high dimensions.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. The finite expression method for turbulent dynamics with high-order moment recovery

    cs.LG 2026-05 unverdicted novelty 7.0

    A two-stage symbolic regression plus generative model framework recovers governing interaction terms and forcing in stochastic triad models while accurately predicting statistical moments up to order five.

  2. Christoffel-DPS: Optimal sensor placement in diffusion posterior sampling for arbitrary distributions

    cs.LG 2026-05 unverdicted novelty 7.0

    Christoffel-DPS is a distribution-free optimal sensor placement framework for diffusion posterior sampling that provides non-asymptotic recovery bounds and outperforms Gaussian baselines on non-Gaussian benchmarks.