What Regularized Auto-Encoders Learn from the Data Generating Distribution

Guillaume Alain; Yoshua Bengio

arxiv: 1211.4246 · v5 · pith:K24VAC2Znew · submitted 2012-11-18 · 💻 cs.LG · stat.ML

What Regularized Auto-Encoders Learn from the Data Generating Distribution

Guillaume Alain , Yoshua Bengio This is my paper

classification 💻 cs.LG stat.ML

keywords auto-encoderdatafunctionreconstructioncriteriondistributiongeneratingprevious

0 comments

read the original abstract

What do auto-encoders learn about the underlying data generating distribution? Recent work suggests that some auto-encoder variants do a good job of capturing the local manifold structure of data. This paper clarifies some of these previous observations by showing that minimizing a particular form of regularized reconstruction error yields a reconstruction function that locally characterizes the shape of the data generating density. We show that the auto-encoder captures the score (derivative of the log-density with respect to the input). It contradicts previous interpretations of reconstruction error as an energy function. Unlike previous results, the theorems provided here are completely generic and do not depend on the parametrization of the auto-encoder: they show what the auto-encoder would tend to if given enough capacity and examples. These results are for a contractive training criterion we show to be similar to the denoising auto-encoder training criterion with small corruption noise, but with contraction applied on the whole reconstruction function rather than just encoder. Similarly to score matching, one can consider the proposed training criterion as a convenient alternative to maximum likelihood because it does not involve a partition function. Finally, we show how an approximate Metropolis-Hastings MCMC can be setup to recover samples from the estimated distribution, and this is confirmed in sampling experiments.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Unlocking Complex Visual Generation via Closed-Loop Verified Reasoning
cs.CV 2026-05 unverdicted novelty 6.0

CLVR couples verified logical planning with pixel diffusion, uses proxy reinforcement learning on distilled histories, and merges weights to cut inference to 4 NFEs while outperforming open-source T2I models on comple...
Unlocking Complex Visual Generation via Closed-Loop Verified Reasoning
cs.CV 2026-05 unverdicted novelty 6.0

CLVR framework adds closed-loop visual verification, proxy prompt reinforcement learning, and delta-space weight merge to improve complex text-to-image generation over single-step or unverified multi-step baselines.