Noise Scheduling as Information-Guided Allocation in Diffusion Training

Bac Nguyen; Chieh-Hsin Lai; Dejan Stancevic; Gabriel Raya; Georgios Batzolis; Luca Ambrogioni; Naoki Murata; Yuhta Takida; Yuki Mitsufuji

Noise Scheduling as Information-Guided Allocation in Diffusion Training

Not yet reviewed by Pith; the record is open.

Re-run · record.json Download PDF Read on arXiv ↗

This paper has not been read by Pith yet. Machine review is queued; the pith claim, tier, and objections will appear here once it completes.

SPECIMEN: schema-true, not a live event

T0 review · schema-true

One-sentence machine reading of the paper's core claim.

pith:XXXXXXXX · record.json · timestamp

arxiv 2602.18647 v2 pith:I2QLG7LC submitted 2026-02-20 cs.LG cs.AIcs.CVcs.ITmath.IT

Noise Scheduling as Information-Guided Allocation in Diffusion Training

Gabriel Raya , Bac Nguyen , Georgios Batzolis , Yuhta Takida , Dejan Stancevic , Naoki Murata , Chieh-Hsin Lai , Yuki Mitsufuji

show 1 more author

Luca Ambrogioni

This is my paper

classification cs.LG cs.AIcs.CVcs.ITmath.IT

keywords noisetraininginfonoisescheduleallocationdenoisingfixedprofile

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

0 comments

read the original abstract

We introduce InfoNoise, an online adaptive noise schedule for diffusion training that reallocates optimization effort toward noise levels where denoising is most informative. Together with loss weighting, a noise schedule induces an effective allocation across denoising problems, often fixed before informative noise levels are known. InfoNoise makes this allocation data-adaptive by estimating a conditional-entropy-rate profile from denoising losses during training, without auxiliary models or offline search. Through I--MMSE, this profile identifies where noisy observations rapidly reduce uncertainty about the clean sample and guides adaptation of the training noise distribution. It changes only this distribution, keeping the objective, weighting, and parameterization fixed. On image benchmarks, where schedules have been extensively tuned, InfoNoise matches or slightly exceeds strong baselines and can reach the same quality with fewer updates. On representation, sequence, and modality shifts, including DNA and language generation, InfoNoise improves over fixed and adaptive baselines and reaches target quality with up to $3\times$ less training compute. These results establish the conditional-entropy-rate profile as the data-dependent target for noise schedule design and make online adaptation a practical alternative to manual schedule search.

discussion (0)

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

What Does a Discrete Diffusion Model Learn?
cs.LG 2026-07 accept novelty 7.0

The discrete diffusion NELBO equals data entropy plus an exact path KL to the oracle reverse process, and the denoiser, cavity, and score parameterizations are three interconvertible coordinates of the unique optimal ...
Towards Closing the Autoregressive Gap in Language Modeling via Entropy-Gated Continuous Bitstream Diffusion
cs.CL 2026-05 unverdicted novelty 7.0

A 130M-parameter continuous bitstream diffusion model with entropy-gated Langevin sampling achieves GenPPL 59.76 on LM1B and 27.06 on OWT, closing the gap to autoregressive models at matched entropy with 256 NFEs.
NoiseRater: Meta-Learned Noise Valuation for Diffusion Model Training
cs.LG 2026-05 unverdicted novelty 6.0

NoiseRater meta-learns instance-level importance scores for noise in diffusion training via bilevel optimization, then uses a two-stage pipeline to improve efficiency and generation quality on FFHQ and ImageNet.