Learning multiple layers of features from tiny images

Krizhevsky, A · 2009

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

dataset: 1

citation-polarity summary

use dataset: 1

representative citing papers

Is Monotonic Sampling Necessary in Diffusion Models?

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Non-monotonic sampling schedules never improve upon monotonic baselines in diffusion models, with performance gaps ranging from substantial to negligible depending on the denoiser.

A Testable Certificate for Constant Collapse in Teacher-Guided VAEs

cs.LG · 2026-05-07 · unverdicted · novelty 7.0

For any fixed nonconstant teacher T, the best constant student has alignment cost exactly equal to the teacher mutual information I_T(X;T); a latent-only witness below this threshold with margin cannot be constant.

MoCo-EA: Exploiting Adversarial Mode Connectivity for Efficient Evolutionary Attacks

cs.CR · 2026-05-18 · unverdicted · novelty 6.0

MoCo-EA uses optimized Bézier curves for crossover in evolutionary adversarial attacks by exploiting mode connectivity of successful perturbations.

Text-Conditional JEPA for Learning Semantically Rich Visual Representations

cs.LG · 2026-05-05 · unverdicted · novelty 6.0

TC-JEPA conditions masked feature prediction on text captions via sparse cross-attention to produce more semantically rich visual representations and outperforms contrastive methods on fine-grained tasks.

Attribution-Based Neuron Utility for Plasticity Restoration in Deep Networks

cs.LG · 2026-05-07 · unverdicted · novelty 5.0

GXD estimates the first-order functional cost of replacing a neuron via gradient attribution to make adaptive resets more reliable for preserving plasticity in continual learning.

citing papers explorer

Showing 5 of 5 citing papers.

Is Monotonic Sampling Necessary in Diffusion Models?

cs.LG · 2026-05-12 · unverdicted · none · ref 69

A Testable Certificate for Constant Collapse in Teacher-Guided VAEs

cs.LG · 2026-05-07 · unverdicted · none · ref 8

MoCo-EA: Exploiting Adversarial Mode Connectivity for Efficient Evolutionary Attacks

cs.CR · 2026-05-18 · unverdicted · none · ref 21

Text-Conditional JEPA for Learning Semantically Rich Visual Representations

cs.LG · 2026-05-05 · unverdicted · none · ref 31

Attribution-Based Neuron Utility for Plasticity Restoration in Deep Networks

cs.LG · 2026-05-07 · unverdicted · none · ref 9
