Learning multiple layers of features from tiny images

Alex Krizhevsky, Geoffrey Hinton · 2009

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

browse 6 citing papers

citation-role summary

dataset 2

citation-polarity summary

use dataset 2

representative citing papers

Deep Learning as Neural Low-Degree Filtering: A Spectral Theory of Hierarchical Feature Learning

cs.LG · 2026-05-13 · unverdicted · novelty 8.0

Neural LoFi models deep learning as layer-wise spectral filtering that selects maximal low-degree correlations, yielding a tractable surrogate for hierarchical representation learning beyond the lazy regime.

Provable Robustness against Backdoor Attacks via the Primal-Dual Perspective on Differential Privacy

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

A new framework is introduced for end-to-end provable robustness against backdoor attacks by composing randomized smoothing with differentially private training via privacy profiles.

When Accuracy Is Not Enough: Uncertainty Collapse between Noisy Label Learning and Out-of-Distribution Detection

cs.LG · 2026-05-18 · unverdicted · novelty 6.0

High accuracy in noisy-label learning does not guarantee OOD detection reliability due to uncertainty collapse, and Virtual Margin Regularization offers partial mitigation.

Bayesian Model Merging

cs.LG · 2026-05-13 · unverdicted · novelty 6.0

Bayesian Model Merging introduces a bi-level optimization framework that merges task-specific models via closed-form Bayesian regression with an anchor prior and global hyperparameter search, outperforming baselines and nearly matching expert averages on up to 20-task vision and 5-task language Merg

Why Zeroth-Order Adaptation May Forget Less: A Randomized Shaping Theory

cs.LG · 2026-05-11 · unverdicted · novelty 5.0

Norm-matched zeroth-order adaptation preserves the isotropic retention floor while contracting only the anisotropic component, producing a quadratic forgetting gap that favors ZO precisely when the first-order direction has above-average retention curvature.

Further advantages of data augmentation on convolutional neural networks

cs.CV · 2019-06-26 · unverdicted · novelty 4.0

Data augmentation enables CNNs to adapt to varying architectures and data amounts without hyperparameter fine-tuning, unlike weight decay and dropout.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Deep Learning as Neural Low-Degree Filtering: A Spectral Theory of Hierarchical Feature Learning cs.LG · 2026-05-13 · unverdicted · none · ref 27
Neural LoFi models deep learning as layer-wise spectral filtering that selects maximal low-degree correlations, yielding a tractable surrogate for hierarchical representation learning beyond the lazy regime.
Why Zeroth-Order Adaptation May Forget Less: A Randomized Shaping Theory cs.LG · 2026-05-11 · unverdicted · none · ref 15
Norm-matched zeroth-order adaptation preserves the isotropic retention floor while contracting only the anisotropic component, producing a quadratic forgetting gap that favors ZO precisely when the first-order direction has above-average retention curvature.

Learning multiple layers of features from tiny images

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer