Understanding Deep Learning (Still) Requires Rethinking Generalization , volume =

ISSN 0001-0782, 1557-7317 · 2013 · DOI 10.1145/3446776

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

open at publisher browse 4 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Estimating Implicit Regularization in Deep Learning

stat.ML · 2026-05-06 · unverdicted · novelty 7.0

Gradient matching empirically recovers implicit regularization effects such as l2 penalties from early stopping and dropout in neural networks.

Overcoming Selection Bias in Statistical Studies With Amortized Bayesian Inference

stat.ML · 2026-04-20 · unverdicted · novelty 6.0

Embedding selection mechanisms into generative simulators enables amortized Bayesian inference to produce debiased, well-calibrated posteriors without tractable likelihoods.

Representation Gap: Explaining the Unreasonable Effectiveness of Neural Networks from a Geometric Perspective

cs.LG · 2026-05-20 · unverdicted · novelty 5.0

Derives an asymptotic equivalent for the Representation Gap in equivariant diffusion models, showing it depends primarily on the intrinsic dimension of the task.

Soft Learning

cs.LG · 2026-05-16 · unverdicted · novelty 5.0

Soft Learning optimally combines heterogeneous ML specialists via cross-validated non-negative least squares, achieving top performance on 70% of 37 datasets with formal guarantees and 72-435x CPU speedups over deep networks.

citing papers explorer

Showing 4 of 4 citing papers.

Estimating Implicit Regularization in Deep Learning stat.ML · 2026-05-06 · unverdicted · none · ref 47
Gradient matching empirically recovers implicit regularization effects such as l2 penalties from early stopping and dropout in neural networks.
Overcoming Selection Bias in Statistical Studies With Amortized Bayesian Inference stat.ML · 2026-04-20 · unverdicted · none · ref 90
Embedding selection mechanisms into generative simulators enables amortized Bayesian inference to produce debiased, well-calibrated posteriors without tractable likelihoods.
Representation Gap: Explaining the Unreasonable Effectiveness of Neural Networks from a Geometric Perspective cs.LG · 2026-05-20 · unverdicted · none · ref 18
Derives an asymptotic equivalent for the Representation Gap in equivariant diffusion models, showing it depends primarily on the intrinsic dimension of the task.
Soft Learning cs.LG · 2026-05-16 · unverdicted · none · ref 4
Soft Learning optimally combines heterogeneous ML specialists via cross-validated non-negative least squares, achieving top performance on 70% of 37 datasets with formal guarantees and 72-435x CPU speedups over deep networks.

Understanding Deep Learning (Still) Requires Rethinking Generalization , volume =

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer