Geometry of optimization and implicit regularization in deep learning

3 Pith papers cite this work.

abstract

We argue that optimization plays a crucial role in the generalization of deep learning models through implicit regularization. We do this by demonstrating that generalization ability is controlled not by network size but by some other implicit control. We then demonstrate how changing the empirical optimization procedure can improve generalization, even if actual optimization quality is not affected. We do so by studying the geometry of the parameter space of deep networks and devising an optimization algorithm attuned to this geometry.

fields

cs.LG 3

years

2026 3

representative citing papers

Convergence of Continual Learning in Homogeneous Deep Networks

cs.LG · 2026-06-29 · unverdicted · novelty 6.0

Continual classification in homogeneous models amounts to sequential projections onto margin sets, with local linear convergence under regularity properties for random and cyclic tasks; the analysis is extended to regression.
