pith. sign in

Implicit Regularization in Deep Learning

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it
abstract

In an attempt to better understand generalization in deep learning, we study several possible explanations. We show that implicit regularization induced by the optimization method is playing a key role in generalization and success of deep learning models. Motivated by this view, we study how different complexity measures can ensure generalization and explain how optimization algorithms can implicitly regularize complexity measures. We empirically investigate the ability of these measures to explain different observed phenomena in deep learning. We further study the invariances in neural networks, suggest complexity measures and optimization algorithms that have similar invariances to those in neural networks and evaluate them on a number of learning tasks.

years

2026 2 2025 1

verdicts

UNVERDICTED 3

clear filters

representative citing papers

Stationary Robust Mean-Field Games under Model Mismatches

cs.LG · 2026-06-21 · unverdicted · novelty 6.0

Develops infinite-horizon stationary robust mean-field games incorporating distributional uncertainty, proves equilibrium existence via fixed-point on contractive Bellman operator, gives convergent algorithm, and derives finite-population approximation bounds under contractive regime.

citing papers explorer

Showing 3 of 3 citing papers after filters.

  • Stationary Robust Mean-Field Games under Model Mismatches cs.LG · 2026-06-21 · unverdicted · none · ref 55 · internal anchor

    Develops infinite-horizon stationary robust mean-field games incorporating distributional uncertainty, proves equilibrium existence via fixed-point on contractive Bellman operator, gives convergent algorithm, and derives finite-population approximation bounds under contractive regime.

  • Reducing Class Bias In Data-Balanced Datasets Through Hardness-Based Resampling cs.LG · 2025-04-09 · unverdicted · none · ref 24 · internal anchor

    Hardness-Based Resampling reduces class recall gaps in balanced datasets by up to 32% on CIFAR-10 and 16% on CIFAR-100 by prioritizing harder samples over random or frequency-based selection.

  • Neural Architectures as Functional Priors in Physics-Informed Control Problems math.NA · 2026-06-11 · unverdicted · none · ref 6 · internal anchor

    Different neural architectures produce qualitatively distinct controls in PINN optimal control for RLC and Duffing systems, with Fourier versions yielding richer oscillations and smoother nets yielding more regular efficient trajectories.