A randomised subspace Gauss-Newton method for nonlinear least-squares. arXiv preprint arXiv:2211.05727

Coralia Cartis, Jaroslav Fowkes, Zhen Shao · arXiv 2211.05727

1 Pith paper cites this work. Polarity classification is still being indexed.

1 Pith paper citing it

representative citing papers

On the Convergence Behavior of Preconditioned Gradient Descent Toward the Rich Learning Regime

cs.LG · 2026-01-06 · unverdicted · novelty 5.0

Preconditioned gradient descent mitigates spectral bias and reduces grokking delays by enabling uniform parameter space exploration in the NTK regime, confirming grokking as a transition to the rich regime.
