On Milman’s inequality and random subspaces which escape through a mesh in ℝⁿ

Yehoram Gordon · 1988 · DOI 10.1007/bfb0081737

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

On the Residual Scaling of Looped Transformers: Stability and Transferability

cs.LG · 2026-06-16 · unverdicted · novelty 6.0

Because weight sharing makes the updates correlated, Looped Transformers require residual scaling ε = 1/N rather than the standard 1/sqrt(L); this scaling enables learning-rate transfer independent of the loop count N.

citing papers explorer


On the Residual Scaling of Looped Transformers: Stability and Transferability

cs.LG · 2026-06-16 · unverdicted · none · ref 12
