Journal of Functional Analysis

The Sizes of Compact Subsets of · 1967 · DOI 10.1016/0022-1236(67)90017-1

2 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

Spectrum-Adaptive Generalization Bounds for Trained Deep Transformers

stat.ML · 2026-05-08 · unverdicted · novelty 6.0

Spectrum-adaptive post-hoc generalization bounds for multi-layer Transformers are derived using layerwise Schatten quantities whose indices are chosen after training based on singular-value profiles.

Understanding In-Context Learning for Nonlinear Regression with Transformers: Attention as Featurizer

cs.LG · 2026-05-06 · unverdicted · novelty 6.0

Transformers can be built to act as nonlinear featurizers via attention, supporting in-context regression with proven generalization bounds on synthetic tasks.

citing papers explorer

Spectrum-Adaptive Generalization Bounds for Trained Deep Transformers

stat.ML · 2026-05-08 · unverdicted · none · ref 9

Understanding In-Context Learning for Nonlinear Regression with Transformers: Attention as Featurizer

cs.LG · 2026-05-06 · unverdicted · none · ref 10