pith. sign in

Approximation by superpositions of a sigmoidal function.Mathematics of control, signals and systems, 2(4):303–314

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

citation-role summary

background 1 other 1

citation-polarity summary

years

2026 7

verdicts

UNVERDICTED 7

polarities

background 1 unclear 1

clear filters

representative citing papers

Any-Dimensional Invariant Universality

cs.LG · 2026-05-22 · unverdicted · novelty 8.0

A systematic approach maps any-dimensional invariant functions to a unique function on an infinite-dimensional limit space admitting a topology with compact sets where universality holds, with examples of non-universal architectures and fixes.

Training Transformers for KV Cache Compressibility

cs.LG · 2026-05-07 · unverdicted · novelty 6.0 · 2 refs

Training transformers with KV sparsification during continued pretraining produces representations that admit better post-hoc KV cache compression, improving quality under memory budgets for long-context tasks.

citing papers explorer

Showing 4 of 4 citing papers after filters.

  • Any-Dimensional Invariant Universality cs.LG · 2026-05-22 · unverdicted · none · ref 30

    A systematic approach maps any-dimensional invariant functions to a unique function on an infinite-dimensional limit space admitting a topology with compact sets where universality holds, with examples of non-universal architectures and fixes.

  • Approximation of Maximally Monotone Operators : A Graph Convergence Perspective cs.LG · 2026-05-12 · unverdicted · none · ref 21

    Any maximally monotone operator can be approximated in local graph convergence by continuous encoder-decoder networks, with structure-preserving versions that retain maximal monotonicity via resolvent parameterizations.

  • Training Transformers for KV Cache Compressibility cs.LG · 2026-05-07 · unverdicted · none · ref 14 · 2 links

    Training transformers with KV sparsification during continued pretraining produces representations that admit better post-hoc KV cache compression, improving quality under memory budgets for long-context tasks.

  • Structural Correspondence and Universal Approximation in Diagonal plus Low-Rank Neural Networks cs.LG · 2026-05-07 · unverdicted · none · ref 14

    Diagonal plus Low-Rank (DLoR) neural networks achieve universal approximation for general activations by additive or multiplicative decompositions of full-rank transformations.