pith. sign in

hub

Transformers are rnns: Fast autoregressive transformers with linear attention

11 Pith papers cite this work, alongside 2 external citations. Polarity classification is still indexing.

11 Pith papers citing it
2 external citations · external index

hub tools

citation-role summary

background 2

citation-polarity summary

years

2026 10 2025 1

verdicts

UNVERDICTED 11

roles

background 2

polarities

background 2

representative citing papers

Rotation Equivariant Mamba for Vision Tasks

cs.CV · 2026-03-10 · unverdicted · novelty 8.0

EQ-VMamba adds rotation-equivariant cross-scan and group Mamba blocks to enforce end-to-end rotation equivariance, yielding better rotation robustness, competitive accuracy, and roughly 50% fewer parameters than non-equivariant baselines across classification, segmentation, and super-resolution.

Training Transformers for KV Cache Compressibility

cs.LG · 2026-05-07 · unverdicted · novelty 6.0 · 2 refs

Training transformers with KV sparsification during continued pretraining produces representations that admit better post-hoc KV cache compression, improving quality under memory budgets for long-context tasks.

Axiomatizing Neural Networks via Pursuit of Subspaces

cs.LG · 2026-05-19 · unverdicted · novelty 5.0

Authors introduce the Pursuit of Subspaces (PoS) hypothesis, an axiomatic geometric framework that unifies explanations for representation, computation, and generalization in shallow and deep neural networks.

citing papers explorer

Showing 11 of 11 citing papers.