×-shaped variable-width transformers outperform parameter-matched uniform baselines on language modeling loss with 22% fewer FLOPs and 15% smaller KV cache.
Optimal Degrees of Synaptic Connectivity
4 Pith papers cite this work. Polarity classification is still indexing.
4
Pith papers citing it
citation-role summary
background 1
citation-polarity summary
years
2026 4roles
background 1polarities
background 1representative citing papers
Four axioms (Causality, Minimality, Separability, Stability) are formalized for latent thought representations; audits of open LLMs on 23 tasks show none satisfy all four and representations add little beyond input embeddings.
KLR Hopfield networks reach P/N storage of ~16 for random patterns and ~20 for structured data, with limits set by dynamical instability against noise rather than geometric separability per Cover's theorem.
citing papers explorer
No citing papers match the current filters.