Lightning self-attention coefficients are coordinates on an algebraic variety obeying Chow-type, low-rank, Veronese-type, and Sylvester-resultant invariants.
Are transformers universal approximators of sequence-to-sequence functions? In International Conference on Learning Representations, 2020
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
math.AG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Algebraic Invariants of Lightning Self-Attention
Lightning self-attention coefficients are coordinates on an algebraic variety obeying Chow-type, low-rank, Veronese-type, and Sylvester-resultant invariants.