and Maei, Hamid Reza and Precup, Doina and Bhatnagar, Shalabh and Silver, David and Szepesv

Sutton, Richard S

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Beyond Linear Attention: Softmax Transformers Implement In-Context Reinforcement Learning

cs.LG · 2026-05-08 · unverdicted · novelty 8.0 · 2 refs

Softmax Transformers implement in-context RL through equivalence to weighted softmax TD updates, with error decay under contraction and parameters as global minimizers of pretraining loss.

citing papers explorer

Showing 1 of 1 citing paper.

Beyond Linear Attention: Softmax Transformers Implement In-Context Reinforcement Learning cs.LG · 2026-05-08 · unverdicted · none · ref 59 · 2 links
Softmax Transformers implement in-context RL through equivalence to weighted softmax TD updates, with error decay under contraction and parameters as global minimizers of pretraining loss.

and Maei, Hamid Reza and Precup, Doina and Bhatnagar, Shalabh and Silver, David and Szepesv

fields

years

verdicts

representative citing papers

citing papers explorer