Planning in a recurrent neural network that plays Sokoban

Mohammad Taufeeque, Philip Quirke, Maximilian Li, Chris Cundy, Aaron David Tucker, Adam Gleave, Adri `a Garriga-Alonso · 2024 · arXiv 2407.15421

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

representative citing papers

The Shape of Addition: Geometric Structures of Arithmetic in Large Language Models

cs.LG · 2026-05-29 · unverdicted · novelty 7.0

LLM residual streams during addition form an Iso-Raw-Sum Trajectory anchored by digit semantics and modulated by continuous carry signals, with errors arising as geometric slippages across quantization thresholds in a noisy model.

Structure and Scale in Simplicial Sequence Modelling

cs.LG · 2026-05-31 · unverdicted · novelty 5.0

Small transformers on HMM prediction tasks exhibit correlated scaling between performance and linear encoding of belief distributions in residual activations.

citing papers explorer

Showing 2 of 2 citing papers after filters.

The Shape of Addition: Geometric Structures of Arithmetic in Large Language Models cs.LG · 2026-05-29 · unverdicted · none · ref 36
LLM residual streams during addition form an Iso-Raw-Sum Trajectory anchored by digit semantics and modulated by continuous carry signals, with errors arising as geometric slippages across quantization thresholds in a noisy model.
Structure and Scale in Simplicial Sequence Modelling cs.LG · 2026-05-31 · unverdicted · none · ref 62
Small transformers on HMM prediction tasks exhibit correlated scaling between performance and linear encoding of belief distributions in residual activations.

Planning in a recurrent neural network that plays Sokoban

fields

years

verdicts

representative citing papers

citing papers explorer