Saxe, James L

doi: 10 · 2019 · DOI 10.1073/pnas.1820226116

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

open at publisher browse 5 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

The Identity Trap in EEG Foundation Models: A Diagnostic Audit

cs.LG · 2026-06-04 · unverdicted · novelty 7.0

Subject identity variance dominates frozen representations in three EEG foundation models by 13-89x over null, and erasing the linear subject axis improves label decoding where within-subject label variation exists.

Unifying Dynamical Systems and Graph Theory to Mechanistically Understand Computation in Neural Networks

cs.NE · 2026-05-05 · unverdicted · novelty 7.0 · 2 refs

RNN computation is recovered from multi-hop graph pathways, and constraining these pathways via resolvent regularization yields improved temporal sparsity and task performance over standard L1.

Weight Decay Regimes in Grokking Transformers: Cheap Online Diagnostics

cs.LG · 2026-05-19 · conditional · novelty 6.0

Weight decay controls distinct learning regimes in grokking transformers on modular arithmetic, tracked by new cheap attention-based diagnostics with empirical critical value and exponent fits.

Deep sequence models tend to memorize geometrically; it is unclear why

cs.LG · 2025-10-30 · unverdicted · novelty 6.0

Deep sequence models develop geometric memory in embeddings that encodes novel global relationships, transforming l-fold composition tasks into 1-step navigation via a natural spectral bias connected to Node2Vec.

Attention to task structure for cognitive flexibility

cs.NE · 2026-04-14 · unverdicted · novelty 5.0

Task connectivity in graph-structured multi-task environments enhances generalization and stability, with stronger benefits for attention models than MLPs.

citing papers explorer

Showing 3 of 3 citing papers after filters.

The Identity Trap in EEG Foundation Models: A Diagnostic Audit cs.LG · 2026-06-04 · unverdicted · none · ref 20
Subject identity variance dominates frozen representations in three EEG foundation models by 13-89x over null, and erasing the linear subject axis improves label decoding where within-subject label variation exists.
Weight Decay Regimes in Grokking Transformers: Cheap Online Diagnostics cs.LG · 2026-05-19 · conditional · none · ref 28
Weight decay controls distinct learning regimes in grokking transformers on modular arithmetic, tracked by new cheap attention-based diagnostics with empirical critical value and exponent fits.
Deep sequence models tend to memorize geometrically; it is unclear why cs.LG · 2025-10-30 · unverdicted · none · ref 159
Deep sequence models develop geometric memory in embeddings that encodes novel global relationships, transforming l-fold composition tasks into 1-step navigation via a natural spectral bias connected to Node2Vec.

Saxe, James L

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer