The definition should characterize estimation as a method

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.LG 1

years

2026 1

verdicts

CONDITIONAL 1

representative citing papers

Do Sparse Autoencoders Identify Reasoning Features in Language Models?

cs.LG · 2026-01-09 · conditional · novelty 5.0

Sparse autoencoders frequently capture low-dimensional correlates that co-occur with reasoning rather than core reasoning mechanisms, as demonstrated by token-level interventions and falsification across 22 model-layer-dataset configurations.

citing papers explorer

Showing 1 of 1 citing paper.

  • Do Sparse Autoencoders Identify Reasoning Features in Language Models? cs.LG · 2026-01-09 · conditional · none · ref 59
