InInternational 9 Conference on Machine Learning, pages 2397–2430

Braun, Dan, Taylor, Jordan, Goldowsky-Dill, Nicholas, Sharkey, Lee , year= · 2024 · arXiv 2405.12241

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Subspace-Aware Sparse Autoencoders for Effective Mechanistic Interpretability

cs.LG · 2026-06-04 · conditional · novelty 7.0

SASA replaces single-vector decoders in SAEs with learned subspaces plus block sparsity and nuclear-norm regularization, proving that a single group becomes the global minimizer once block size meets intrinsic dimension and yielding polynomial rather than exponential sample complexity.

Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining

cs.CL · 2025-09-05 · unverdicted · novelty 6.0

Sparse crosscoders on LLM checkpoint triplets track emergence, maintenance, and discontinuation of linguistic features during pretraining via a new RelIE metric.

citing papers explorer

Showing 2 of 2 citing papers.

Subspace-Aware Sparse Autoencoders for Effective Mechanistic Interpretability cs.LG · 2026-06-04 · conditional · none · ref 44
SASA replaces single-vector decoders in SAEs with learned subspaces plus block sparsity and nuclear-norm regularization, proving that a single group becomes the global minimizer once block size meets intrinsic dimension and yielding polynomial rather than exponential sample complexity.
Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining cs.CL · 2025-09-05 · unverdicted · none · ref 1
Sparse crosscoders on LLM checkpoint triplets track emergence, maintenance, and discontinuation of linguistic features during pretraining via a new RelIE metric.

InInternational 9 Conference on Machine Learning, pages 2397–2430

fields

years

verdicts

representative citing papers

citing papers explorer