Transcoders Find Interpretable LLM Feature Circuits

Available online:https : / / aclanthology · 2023 · arXiv 2310.20320

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

cs.CL · 2026-07-01 · unverdicted · novelty 2.0

The paper reviews Transformer architecture, emergent LLM capabilities resembling cognition, explainable AI methods, and argues against both anthropomorphism and overly reductive views of LLM behavior as mere memorization.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Understanding Large Language Models cs.CL · 2026-07-01 · unverdicted · none · ref 7
The paper reviews Transformer architecture, emergent LLM capabilities resembling cognition, explainable AI methods, and argues against both anthropomorphism and overly reductive views of LLM behavior as mere memorization.

Transcoders Find Interpretable LLM Feature Circuits

fields

years

verdicts

representative citing papers

citing papers explorer