URL https://aclanthology.org/2021.acl-long.292/

2021 · DOI 10.18653/v1/2021.acl-long.292

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Dissociating Decodability and Causal Use in Bracket-Sequence Transformers

cs.CL · 2026-04-24 · unverdicted · novelty 6.0

In Dyck-language transformers, attention patterns causally use top-of-stack information while residual-stream depth and distance signals are decodable yet causally inert.

citing papers explorer

Showing 1 of 1 citing paper.

Dissociating Decodability and Causal Use in Bracket-Sequence Transformers

cs.CL · 2026-04-24 · unverdicted · none · ref 14
