arXiv preprint arXiv:2106.06087, 2021

Matthew Finlayson, Aaron Mueller, Sebastian Gehrmann, Stuart Shieber, Tal Linzen, Yonatan Belinkov · 2021 · arXiv 2106.06087

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Localizing Model Behavior with Path Patching

cs.LG · 2023-04-12 · unverdicted · novelty 8.0

Path patching provides a method to express and quantitatively test hypotheses that neural network behaviors are localized to sets of paths.

Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small

cs.LG · 2022-11-01 · conditional · novelty 8.0 · 2 refs

GPT-2 small solves indirect object identification via a circuit of 26 attention heads organized into seven functional classes discovered through causal interventions.

citing papers explorer

Localizing Model Behavior with Path Patching

cs.LG · 2023-04-12 · unverdicted · none · ref 38

Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small

cs.LG · 2022-11-01 · conditional · none · ref 24 · 2 links
