G lob E nc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers

Modarressi, Ali, Fayyaz, Mohsen, Yaghoobzadeh, Yadollah, Pilehvar, Mohammad Taher · 2022 · DOI 10.18653/v1/2022.naacl-main.19

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

What LLMs explain is not what they believe: Evaluating explanation sufficiency under models' own input beliefs

cs.LG · 2026-06-26 · unverdicted · novelty 6.0

Proposes SCSuff metric for evaluating LLM explanation sufficiency via model-generated alternative inputs, showing explanations are typically insufficient and predictable from hidden states.

Multi-component Causal Tracing in Large Language Models

cs.LG · 2026-06-02 · unverdicted · novelty 6.0

A unified multi-component causal tracing method that uses soft interventions and a metric transformation to efficiently select critical LLM components for a target performance metric.

citing papers explorer

Showing 2 of 2 citing papers after filters.

What LLMs explain is not what they believe: Evaluating explanation sufficiency under models' own input beliefs cs.LG · 2026-06-26 · unverdicted · none · ref 28
Proposes SCSuff metric for evaluating LLM explanation sufficiency via model-generated alternative inputs, showing explanations are typically insufficient and predictable from hidden states.
Multi-component Causal Tracing in Large Language Models cs.LG · 2026-06-02 · unverdicted · none · ref 38
A unified multi-component causal tracing method that uses soft interventions and a metric transformation to efficiently select critical LLM components for a target performance metric.

G lob E nc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers

fields

years

verdicts

representative citing papers

citing papers explorer