A unified approach to interpreting model predictions.Advances in neural informa- tion processing systems, 30, 2017

Scott M Lundberg, Su-In Lee · 2017

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

From Actions to Understanding: Conformal Interpretability of Temporal Concepts in LLM Agents

cs.AI · 2026-03-27 · unverdicted · novelty 5.0

A conformal interpretability method labels LLM agent states step-by-step and extracts linearly separable temporal concept directions aligned with task success on ScienceWorld and AlfWorld.

citing papers explorer

Showing 1 of 1 citing paper.

From Actions to Understanding: Conformal Interpretability of Temporal Concepts in LLM Agents cs.AI · 2026-03-27 · unverdicted · none · ref 24
A conformal interpretability method labels LLM agent states step-by-step and extracts linearly separable temporal concept directions aligned with task success on ScienceWorld and AlfWorld.

A unified approach to interpreting model predictions.Advances in neural informa- tion processing systems, 30, 2017

fields

years

verdicts

representative citing papers

citing papers explorer