Fixed golden layers for knowledge editing in LLMs can be identified via gradient attribution and generalize across queries and datasets.
First is Not Really Better Than Last: Evaluating Layer Choice and Aggregation Strategies in Language Model Data Influence Estimation
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Golden Layers and Where to Find Them: Improved Knowledge Editing for Large Language Models Via Layer Gradient Analysis
Fixed golden layers for knowledge editing in LLMs can be identified via gradient attribution and generalize across queries and datasets.