Do Localization Methods Actually Localize Memorized Data in LLM s? A Tale of Two Benchmarks

Chang, Ting-Yun, Thomason, Jesse, Jia, Robin · 2024 · DOI 10.18653/v1/2024.naacl-long.176

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning

cs.CL · 2026-07-02 · conditional · novelty 8.0

LACUNA is a new testbed that injects PII into predefined model parameters to benchmark the localization precision of LLM unlearning methods, revealing that SOTA approaches are imprecise despite strong output performance.

Output Vector Editing for Memorization Mitigation in Large Language Models

cs.CL · 2026-06-17 · unverdicted · novelty 7.0

Output vector editing on MLP neurons suppresses memorization in LLMs up to 87.9% on 6831 sequences in OLMo-7B with a 2.7x gap over zero ablation, ensemble covering 96.5%.

Benchmarking Empirical Privacy Protection for Adaptations of Large Language Models

cs.LG · 2026-06-08 · unverdicted · novelty 6.0

Empirical benchmarks show distribution similarity between adaptation and pretraining data increases practical privacy leakage in DP-adapted LLMs at fixed theoretical guarantees, with LoRA providing strongest protection for OOD cases.

citing papers explorer

Showing 2 of 2 citing papers after filters.

LACUNA: A Testbed for Evaluating Localization Precision for LLM Unlearning cs.CL · 2026-07-02 · conditional · none · ref 8
LACUNA is a new testbed that injects PII into predefined model parameters to benchmark the localization precision of LLM unlearning methods, revealing that SOTA approaches are imprecise despite strong output performance.
Output Vector Editing for Memorization Mitigation in Large Language Models cs.CL · 2026-06-17 · unverdicted · none · ref 25
Output vector editing on MLP neurons suppresses memorization in LLMs up to 87.9% on 6831 sequences in OLMo-7B with a 2.7x gap over zero ablation, ensemble covering 96.5%.

Do Localization Methods Actually Localize Memorized Data in LLM s? A Tale of Two Benchmarks

fields

years

verdicts

representative citing papers

citing papers explorer