Quantifying memorization across neural language models

Nicholas Carlini, Daphne Ippolito, Matthew Jagielski, Katherine Lee, Florian Tramer, Chiyuan Zhang · 2023

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

LogitTrace: Detecting Benchmark Contamination via Layerwise Logit Trajectories

cs.CL · 2025-09-25 · unverdicted · novelty 7.0

LogitTrace detects benchmark contamination by showing that contaminated inputs produce earlier stabilization in layerwise logit trajectories while clean inputs show more gradual accumulation.

Data Compressibility Quantifies LLM Memorization

cs.CL · 2025-07-08 · unverdicted · novelty 5.0

Set-level data entropy estimators show linear correlation with LLM memorization scores, forming the Entropy-Memorization Linearity.

