Post-cutoff decay is not a robust contamination signal because LLM-rephrased questions from the same source documents produce different temporal patterns than original cloze questions.
InProceedings of the 7th BlackboxNLP Workshop: Analyzing and Interpret- ing Neural Networks for NLP, pages 88–104, Miami, Florida, US
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.AI 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Test of Time: Rethinking Temporal Signal of Benchmark Contamination
Post-cutoff decay is not a robust contamination signal because LLM-rephrased questions from the same source documents produce different temporal patterns than original cloze questions.