Pacost: Paired confidence significance testing for benchmark contamination detection in large language mod- els,

· 2024 · arXiv 2406.18326

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

cs.CL · 2026-06-06 · unverdicted · novelty 6.0

A masked-token hit-rate comparison method detects pretraining data membership in black-box LLMs with performance comparable to white-box approaches.

Showing 1 of 1 citing paper.

MC-PDD: Masked Corpus-Level Pretraining Data Detection for Black-Box Large Language Models cs.CL · 2026-06-06 · unverdicted · none · ref 13
A masked-token hit-rate comparison method detects pretraining data membership in black-box LLMs with performance comparable to white-box approaches.