Data contamination quiz: A tool to detect and estimate contamination in large language models.Trans

Shahriar Golchin, Mihai Surdeanu · 2025 · DOI 10.1162/tacl.a.20

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know?

cs.AI · 2026-05-27 · unverdicted · novelty 7.0

LiveBrowseComp shows search agents rely on intrinsic knowledge on standard benchmarks, with scores dropping 25-40 points and closed-book accuracy below 2% on questions about facts from the prior 90 days.

citing papers explorer

Showing 1 of 1 citing paper.

LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know? cs.AI · 2026-05-27 · unverdicted · none · ref 52
LiveBrowseComp shows search agents rely on intrinsic knowledge on standard benchmarks, with scores dropping 25-40 points and closed-book accuracy below 2% on questions about facts from the prior 90 days.

Data contamination quiz: A tool to detect and estimate contamination in large language models.Trans

fields

years

verdicts

representative citing papers

citing papers explorer