Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL)

When Not to Trust Language Models: Investigating Effectiveness of Parametric, Non-Parametric Memories

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

PARALLAX: Separating Genuine Hallucination Detection from Benchmark Construction Artifacts

cs.CL · 2026-05-16 · unverdicted · novelty 6.0

Benchmark construction artifacts in hallucination detection corpora allow naive text-similarity baselines to achieve near-perfect scores, and controlled evaluations show most methods perform near chance except SAPLMA and the new DRIFT probe.

citing papers explorer

PARALLAX: Separating Genuine Hallucination Detection from Benchmark Construction Artifacts

cs.CL · 2026-05-16 · unverdicted · none · ref 76
