WikiContradict: A benchmark for evaluating LLMs on real-world knowledge conflicts from Wikipedia

Yufang Hou, Alessandra Pascale, Javier Carnerero-Cano, Tigran Tchrakian, Radu Marinescu, Elizabeth Daly, Inkit Padhi, Prasanna Sattigeri · 2024 · arXiv 2406.13805

1 Pith paper cites this work. Polarity classification is still indexing.



representative citing papers

Detecting Is Not Resolving: The Monitoring Control Gap in Retrieval Augmented LLMs

cs.AI · 2026-05-26 · unverdicted · novelty 6.0

RAG models exhibit a monitoring-control gap: they acknowledge epistemic conflicts in accumulating documents yet fail to constrain unsafe recommendations, with single-turn tests overestimating safety.

citing papers explorer


Detecting Is Not Resolving: The Monitoring Control Gap in Retrieval Augmented LLMs

cs.AI · 2026-05-26 · unverdicted · none · ref 8
