In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, pages 3214–3252

TruthfulQA: Measuring how models mimic human falsehoods · 2023

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

A multilingual hallucination benchmark: MultiWikiQHalluA

cs.CL · 2026-05-04 · unverdicted · novelty 6.0

Synthetic multilingual hallucination datasets and classifiers show higher hallucination rates for the 0.6B Qwen3 model (up to 60%) than for larger models, and higher rates for lower-resource languages such as Icelandic.

citing papers explorer

Showing 1 of 1 citing paper.

A multilingual hallucination benchmark: MultiWikiQHalluA · cs.CL · 2026-05-04 · unverdicted · none · ref 11
