Analyzing llm behavior in dialogue sum- marization: Unveiling circumstantial hallucina- tion trends,

· 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

HalluScore: Large Language Model Hallucination Question Answering Benchmark

cs.CL · 2026-05-16 · unverdicted · novelty 7.0

HalluScore is a curated Arabic QA dataset with 827 questions, ground-truth evidence, and human annotations used to measure hallucination rates across 17 LLMs.

citing papers explorer

Showing 1 of 1 citing paper.

HalluScore: Large Language Model Hallucination Question Answering Benchmark cs.CL · 2026-05-16 · unverdicted · none · ref 19
HalluScore is a curated Arabic QA dataset with 827 questions, ground-truth evidence, and human annotations used to measure hallucination rates across 17 LLMs.

Analyzing llm behavior in dialogue sum- marization: Unveiling circumstantial hallucina- tion trends,

fields

years

verdicts

representative citing papers

citing papers explorer