pith:TW2NMZMH
HalluScore: Large Language Model Hallucination Question Answering Benchmark
HalluScore is a curated Arabic QA dataset with 827 questions, ground-truth evidence, and human annotations used to measure hallucination rates across 17 LLMs.
arxiv:2605.17007 v1 · 2026-05-16 · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{TW2NMZMHICZI74RCPKWHQHSDWM}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
We introduce HalluScore, a structured Arabic question answering benchmark designed to evaluate hallucination behavior in LLMs across different levels of reasoning difficulty, various knowledge domains, historical timelines, and culturally grounded Arabic scenarios. It contains 827 carefully curated questions.
The model-driven selection process successfully retains only questions that consistently trigger hallucinations while preserving factual validity and cultural grounding; this premise is stated in the abstract's description of the construction pipeline but lacks independent verification details.
HalluScore is a curated Arabic QA dataset with 827 questions, ground-truth evidence, and human annotations used to measure hallucination rates across 17 LLMs.
References
Receipt and verification
| First computed | 2026-05-20T00:03:35.680979Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
9db4d6658740b28ff2227aac781e43b32ca440b2c613e029041c30f8eeef9b86
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/TW2NMZMHICZI74RCPKWHQHSDWM \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 9db4d6658740b28ff2227aac781e43b32ca440b2c613e029041c30f8eeef9b86
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "d282ecd04265ed3c1cdc4244c43228492da185798c0dbee60c0c8c18612f8d13",
"cross_cats_sorted": [],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CL",
"submitted_at": "2026-05-16T14:08:15Z",
"title_canon_sha256": "8d9db1692352343d3d6137a75d4941442578da79660e749c022489853b683c95"
},
"schema_version": "1.0",
"source": {
"id": "2605.17007",
"kind": "arxiv",
"version": 1
}
}