pith:FERP7UN3
CommonWhy: A Dataset for Evaluating Entity-Based Causal Commonsense Reasoning in Large Language Models
CommonWhy introduces 15,000 why questions that test whether LLMs can combine specific entity facts with causal commonsense inference
arxiv:2605.12918 v1 · 2026-05-13 · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{FERP7UN3IEHUJRI2Y3JU3QG6UR}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Experiments with state-of-the-art LLMs and LLM-based KGQA methods reveal their significant shortcomings, including frequent factual hallucinations and failures in causal reasoning.
The questions in CommonWhy require genuine integration of entity facts with causal commonsense reasoning rather than being solvable through superficial patterns learned during training.
CommonWhy is a new dataset of 15,000 why-questions for evaluating LLMs on entity-based causal commonsense reasoning grounded in Wikidata.
References
Receipt and verification
| First computed | 2026-05-18T03:09:10.302646Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
2922ffd1bb410f44c51ac6d34dc0dea4423380b83f7a65ca7895e1f8f0b93256
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/FERP7UN3IEHUJRI2Y3JU3QG6UR \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 2922ffd1bb410f44c51ac6d34dc0dea4423380b83f7a65ca7895e1f8f0b93256
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "9592c8faa0e8b886c12b3b87214f6e419c5f622aa759dae31d6810543695b969",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CL",
"submitted_at": "2026-05-13T02:47:21Z",
"title_canon_sha256": "3abb3e7e165af69dcb2ed63b40b440f667de36b49ab7f513f7642a9b5477c766"
},
"schema_version": "1.0",
"source": {
"id": "2605.12918",
"kind": "arxiv",
"version": 1
}
}