Physical Commonsense Reasoning for Lower-Resourced Languages and Dialects: a Study on Basque

· 2026 · cs.CL · arXiv 2602.14812

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Physical commonsense reasoning represents a fundamental capability of human intelligence, enabling individuals to understand their environment, predict future events, and navigate physical spaces. Recent years have witnessed growing interest in reasoning tasks within Natural Language Processing (NLP). However, no prior research has examined the performance of Large Language Models (LLMs) on non-question-answering (non-QA) physical commonsense reasoning tasks in low-resource languages such as Basque. Taking the Italian GITA as a starting point, this paper addresses this gap by presenting BasPhyCo, the first non-QA physical commonsense reasoning dataset for Basque, available in both standard and dialectal variants. We evaluate model performance across three hierarchical levels of commonsense understanding: (1) distinguishing between plausible and implausible narratives (accuracy), (2) identifying the conflicting element that renders a narrative implausible (consistency), and (3) determining the specific physical state that creates the implausibility (verifiability). These tasks were assessed using multiple multilingual LLMs as well as models pretrained specifically for Italian and Basque. Results indicate that, in terms of verifiability, LLMs exhibit limited physical commonsense capabilities in low-resource languages such as Basque, especially when processing dialectal variants.

representative citing papers

Physical Commonsense Reasoning for Lower-Resourced Languages and Dialects: a Study on Basque

cs.CL · 2026-02-16 · conditional · novelty 7.0

BasPhyCo is the first physical commonsense reasoning dataset for Basque and dialects, showing LLMs have limited performance on verifiability tasks especially with dialects.

citing papers explorer

Showing 1 of 1 citing paper.

Physical Commonsense Reasoning for Lower-Resourced Languages and Dialects: a Study on Basque cs.CL · 2026-02-16 · conditional · none · ref 1 · internal anchor
BasPhyCo is the first physical commonsense reasoning dataset for Basque and dialects, showing LLMs have limited performance on verifiability tasks especially with dialects.

Physical Commonsense Reasoning for Lower-Resourced Languages and Dialects: a Study on Basque

fields

years

verdicts

representative citing papers

citing papers explorer