A DIF-based statistical method identifies items where humans and LLMs show systematic performance differences on chemistry and entrance exams, supporting AI-aware assessment design.
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.HC (1)
Years: 2026 (1)
Verdicts: CONDITIONAL (1)
Representative citing papers
- Assessment Design in the AI Era: A Method for Identifying Items Functioning Differentially for Humans and Chatbots