Multidimensional IRT analysis of 34k FCI administrations identifies 22 robust misconception dimensions and computes student/class scores revealing varied post-instruction remediation patterns.
Title resolution pending
3 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 3representative citing papers
LLM graders achieve substantial human agreement on math and science MCAS items but vary on ELA, performing best as sources of formative narrative feedback rather than summative numerical scores.
AI and NLP applied to educational artifacts within the Instructional Core Framework can identify advantages for teacher coaching, student support, and personalized learning.
citing papers explorer
-
Discovering Misconceptions and Misunderstandings From Administrations of Research-Designed Multiple Choice Instruments
Multidimensional IRT analysis of 34k FCI administrations identifies 22 robust misconception dimensions and computes student/class scores revealing varied post-instruction remediation patterns.
-
Creating and Evaluating K-12 GenAI Assessment Graders Through Context Engineering
LLM graders achieve substantial human agreement on math and science MCAS items but vary on ELA, performing best as sources of formative narrative feedback rather than summative numerical scores.