pith. machine review for the scientific record. sign in

← back to paper

Review history

arxiv: 2605.00238 · 2 revisions

Estimating LLM Grading Ability and Response Difficulty in Automatic Short Answer Grading via Item Response Theory

  1. 2026-05-14 UNVERDICTED LOW v0.9.0 novelty 7.0
    40457 ms 5552 in 1229 out 2026-05-14T20:51:07.973637+00:00
  2. 2026-05-09 UNVERDICTED MODERATE v0.9.0 novelty 7.0
    30175 ms 5552 in 1190 out 2026-05-09T19:57:56.316562+00:00