pith. machine review for the scientific record. sign in

← back to paper

Review history

arxiv: 2605.00200 · 2 revisions

Confidence Estimation in Automatic Short Answer Grading with LLMs

  1. 2026-05-14 UNVERDICTED LOW v0.9.0 novelty 6.0
    33929 ms 5514 in 1146 out 2026-05-14T20:55:29.146330+00:00
  2. 2026-05-09 UNVERDICTED LOW v0.9.0 novelty 5.0
    41316 ms 5514 in 1017 out 2026-05-09T20:19:09.958886+00:00