Towards llm-based autograd- ing for short textual answers,

· 2023 · arXiv 2309.11508

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

"**Important** You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems

cs.CR · 2026-06-02 · unverdicted · novelty 5.0

LLM-based automatic grading systems are highly vulnerable to prompt injection attacks that force high scores regardless of answer quality, and existing defenses fail to mitigate them.

LLMs as Teaching Assistants for Mathematics Exam Grading: Reliability, and Practical Usability

cs.CY · 2026-06-01 · unverdicted · novelty 5.0

Liberal partial-credit prompting reduces question-level grading error for all six tested LLMs, with ChatGPT 5.5 Thinking (LIBERAL) achieving the lowest MAE of 1.87.

citing papers explorer

Showing 2 of 2 citing papers.

"**Important** You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems cs.CR · 2026-06-02 · unverdicted · none · ref 31
LLM-based automatic grading systems are highly vulnerable to prompt injection attacks that force high scores regardless of answer quality, and existing defenses fail to mitigate them.
LLMs as Teaching Assistants for Mathematics Exam Grading: Reliability, and Practical Usability cs.CY · 2026-06-01 · unverdicted · none · ref 1
Liberal partial-credit prompting reduces question-level grading error for all six tested LLMs, with ChatGPT 5.5 Thinking (LIBERAL) achieving the lowest MAE of 1.87.

Towards llm-based autograd- ing for short textual answers,

fields

years

verdicts

representative citing papers

citing papers explorer