Gyeong-Geon Lee, Ehsan Latif, Xuansheng Wu, Ning- hao Liu, and Xiaoming Zhai

Towards reliable LLM grading through selfconsistency, selective human review: Higher accuracy, less work · 2024 · arXiv 2312.03748

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

Generative-Evaluative Agreement: A Necessary Validity Criterion for LLM-Enabled Adaptive Assessment

cs.AI · 2026-05-19 · unverdicted · novelty 7.0

Defines GEA validity criterion and reports first measurement of r=0.698 recovery with positive bias in LLM two-stage adaptive assessment, stronger for verifiable skills.

citing papers explorer

Showing 1 of 1 citing paper.

Generative-Evaluative Agreement: A Necessary Validity Criterion for LLM-Enabled Adaptive Assessment cs.AI · 2026-05-19 · unverdicted · none · ref 1
Defines GEA validity criterion and reports first measurement of r=0.698 recovery with positive bias in LLM two-stage adaptive assessment, stronger for verifiable skills.

Gyeong-Geon Lee, Ehsan Latif, Xuansheng Wu, Ning- hao Liu, and Xiaoming Zhai

fields

years

verdicts

representative citing papers

citing papers explorer