Doubly-robust LLM-as-a-judge: Externally valid estimation with imperfect personas. arXiv preprint arXiv:2509.22957

Luke Guerdan, Justin Whitehouse, Kimberly Truong, Kenneth Holstein, Zhiwei Steven Wu · arXiv 2509.22957

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Mitigating LLM-based p-Hacking by Preregistering for the Next LLM

cs.CL · 2026-06-26 · conditional · novelty 7.0

Preregistering LLM experiments to run on the first future eligible model blocks p-hacking transfer in roughly 73% of cases across 20 models and 11 configurations on two tasks with known ground truth.

citing papers explorer


Mitigating LLM-based p-Hacking by Preregistering for the Next LLM · cs.CL · 2026-06-26 · conditional · none · ref 6
