Expert validators do not know whether they are the first or second non-expert validator when they are given the question

$10 bonus for each question one validates where the 2nd expert validator answers correctly, a $5 bonus for each question validated where the majority of non-expert validators ans

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

cs.AI · 2023-11-20 · accept · novelty 7.0

GPQA is a new graduate-level benchmark where PhD experts score 65% (74% after corrections), skilled non-experts score 34% with web access, and GPT-4 scores 39%, intended to enable realistic tests of human supervision over superhuman AI.

citing papers explorer

Showing 1 of 1 citing paper.

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

cs.AI · 2023-11-20 · accept · none · ref 11
