Brauner, and Matthias Samwald

Adriano Barbosa-Silva, Simon Ott, Kathrin Blagec, Jan M · 2022 · arXiv 2203.04592

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

SEAL: Can Saturated Benchmarks Be Revived by LLM-as-a-Meta-Judge?

cs.CL · 2026-05-28 · unverdicted · novelty 6.0

SEAL revives saturated benchmarks via adaptive LLM meta-judging in elimination matches, matching full pairwise accuracy with roughly half the calls across code, math, QA, and agent tasks.

citing papers explorer

Showing 1 of 1 citing paper after filters.

SEAL: Can Saturated Benchmarks Be Revived by LLM-as-a-Meta-Judge? cs.CL · 2026-05-28 · unverdicted · none · ref 2
SEAL revives saturated benchmarks via adaptive LLM meta-judging in elimination matches, matching full pairwise accuracy with roughly half the calls across code, math, QA, and agent tasks.

Brauner, and Matthias Samwald

fields

years

verdicts

representative citing papers

citing papers explorer