Title resolution pending

Delivery: Pick one (i) On-Campus Partner (classes hosted at partner school premises), (ii) Learning Center (dedicated provider-run teaching location), or (iii) Hybrid: Kit + Video

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge

cs.CL · 2025-10-21 · unverdicted · novelty 7.0

ProfBench is a new multi-domain benchmark with human-expert rubrics for judging LLM responses on professional tasks, showing top models reach only 65.9% performance while providing cheap LLM judges that reduce evaluation cost by orders of magnitude.

citing papers explorer

Showing 1 of 1 citing paper.

ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge cs.CL · 2025-10-21 · unverdicted · none · ref 13
ProfBench is a new multi-domain benchmark with human-expert rubrics for judging LLM responses on professional tasks, showing top models reach only 65.9% performance while providing cheap LLM judges that reduce evaluation cost by orders of magnitude.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer