Title resolution pending

If gold_answer gives separate component values but the question asks for a combined total, a correct combined total in model_answer is consistent, as long as it directly answers the qu · 2000

1 Pith paper cites this work. Polarity classification is still indexing.


Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

DocScope: Benchmarking Verifiable Reasoning for Trustworthy Long-Document Understanding

cs.CL · 2026-05-09 · unverdicted · novelty 7.0 · ref 21

DocScope benchmarks long-document QA by requiring models to predict evidence pages, regions, facts, and answers, finding that even correct answers have complete evidence chains only 29% of the time at best.
