DR³-Eval is a new benchmark for deep research agents that pairs authentic materials with a static sandbox corpus and a multi-dimensional evaluation framework aligned to human judgments.
DR³-Eval: Towards Realistic and Reproducible Deep Research Evaluation