Beaver agent harness achieves 81.0 GRAS on multimodal scientific curation, outperforming frontier agents by over 23 points through scaffolding and evidence tooling.
URL https://aclanthology.org/2025
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.AI (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
Building Agent Harnesses for Scientific Curation from Multimodal Sources