pith. sign in

Agent psychometrics: Task-level performance prediction in agentic coding benchmarks.arXiv preprint arXiv:2604.00594, 2026

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.AI 1

years

2026 1

verdicts

UNVERDICTED 1

representative citing papers

Stop Comparing LLM Agents Without Disclosing the Harness

cs.AI · 2026-05-07 · unverdicted · novelty 4.0

The Binding Constraint Thesis states that harness configuration governs performance variance more than model choice in long-horizon agent tasks, leading to misattribution in evaluations.

citing papers explorer

Showing 1 of 1 citing paper.

  • Stop Comparing LLM Agents Without Disclosing the Harness cs.AI · 2026-05-07 · unverdicted · none · ref 11

    The Binding Constraint Thesis states that harness configuration governs performance variance more than model choice in long-horizon agent tasks, leading to misattribution in evaluations.