pith. machine review for the scientific record. sign in

hub

Note on the sampling error of the difference between correlated proportions or percentages.Psychometrika, 12(2):153–157

10 Pith papers cite this work, alongside 3,350 external citations. Polarity classification is still indexing.

10 Pith papers citing it
3,350 external citations · Crossref

hub tools

years

2026 10

verdicts

UNVERDICTED 10

representative citing papers

Evaluating Plan Compliance in Autonomous Programming Agents

cs.SE · 2026-04-13 · unverdicted · novelty 7.0

Autonomous programming agents frequently fail to follow instructed plans, falling back on incomplete internalized workflows, while standard plans and periodic reminders improve performance but poor plans can degrade it more than no plan.

Evaluating LLM Agents on Automated Software Analysis Tasks

cs.SE · 2026-04-13 · unverdicted · novelty 7.0

A custom LLM agent achieves 94% manually verified success on a new benchmark of 35 software analysis setups, outperforming baselines at 77%, but struggles with stage mixing, error localization, and overestimating its own success.

citing papers explorer

Showing 10 of 10 citing papers.