PROCEDURES If you agree to participate in this study, you will be asked to complete the following:
• Pre-Test Questionnaire: You will first complete an online questionnaire (approx. 10 minutes) to provide basic information, educational background, and technical experience

1 Pith paper cites this work.


representative citing papers

CentaurEval: Benchmarking Human-in-the-Loop Value in Agentic Coding

cs.SE · 2025-11-30 · unverdicted · novelty 7.0 · 2 refs

Human-AI collaboration on CentaurEval's collaboration-necessary tasks reaches 31.11% success, far above standalone humans at 18.89% or LLMs at 0.67%.


