Novices performed better and reported lower workload with GitHub Copilot than with human partners, but human partners produced more positive emotions and a smaller drop in retest performance after one week.
Herbsleb, Alexandra Holloway, and Scott Davidoff
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 2roles
background 1polarities
background 1representative citing papers
CodeQ aggregates token rationales into code categories to enable global interpretability of LLMs, claiming over 50% entropy reduction and revealing model preference for syntactic cues plus human misalignment in a 37-person study.
citing papers explorer
-
Fast and Forgettable: A Controlled Study of Novices' Performance, Learning, Workload, and Emotion in AI-Assisted and Human Pair Programming Paradigms
Novices performed better and reported lower workload with GitHub Copilot than with human partners, but human partners produced more positive emotions and a smaller drop in retest performance after one week.
-
Enabling Global, Human-Centered Explanations for LLMs:From Tokens to Interpretable Code and Test Generation
CodeQ aggregates token rationales into code categories to enable global interpretability of LLMs, claiming over 50% entropy reduction and revealing model preference for syntactic cues plus human misalignment in a 37-person study.