Developers using AI showed the same core problem-solving behaviors as those without but differed in how they became stuck and recovered, with AI helping or hindering in specific cases.
Canonical reference
Title resolution pending
Canonical reference. 88% of citing Pith papers cite this work as background.
citation-role summary
citation-polarity summary
roles
background 8representative citing papers
CyberCertBench shows frontier LLMs reach human-expert performance on general IT and networking security but drop on vendor-specific and formal standards questions such as IEC 62443, with a new framework for producing interpretable explanations.
EvoGraph turns linear AI-assisted programming into a manipulable graph of branching histories, reducing cognitive load and enabling better iteration according to a user study with 20 developers.
REAP automatically curates production-derived benchmarks for AI coding agents via LLM classification and stability checks, producing the Harvest benchmark with model solve rates of 42.9-58.2%.
Mixed-methods study shows developers prefer GenAI for repetitive tasks, benefit from single interaction modes but not combined ones, and gain awareness from study participation.
Incidental prompt cues induce large, systematic shifts in the algorithm families chosen by LLMs during code generation across thousands of controlled trials.
A qualitative study of South Korean parents shows that trauma and healing after learning a child is LGBTQ+ leads to identity reconstruction as supportive parents and more critical, protective informating practices.
Empirical analysis of 4707 MoltBook posts shows AI-only technical discourse focuses on security, trust, and abstract topics while lacking concrete runtime and project details found in human GitHub discussions.
Co-locating tests with implementation code yields substantially higher preservation and correctness in foundation-model-generated programs than separated test syntax.
Researchers created a stigma-aware WhatsApp chatbot for menstrual health education in Pakistan through co-design workshops and a two-week deployment, yielding insights on its use for challenging taboos alongside tensions around trust and cultural explanations.
Aporia makes design decisions explicit and interactive in AI-assisted programming, leading to higher engagement and 5x fewer mental model disagreements with code in a 14-person user study compared to a baseline agent.
Polite chatbot feedback lowers psychological reactance and boosts behavioral intentions but lacks engagement, whereas verbal leakage heightens surprise and engagement at the expense of increased reactance.
Empirical analysis of 338 PRs with self-admitted ChatGPT usage shows low full integration (median 25%), selective adaptation patterns, and broader influence on developer reasoning during reviews.
Clover tool and behavioral taxonomy show tab-accept rates linked to lower attention-check scores and dwell time linked to higher scores in AI-assisted programming tasks.
Mixed-methods study adapting UTAUT2 shows individual-level perceptions predict continued GenAI use in Italian SME developers (R²=0.647) while social and organisational factors do not.
Longitudinal surveys show AI coding assistants reduce time on code writing but increase supervisory verification tasks, with stable productivity perceptions yet rising reports of worsened developer experience.
A multisite biometric study finds lower cognitive engagement under AI assistance via EEG and blink rate, with physiological-performance links present only in the non-AI condition.
Among novice programmers using AI code generators, trust did not predict compliance with suggestions, while performance correlated with both compliance and increased subsequent trust.
Hiding generative AI use to signal expertise reduces knowledge sharing and transparency among workplace colleagues.
User study reveals nine LLM failure categories in SE tasks and quantifies abandonment factors from 26 participants.
The paper outlines a controlled study protocol using staged programming tasks to measure the effects of configuration mechanisms on build-versus-buy decisions in Claude Code and OpenAI Codex.
Cross-boundary collaboration in open source is sustained by a thin carrier layer of contributors and repeat relationships that increase pull request acceptance rates from 42% to 87%.
Smaller LLMs produce functional but limited Python code with variable quantization effects and quality/maintainability concerns that require validation before use.
A survey of user studies on LLM use in programming that identifies interaction behaviors, mixed benefits and weaknesses, and factors influencing human and task performance.
citing papers explorer
-
Building Digital Societies as Ecosystems: How Recognition and Repeat Relationships Sustain Cross-Community Work in Open Source
Cross-boundary collaboration in open source is sustained by a thin carrier layer of contributors and repeat relationships that increase pull request acceptance rates from 42% to 87%.