pith. sign in

arxiv: 2511.06428 · v2 · pith:KVMHJ3PSnew · submitted 2025-11-09 · 💻 cs.SE · cs.AI

Walking the Tightrope of LLMs for Software Development: A Practitioners' Perspective

Pith reviewed 2026-05-22 12:41 UTC · model grok-4.3

classification 💻 cs.SE cs.AI
keywords LLMssoftware developmentpractitioner perspectivesbenefits and challengestrade-offsempirical study
0
0 comments X

The pith

Software practitioners see LLMs as tools that sustain flow and sharpen mental models but can harm reputation across multiple levels.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper examines how large language models affect software development by interviewing 22 practitioners over three rounds. It finds benefits including maintained developer flow, better mental models, and increased entrepreneurship, set against challenges such as potential damage to developers' reputation. These are examined at the individual, team, organization, and society scales, along with practical steps to reduce the downsides. The results aim to help team leaders and managers judge when and how to adopt LLMs in their own settings.

Core claim

The study identifies both positive and negative effects of LLMs on software development as reported by practitioners. Benefits include maintaining flow, improving mental models, and fostering entrepreneurship, while challenges encompass risks to reputation. These impacts occur at individual, team, organizational, and societal levels, and the work provides guidance on how to mitigate the challenges through careful management of trade-offs.

What carries the argument

The socio-technical grounded theory analysis of practitioner interviews that uncovers benefits and challenges of LLM use organized by individual, team, organization, and society levels.

If this is right

  • Team leaders can evaluate LLM adoption by weighing flow improvements against reputation concerns in their specific setting.
  • Organizations may need policies to address societal-level effects of widespread LLM use in development.
  • Developers could apply mitigation strategies to preserve their professional standing while gaining productivity aids.
  • Entrepreneurship in software teams might increase if LLM tools are integrated thoughtfully.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Extending beyond the interviews, similar trade-offs between productivity tools and professional risks could arise in other creative or knowledge work domains.
  • The mitigation guidances could be piloted in real teams to see if they effectively reduce challenges like reputation damage.
  • Broader studies might explore how these findings vary with different LLM tools or developer experience levels.

Load-bearing premise

The self-reported experiences of these 22 practitioners are representative enough of the wider software development community to draw general conclusions about LLM impacts.

What would settle it

A large-scale survey of software developers finding that the balance of benefits and challenges differs markedly from what the interviews suggest would challenge the paper's conclusions.

Figures

Figures reproduced from arXiv: 2511.06428 by Christoph Treude, John Grundy, Rashina Hoda, Samuel Ferino.

Figure 1
Figure 1. Figure 1: Study methodology. related activities, which involves exploring the benefits, challenges, limitations, and recommendations shared by software practitioners involved in SE-related activities. Our analysis reveals the following main benefits: (B1) reduced effort due to LLMs as a foundation to boost code development and perceived saving time; (B2) flow experience when LLMs mitigate interruptions and automate … view at source ↗
Figure 2
Figure 2. Figure 2: Emergence of the category “Impact on using LLMs” from raw data → codes → concepts → subcategories → category through constant comparison. provide a snapshot of the LLM features with the intent of improving clarity of the context related to the data collection. We summarise the main features in the supplementary online package [26]. 2.4 Data Analysis One of the ways in which STGT distinguishes itself from t… view at source ↗
Figure 3
Figure 3. Figure 3: Main Benefits of using LLMs at the Individual (Software Practitioner) Level. [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Main Disadvantages of using LLMs at the Individual (Software Practitioner) Level. [PITH_FULL_IMAGE:figures/full_fig_p011_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Comparison with related work focused on benefits and disadvantages. The colored bars represent the overlap with existing [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗
read the original abstract

Background: Large Language Models emerged with the potential of provoking a revolution in software development (e.g., automating processes, workforce transformation). Although studies have started to investigate the perceived impact of LLMs for software development, there is a need for empirical studies to comprehend how to balance forward and backward effects of using LLMs. Objective: We investigated how LLMs impact software development and how to manage the impact from a software developer's perspective. Method: We conducted 22 interviews with software practitioners across 3 rounds of data collection and analysis, between October (2024) and September (2025). We employed Socio-Technical Grounded Theory for Data Analysis (STGT4DA) to rigorously analyse interview participants' responses. Results: We identified the benefits (e.g., maintain developer flow, improve developer mental models, and foster entrepreneurship) and challenges (e.g., damage to developers' reputation) of using LLMs at individual, team, organisation, and society levels; as well as actionable guidances into how mitigate these challenges. Conclusion: Critically, we present the trade-offs that software practitioners, teams, and organisations face in working with LLMs. Our findings are particularly useful for software team leaders and IT managers to assess the viability of LLMs within their specific context.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper reports results from a qualitative study of 22 interviews with software practitioners, collected in three iterative rounds between October 2024 and September 2025 and analyzed with Socio-Technical Grounded Theory for Data Analysis (STGT4DA). It claims to identify benefits (maintaining developer flow, improving mental models, fostering entrepreneurship) and challenges (e.g., damage to developers' reputation) of LLM use at individual, team, organizational, and societal levels, together with actionable mitigation guidance and an overall discussion of trade-offs useful to team leaders and IT managers.

Significance. If the methodological gaps are addressed, the work supplies new primary interview data on a timely topic and offers a multi-level framing of LLM impacts that could help practitioners weigh benefits against risks. The explicit attention to both positive and negative effects, plus the practitioner-oriented guidance, distinguishes it from purely technical LLM evaluations.

major comments (3)
  1. [Method] Method section: the sampling strategy, recruitment criteria, and participant demographics are not described in sufficient detail to assess whether the 22 practitioners capture adequate variation in roles, organization sizes, and LLM experience levels. Without this, the extension of themes to team-, organization-, and society-level claims rests on an unverified assumption of representativeness.
  2. [Method] Method section: the interview protocol (including question guides, how prompts evolved across the three rounds, and steps taken to reduce social-desirability or recall bias in self-reported LLM usage) is omitted. This information is load-bearing for evaluating the trustworthiness of the reported benefits, challenges, and mitigations.
  3. [Results] Results / Analysis description: explicit reporting of theoretical saturation criteria, member-checking procedures, or any form of triangulation (e.g., with artifacts or secondary data) is absent. This weakens confidence that the identified themes are not artifacts of the particular cohort or analysis process.
minor comments (2)
  1. [Abstract] Abstract: the date range 'October (2024) and September (2025)' should be clarified or corrected, as it extends into the future relative to typical submission dates.
  2. [Results] Ensure that each actionable guidance in the results is explicitly traced back to one or more interview-derived themes so readers can judge its empirical grounding.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their constructive and detailed feedback on our manuscript. The comments highlight key areas where greater methodological transparency will strengthen the paper's contribution. We address each major comment below, indicating the revisions we will make to improve rigor and clarity without altering the core findings.

read point-by-point responses
  1. Referee: [Method] Method section: the sampling strategy, recruitment criteria, and participant demographics are not described in sufficient detail to assess whether the 22 practitioners capture adequate variation in roles, organization sizes, and LLM experience levels. Without this, the extension of themes to team-, organization-, and society-level claims rests on an unverified assumption of representativeness.

    Authors: We agree that additional detail on sampling and participant characteristics is necessary to support the multi-level claims. In the revised manuscript, we will expand the Participants subsection to explicitly describe the purposive sampling strategy, recruitment channels (professional networks and industry contacts), inclusion criteria focused on active LLM users in software roles, and a demographics table covering roles, organization sizes, years of experience, and self-reported LLM usage frequency. This will provide readers with the information needed to evaluate variation and the basis for extending themes across levels. revision: yes

  2. Referee: [Method] Method section: the interview protocol (including question guides, how prompts evolved across the three rounds, and steps taken to reduce social-desirability or recall bias in self-reported LLM usage) is omitted. This information is load-bearing for evaluating the trustworthiness of the reported benefits, challenges, and mitigations.

    Authors: We acknowledge that the interview protocol details were not sufficiently elaborated. The revised manuscript will include the core semi-structured interview guide, a description of how questions were iteratively refined across the three data collection rounds based on emerging themes from prior analysis, and explicit steps taken to reduce bias, such as assurances of anonymity, use of neutral and open-ended phrasing, requests for concrete recent examples to aid recall, and avoidance of leading questions. These additions will allow better assessment of data trustworthiness. revision: yes

  3. Referee: [Results] Results / Analysis description: explicit reporting of theoretical saturation criteria, member-checking procedures, or any form of triangulation (e.g., with artifacts or secondary data) is absent. This weakens confidence that the identified themes are not artifacts of the particular cohort or analysis process.

    Authors: We agree that explicit documentation of saturation assessment, member checking, and triangulation would enhance confidence in the analysis. In the revised version, we will add a dedicated subsection under Data Analysis that reports the criteria used to determine theoretical saturation (no new categories emerging after the third round), any member-checking steps performed with a subset of participants, and triangulation via independent coding by multiple researchers plus consistency checks against related literature. Where a procedure was not applied, we will note it transparently as a limitation. revision: yes

Circularity Check

0 steps flagged

No circularity: primary interview data analyzed via STGT4DA

full rationale

The paper derives its claims about benefits (e.g., maintaining developer flow), challenges (e.g., reputation damage), and mitigations at individual/team/org/society levels directly from thematic analysis of 22 new practitioner interviews collected in three iterative rounds using Socio-Technical Grounded Theory for Data Analysis (STGT4DA). No equations, fitted parameters renamed as predictions, self-definitional constructs, or load-bearing self-citations appear in the derivation chain. The central results are outputs of the empirical coding process applied to fresh primary data rather than reductions to prior inputs or ansatzes. This is a standard self-contained qualitative study whose validity rests on sampling and triangulation questions, not on any circular structure.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The work rests on standard assumptions of qualitative interview research and grounded theory rather than introducing new free parameters, axioms, or invented entities.

axioms (2)
  • domain assumption Self-reported experiences of interviewed practitioners accurately reflect real LLM impacts on software development.
    Invoked implicitly in the results and conclusion sections when generalizing from the 22 interviews.
  • standard math Socio-Technical Grounded Theory for Data Analysis (STGT4DA) is an appropriate and rigorous method for organizing the interview responses.
    Stated in the method section as the chosen analysis approach.

pith-pipeline@v0.9.0 · 5768 in / 1382 out tokens · 55071 ms · 2026-05-22T12:41:22.386301+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

68 extracted references · 68 canonical work pages

  1. [1]

    Application of large language models to software engineering tasks: Opportunities, risks, and implications,

    I. Ozkaya, “Application of large language models to software engineering tasks: Opportunities, risks, and implications,”IEEE Software, vol. 40, no. 3, pp. 4–8, 2023

  2. [2]

    Welcome to the era of chatgpt et al. the prospects of large language models,

    T. Teubneret al., “Welcome to the era of chatgpt et al. the prospects of large language models,”Business & Information Systems Engineering, vol. 65, no. 2, pp. 95–101, 2023

  3. [3]

    Opinion paper:“so what if chatgpt wrote it?

    Y . K. Dwivediet al., “Opinion paper:“so what if chatgpt wrote it?” multidisciplinary perspectives on opportunities, challenges and implica- tions of generative conversational ai for research, practice and policy,” International Journal of Information Management, vol. 71, p. 102642, 2023

  4. [4]

    Github copilot ai pair programmer: Asset or liability?

    A. M. Dakhelet al., “Github copilot ai pair programmer: Asset or liability?” Journal of Systems and Software, vol. 203, p. 111734, 2023

  5. [5]

    Exploring the problems, their causes and solutions of ai pair programming: A study on github and stack overflow,

    X. Zhouet al., “Exploring the problems, their causes and solutions of ai pair programming: A study on github and stack overflow,”Journal of Systems and Software, p. 112204, 2024

  6. [6]

    Large language models for software engineering: A systematic literature review,

    X. Houet al., “Large language models for software engineering: A systematic literature review,”ACM Trans. Softw. Eng. Methodol., Sep

  7. [7]

    Available: https://doi.org/10.1145/3695988

    [Online]. Available: https://doi.org/10.1145/3695988

  8. [8]

    Software testing with large language models: Survey, landscape, and vision,

    J. Wanget al., “Software testing with large language models: Survey, landscape, and vision,”IEEE Transactions on Software Engineering, vol. 50, no. 4, pp. 911–936, 2024

  9. [9]

    On the use of large language models in model-driven engineering,

    J. Di Roccoet al., “On the use of large language models in model-driven engineering,”Software and Systems Modeling, pp. 1–26, 2025

  10. [10]

    Superagency in the workplace: Empowering people to unlock ai’s full potential,

    H. Mayeret al., “Superagency in the workplace: Empowering people to unlock ai’s full potential,”McKinsey Digital, vol. 28, 2025

  11. [11]

    Debelliset al.(2024) Superagency in the workplace: Empowering people to unlock ai’s full potential

    D. Debelliset al.(2024) Superagency in the workplace: Empowering people to unlock ai’s full potential. [Online]. Available: https: //dora.dev/research/ai/gen-ai-report/

  12. [12]

    DeBelliset al.(2025) 2025 dora state of ai-assisted software development

    D. DeBelliset al.(2025) 2025 dora state of ai-assisted software development. [Online]. Available: https://dora.dev/research/ai/ #state-of-ai-assisted-software-development

  13. [13]

    The consequences of generative ai for online knowledge communities,

    G. Burtchet al., “The consequences of generative ai for online knowledge communities,”Scientific Reports, vol. 14, no. 1, p. 10413, 2024

  14. [14]

    Is stack overflow obsolete? an empirical study of the characteristics of chatgpt answers to stack overflow questions,

    S. Kabir, D. N. Udo-Imeh, B. Kou, and T. Zhang, “Is stack overflow obsolete? an empirical study of the characteristics of chatgpt answers to stack overflow questions,” inProceedings of the CHI Conference on Human Factors in Computing Systems, 2024, pp. 1–17

  15. [15]

    Measuring github copilot’s impact on productivity,

    A. Ziegleret al., “Measuring github copilot’s impact on productivity,” Communications of the ACM, vol. 67, no. 3, pp. 54–63, 2024

  16. [16]

    The effects of generative ai on high skilled work: Evidence from three field experiments with software developers,

    Z. K. Cui, M. Demirer, S. Jaffe, L. Musolff, S. Peng, and T. Salz, “The effects of generative ai on high skilled work: Evidence from three field experiments with software developers,”Available at SSRN 4945566, 2024

  17. [17]

    Generative ai for software practitioners,

    C. Ebert and P. Louridas, “Generative ai for software practitioners,”IEEE Software, vol. 40, no. 4, pp. 30–38, 2023

  18. [18]

    Using an llm to help with code understanding,

    D. Nam, A. Macvean, V . Hellendoorn, B. Vasilescu, and B. Myers, “Using an llm to help with code understanding,” inProceedings of the IEEE/ACM 46th International Conference on Software Engineering, 2024, pp. 1–13

  19. [19]

    “will i be replaced?

    M. A. Kuhailet al., ““will i be replaced?” assessing chatgpt’s effect on software development and programmer perceptions of ai tools,”Science of Computer Programming, vol. 235, p. 103111, 2024

  20. [20]

    ”create a fear of missing out

    V . Kraußet al., “”create a fear of missing out”-chatgpt implements unsolicited deceptive designs in generated websites without warning,” in Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems, 2025, pp. 1–20

  21. [21]

    The impact of generative ai on critical thinking: Self-reported reductions in cognitive effort and confidence effects from a survey of knowledge workers,

    H.-P. H. Leeet al., “The impact of generative ai on critical thinking: Self-reported reductions in cognitive effort and confidence effects from a survey of knowledge workers,” inProceedings of the CHI Conference on Human Factors in Computing Systems, 2025. JOURNAL OF LATEX CLASS FILES, VOL. 14, NO. 8, AUGUST 2021 16

  22. [22]

    An empirical study on challenges for llm application de- velopers,

    X. Chenet al., “An empirical study on challenges for llm application de- velopers,”ACM Transactions on Software Engineering and Methodology, 2025

  23. [23]

    Efficient and green large language models for software engineering: Literature review, vision, and the road ahead,

    J. Shi, Z. Yang, and D. Lo, “Efficient and green large language models for software engineering: Literature review, vision, and the road ahead,” ACM Transactions on Software Engineering and Methodology, vol. 34, no. 5, pp. 1–22, 2025

  24. [24]

    The impact of llm-assistants on software developer productivity: A systematic literature review,

    A. Mohamed, M. Assi, and M. Guizani, “The impact of llm-assistants on software developer productivity: A systematic literature review,”arXiv preprint arXiv:2507.03156, 2025

  25. [25]

    Novice developers’ perspectives on adopting llms for software development: a systematic literature review,

    S. Ferino, R. Hoda, J. Grundy, and C. Treude, “Novice developers’ perspectives on adopting llms for software development: a systematic literature review,”arXiv preprint arXiv:2503.07556, 2025

  26. [26]

    Socio-technical grounded theory for software engineering,

    R. Hoda, “Socio-technical grounded theory for software engineering,” IEEE Transactions on Software Engineering, vol. 48, no. 10, pp. 3808– 3832, 2022

  27. [27]

    Supplementary Information Package - STGT,

    S. Ferino, R. Hoda, J. Grundy, and C. Treude, “Supplementary Information Package - STGT,” 2025. [Online]. Available: https: //doi.org/10.5281/zenodo.17556044

  28. [28]

    Hoda,Qualitative Research with Socio-Technical Grounded Theory

    R. Hoda,Qualitative Research with Socio-Technical Grounded Theory. Springer, 2024. [Online]. Available: https://link.springer.com/book/10. 1007/978-3-031-60533-8

  29. [29]

    Dealing with data challenges when delivering data- intensive software solutions,

    U. M. Graetschet al., “Dealing with data challenges when delivering data- intensive software solutions,”IEEE Transactions on software engineering, vol. 49, no. 9, pp. 4349–4370, 2023

  30. [30]

    Assessing the attitude towards artificial intelligence: Introduction of a short measure in german, chinese, and english language,

    C. Sindermannet al., “Assessing the attitude towards artificial intelligence: Introduction of a short measure in german, chinese, and english language,” KI-K¨unstliche intelligenz, vol. 35, no. 1, pp. 109–118, 2021

  31. [31]

    Dogra and A

    A. Dogra and A. Nieto. (2025) Build with gpt-5 on databricks with ai gateway. Accessed: 2025-10-07. [Online]. Available: https: //www.databricks.com/blog/build-gpt-5-databricks-ai-gateway

  32. [32]

    M. team. (2025) Overview of copilot for power bi. Accessed: 2025- 10-07. [Online]. Available: https://learn.microsoft.com/en-us/power-bi/ create-reports/copilot-introduction

  33. [33]

    Real world scrum a grounded theory of variations in practice,

    Z. Masood, R. Hoda, and K. Blincoe, “Real world scrum a grounded theory of variations in practice,”IEEE Transactions on Software Engineering, vol. 48, no. 5, pp. 1579–1591, 2020

  34. [34]

    Agentic software engineering: Foundational pillars and a research roadmap,

    A. E. Hassanet al., “Agentic software engineering: Foundational pillars and a research roadmap,”arXiv preprint arXiv:2509.06216, 2025

  35. [35]

    Navigating the complexity of generative ai adoption in software engineering,

    D. Russo, “Navigating the complexity of generative ai adoption in software engineering,”ACM Transactions on Software Engineering and Methodology, vol. 33, no. 5, pp. 1–50, 2024

  36. [36]

    A large-scale survey on the usability of ai programming assistants: Successes and challenges,

    J. T. Liang, C. Yang, and B. A. Myers, “A large-scale survey on the usability of ai programming assistants: Successes and challenges,” in Proceedings of the 46th IEEE/ACM international conference on software engineering, 2024, pp. 1–13

  37. [37]

    Significant productivity gains through programming with large language models,

    T. Weberet al., “Significant productivity gains through programming with large language models,”Proceedings of the ACM on Human-Computer Interaction, vol. 8, no. EICS, pp. 1–29, 2024

  38. [38]

    Software engineering by and for humans in an ai era,

    S. Abrah ˜ao, J. Grundy, M. Pezz `e, M.-A. Storey, and D. A. Tamburri, “Software engineering by and for humans in an ai era,”ACM Transactions on Software Engineering and Methodology, vol. 34, no. 5, pp. 1–46, 2025

  39. [39]

    How hungry is ai? benchmarking energy, water, and carbon footprint of llm inference,

    N. Jegham, M. Abdelatti, L. Elmoubarki, and A. Hendawi, “How hungry is ai? benchmarking energy, water, and carbon footprint of llm inference,” arXiv preprint arXiv:2505.09598, 2025

  40. [40]

    Vibe coding in practice: Motivations, challenges, and a future outlook-a grey literature review,

    A. Fawz, A. Tahir, and K. Blincoe, “Vibe coding in practice: Motivations, challenges, and a future outlook-a grey literature review,”arXiv preprint arXiv:2510.00328, 2025

  41. [41]

    Test-driven development and llm-based code generation,

    N. S. Mathews and M. Nagappan, “Test-driven development and llm-based code generation,” inProceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, ser. ASE ’24. New York, NY , USA: Association for Computing Machinery, 2024, p. 1583–1594. [Online]. Available: https://doi.org/10.1145/3691620.3695527

  42. [42]

    Today was a good day: The daily life of software developers,

    A. N. Meyer, E. T. Barr, C. Bird, and T. Zimmermann, “Today was a good day: The daily life of software developers,”IEEE Transactions on Software Engineering, vol. 47, no. 5, pp. 863–880, 2019

  43. [43]

    A systematic literature review on the influence of enhanced developer experience on developers’ productivity: Factors, practices, and recommendations,

    A. Razzaq, J. Buckley, Q. Lai, T. Yu, and G. Botterweck, “A systematic literature review on the influence of enhanced developer experience on developers’ productivity: Factors, practices, and recommendations,”ACM Computing Surveys, vol. 57, no. 1, pp. 1–46, 2024

  44. [44]

    Reducing interruptions at work: A large-scale field study of flowlight,

    M. Z¨ugeret al., “Reducing interruptions at work: A large-scale field study of flowlight,” inProceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017, pp. 61–72

  45. [45]

    Reading between the lines: Modeling user behavior and costs in ai-assisted programming,

    H. Mozannaret al., “Reading between the lines: Modeling user behavior and costs in ai-assisted programming,” inProceedings of the 2024 CHI Conference on Human Factors in Computing Systems, 2024, pp. 1–16

  46. [46]

    Flow experience in software engineering,

    S. Ritonummi, V . Siitonen, M. Salo, H. Pirkkalainen, and A. Sivunen, “Flow experience in software engineering,” inProceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2023, pp. 618–630

  47. [47]

    Devex: What actually drives productivity?

    A. Nodaet al., “Devex: What actually drives productivity?”Communica- tions of the ACM, vol. 66, no. 11, pp. 44–49, 2023

  48. [48]

    Ironies of generative ai: understanding and mitigating productivity loss in human-ai interaction,

    A. Simkuteet al., “Ironies of generative ai: understanding and mitigating productivity loss in human-ai interaction,”International Journal of Human–Computer Interaction, vol. 41, no. 5, pp. 2898–2919, 2025

  49. [49]

    Promptmaker: Prompt-based prototyping with large language models,

    E. Jianget al., “Promptmaker: Prompt-based prototyping with large language models,” inCHI Conference on Human Factors in Computing Systems Extended Abstracts, 2022, pp. 1–8

  50. [50]

    Prototyping with prompts: Emerging approaches and challenges in generative ai design for collaborative software teams,

    H. Subramonyamet al., “Prototyping with prompts: Emerging approaches and challenges in generative ai design for collaborative software teams,” in Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems, 2025, pp. 1–22

  51. [51]

    Minimum viable product or multiple facet product? the role of mvp in software startups,

    A. N. Duc and P. Abrahamsson, “Minimum viable product or multiple facet product? the role of mvp in software startups,” inInternational conference on agile software development. Springer, 2016, pp. 118–130

  52. [52]

    Accountability in code review: The role of intrinsic drivers and the impact of llms,

    A. Alami, V . V . Jensen, and N. A. Ernst, “Accountability in code review: The role of intrinsic drivers and the impact of llms,”ACM Transactions on Software Engineering and Methodology, 2025

  53. [53]

    Human and machine: How software engineers perceive and engage with ai-assisted code reviews compared to their peers,

    A. Alami and N. Ernst, “Human and machine: How software engineers perceive and engage with ai-assisted code reviews compared to their peers,” in2025 IEEE/ACM 18th International Conference on Cooperative and Human Aspects of Software Engineering (CHASE). IEEE, 2025, pp. 63–74

  54. [54]

    What makes a great manager of software engineers?

    E. Kalliamvakouet al., “What makes a great manager of software engineers?”IEEE Transactions on Software Engineering, vol. 45, no. 1, pp. 87–106, 2017

  55. [55]

    Intuition in software development,

    P. Naur, “Intuition in software development,” inInternational Joint Conference on Theory and Practice of Software Development. Springer, 1985, pp. 60–79

  56. [56]

    Understanding the role of human intuition on reliance in human-ai decision-making with explanations,

    V . Chenet al., “Understanding the role of human intuition on reliance in human-ai decision-making with explanations,”Proceedings of the ACM on Human-computer Interaction, vol. 7, no. CSCW2, pp. 1–32, 2023

  57. [57]

    Burnout in software engineering: A systematic mapping study,

    T. R. Tulili, A. Capiluppi, and A. Rastogi, “Burnout in software engineering: A systematic mapping study,”Information and Software Technology, vol. 155, p. 107116, 2023

  58. [58]

    Grounded copilot: How programmers interact with code- generating models,

    S. Barkeet al., “Grounded copilot: How programmers interact with code- generating models,”Proceedings of the ACM on Programming Languages, vol. 7, no. OOPSLA1, pp. 85–111, 2023

  59. [59]

    Copiloting the future: How generative ai transforms software engineering,

    L. Banh, F. Holldack, and G. Strobel, “Copiloting the future: How generative ai transforms software engineering,”Information and Software Technology, vol. 183, p. 107751, 2025

  60. [60]

    Prompts are programs too! understanding how developers build software containing prompts,

    J. T. Lianget al., “Prompts are programs too! understanding how developers build software containing prompts,”Proceedings of the ACM on Software Engineering, vol. 2, no. FSE, pp. 1591–1614, 2025

  61. [61]

    Ai tool use and adoption in software development by individuals and organizations: a grounded theory study,

    Z. S. Liet al., “Ai tool use and adoption in software development by individuals and organizations: a grounded theory study,”arXiv preprint arXiv:2406.17325, 2024

  62. [62]

    Glaser and A

    B. Glaser and A. Strauss,Discovery of grounded theory: Strategies for qualitative research. Routledge, 2017

  63. [63]

    Juliet and S

    M. Juliet and S. Corbin,Basics of qualitative research: Techniques and procedures for developing grounded theory. SAGE Publications, Incorporated, 2015

  64. [64]

    M. R. Roller and P. J. Lavrakas,Applied qualitative research design: A total quality framework approach. Guilford Publications, 2015

  65. [65]

    Qualitative software engineering research: Reflections and guidelines,

    P. Lenberget al., “Qualitative software engineering research: Reflections and guidelines,”Journal of Software: Evolution and Process, vol. 36, no. 6, p. e2607, 2024

  66. [66]

    The role of empathy in software engineering - a socio-technical grounded theory,

    H. Gunatilakeet al., “The role of empathy in software engineering - a socio-technical grounded theory,”ACM Trans. Softw. Eng. Methodol., Sep. 2025, just Accepted. [Online]. Available: https://doi.org/10.1145/3768315

  67. [67]

    Series: Practical guidance to qualitative research. part 4: Trustworthiness and publishing,

    I. Korstjens and A. Moser, “Series: Practical guidance to qualitative research. part 4: Trustworthiness and publishing,”European Journal of General Practice, vol. 24, no. 1, pp. 120–124, 2018

  68. [68]

    The struggle is real! the agony of recruiting partic- ipants for empirical software engineering studies,

    K. Madampeet al., “The struggle is real! the agony of recruiting partic- ipants for empirical software engineering studies,” in2024 IEEE Sym- posium on Visual Languages and Human-Centric Computing (VL/HCC). IEEE, 2024, pp. 417–422