Prediction Model of Motivators and Demotivators of Integrating Large Language Models in Software Engineering Education: An Empirical Study

Estefan\'ia Mart\'in-Barroso; Jussi Kasurinen; Maryam Khan; Muhammad Azeem Akbar

arxiv: 2605.09393 · v2 · pith:VYV6JCWDnew · submitted 2026-05-10 · 💻 cs.SE

Prediction Model of Motivators and Demotivators of Integrating Large Language Models in Software Engineering Education: An Empirical Study

Maryam Khan , Muhammad Azeem Akbar , Jussi Kasurinen , Estefan\'ia Mart\'in-Barroso This is my paper

Pith reviewed 2026-05-20 23:12 UTC · model grok-4.3

classification 💻 cs.SE

keywords large language modelssoftware engineering educationmotivators and demotivatorsprediction modelgenetic algorithm optimizationstakeholder surveycost-aware integrationgovernance safeguards

0 comments

The pith

Stakeholder surveys combined with probabilistic modeling and genetic algorithm optimization can identify cost-efficient paths for integrating large language models into software engineering education.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper builds a prediction model that turns survey answers about what encourages or discourages the use of large language models into estimates of how familiar stakeholders are likely to become with the technology. Those estimates then feed an optimization routine that weighs the expected familiarity gains against the money and effort required to implement various integration steps. A sympathetic reader would care because the resulting framework gives institutions a concrete way to plan staged rollouts that favor high-benefit areas while respecting limited resources and addressing risks such as plagiarism and loss of critical thinking skills. The work shows that ethical and governance safeguards emerge as high-priority items when costs are constrained.

Core claim

The study claims that operationalizing nineteen factors into a survey of 126 stakeholders, training Naive Bayes and Logistic Regression models to predict the probability of high LLM familiarity from Likert responses, and embedding those probabilities in a Genetic Algorithm that minimizes implementation cost produces an optimization-informed decision support framework capable of recommending staged, cost-aware integration strategies for large language models in software engineering education, with particular emphasis on governance mechanisms such as integrity and ethical safeguards.

What carries the argument

The Genetic Algorithm optimization layer that trades off predicted familiarity probabilities against implementation costs at both global and category-specific levels.

If this is right

Governance mechanisms focused on integrity and ethical safeguards should receive priority when budgets are limited.
Programming assistance, debugging support, and personalized adaptive learning are perceived as the strongest benefits.
Plagiarism concerns, over-reliance risks, and potential reductions in critical thinking require explicit mitigation in any rollout plan.
Decisions can be made separately at the overall institutional level and at the level of individual educational categories.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same survey-plus-optimization structure could be reused for other classroom technologies by substituting new factors while keeping the modeling steps intact.
Institutions might begin with low-cost pilots in programming assistance to raise familiarity before expanding into more complex uses.
The cost-awareness built into the model implies a need for periodic re-surveying to check whether actual familiarity matches the original predictions.

Load-bearing premise

The nineteen factors taken from earlier literature taxonomies capture the essential motivators and demotivators, and the Likert-scale answers collected from the 126 stakeholders supply reliable training data for the probabilistic models and the subsequent optimization.

What would settle it

A follow-up study that collects new survey responses from a comparable or larger group and produces materially different optimization priorities, such as down-ranking governance safeguards, would falsify the claim that the current model reliably identifies cost-efficient integration strategies.

Figures

Figures reproduced from arXiv: 2605.09393 by Estefan\'ia Mart\'in-Barroso, Jussi Kasurinen, Maryam Khan, Muhammad Azeem Akbar.

**Figure 1.** Figure 1: Research Methodology Process and Cognitive Load, and Integration and Practical Implementation. Across these categories, ten subthemes were identified: Plagiarism and Intellectual Property Concerns, Over-Reliance on AI in Learning, Reduced Critical Thinking and Problem-Solving, Ethical Concerns in AI-Assisted Learning, Challenges in Evaluating Learning Outcomes, Security, Privacy, and Data Integrity Issues… view at source ↗

**Figure 1.** Figure 1: Research Methodology Process • Phase 3: Data Preprocessing and Model Training – Preprocessing of the collected survey data and training of predictive models based on the validated responses. • Phase 4: Probabilistic Cost and Effort Prediction Modeling – Development of a probabilistic cost and effort prediction model grounded in the taxonomies (Phase 1) and the empirically collected and analysed data (Phas… view at source ↗

**Figure 2.** Figure 2: Demographic characteristics of the survey respondents ( [PITH_FULL_IMAGE:figures/full_fig_p012_2.png] view at source ↗

**Figure 2.** Figure 2: Demographic characteristics of the survey respondents ( [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗

**Figure 3.** Figure 3: Prediction Model ø Final Insight from Factor-Level Optimization Factor-level optimization indicates that effective LLM integration is not achieved by uniformly addressing all factors in a specific category but by selectively prioritizing those that deliver the highest impact relative to effort. The results show that stronger outcomes are associated with investments in deep learning processes, structured fe… view at source ↗

**Figure 3.** Figure 3: Prediction Model only an automation tool [49]. In risk-oriented domains, the model emphasizes Bias and Hallucination in LLM Outputs, Security, Privacy, and Data Integrity Issues, and Reduced Critical Thinking and ProblemSolving. The third layer, represented through category-level efficiency gains (∆Fitness), supports staged implementation planning. Pedagogical categories such as Collaboration and Peer Le… view at source ↗

read the original abstract

Context: Large Language Models (LLMs) are increasingly influencing software engineering practice and education. While prior studies examine their technical performance and classroom use, limited research provides cost-aware and empirically grounded models for systematic institutional integration. Objective: This study develops and validates a prediction model to identify cost-efficient strategies for integrating LLMs into software engineering education using motivating and demotivating factors. Method: Based on our previously developed literature survey taxonomies [1], we operationalized 19 validated factors (9 motivators and 10 demotivators) into a structured survey completed by 126 stakeholders from multiple countries. Likert-scale responses were encoded and used to train probabilistic models (Naive Bayes and Logistic Regression) to estimate the likelihood of high LLM familiarity. The probability estimates were integrated into a Genetic Algorithm (GA)-based optimization framework to model trade-offs between predicted familiarity and implementation cost at global and category levels. Results: Respondents perceived strong benefits in Programming Assistance and Debugging Support and Personalized and Adaptive Learning. Major concerns included Plagiarism and Intellectual Property Concerns, Over-Reliance on AI in Learning, and Reduced Critical Thinking and Problem Solving. Optimization results indicate that governance-related mechanisms, particularly integrity and ethical safeguards, should be prioritized under cost constraints. Conclusions: The study introduces an optimization-informed decision support framework linking stakeholder perceptions with probabilistic modeling and cost-effort analysis. The model supports staged and cost-aware LLM integration grounded in governance stability and pedagogically meaningful development.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper runs a survey of 126 stakeholders through Naive Bayes, logistic regression, and a genetic algorithm to rank cost-aware LLM integration steps in SE education, but reports no model performance numbers or cost assignment method.

read the letter

The main takeaway is that this work turns a prior taxonomy into a survey, fits two standard probabilistic models to predict high LLM familiarity, and then uses a genetic algorithm to suggest which factors to prioritize when budgets are tight. The survey itself covers a reasonable spread of countries and flags clear patterns: strong perceived upside in programming assistance and adaptive learning, with big worries around plagiarism, over-reliance, and loss of critical thinking. The optimization step points to governance and ethics safeguards as the place to start under cost limits, which aligns with what many programs are already discussing.

Referee Report

2 major / 3 minor

Summary. The paper develops a prediction model for motivators and demotivators of LLM integration in software engineering education. It operationalizes 19 factors (9 motivators, 10 demotivators) drawn from the authors' prior taxonomy into a survey completed by 126 stakeholders across countries. Likert responses are encoded to train Naive Bayes and Logistic Regression models estimating the probability of high LLM familiarity; these probabilities feed a Genetic Algorithm that optimizes trade-offs between predicted familiarity and implementation costs at global and category levels. Results emphasize benefits in programming assistance and debugging while highlighting concerns over plagiarism, over-reliance, and reduced critical thinking, with optimization favoring governance mechanisms such as integrity and ethical safeguards.

Significance. If the probabilistic models are validated and cost assignments are made transparent and reproducible, the work could supply a practical, optimization-based decision-support framework that connects stakeholder perceptions to staged, cost-aware LLM integration strategies. The combination of empirical survey data, probabilistic modeling, and GA-driven trade-off analysis extends prior taxonomies and offers falsifiable outputs (e.g., prioritized factor lists under varying cost constraints) that institutions could test.

major comments (2)

[Abstract / Method] Abstract and Method: No performance metrics (accuracy, AUC, calibration, or cross-validation results) are reported for the Naive Bayes and Logistic Regression models. Because the probability estimates P(high familiarity) are the direct inputs to the Genetic Algorithm, the lack of validation leaves the optimization outputs sensitive to unexamined model quality rather than demonstrably supported by the 126 responses.
[Method] Method: The procedure for deriving or assigning concrete implementation cost values to the 19 factors (including governance safeguards) is not described. Without explicit cost quantification, weighting scheme, or sensitivity analysis, the GA results that prioritize integrity/ethics under cost constraints rest on unspecified assumptions and cannot be reproduced or stress-tested from the stakeholder data alone.

minor comments (3)

[Method] Clarify the exact encoding scheme used to convert Likert-scale responses into features for the probabilistic models.
[Method] Report the specific GA hyperparameters, population size, number of generations, and fitness function formulation.
[Discussion / Limitations] Add a limitations subsection discussing response bias, sample representativeness across countries, and dependence on the authors' earlier taxonomy.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback, which identifies key areas for strengthening the methodological rigor and reproducibility of our work. We address each major comment point by point below, indicating the revisions planned for the next version of the manuscript.

read point-by-point responses

Referee: [Abstract / Method] Abstract and Method: No performance metrics (accuracy, AUC, calibration, or cross-validation results) are reported for the Naive Bayes and Logistic Regression models. Because the probability estimates P(high familiarity) are the direct inputs to the Genetic Algorithm, the lack of validation leaves the optimization outputs sensitive to unexamined model quality rather than demonstrably supported by the 126 responses.

Authors: We acknowledge the validity of this observation. The manuscript presents the application of the trained models to generate probability estimates for the Genetic Algorithm but does not include explicit performance evaluation. In the revised manuscript, we will add a dedicated subsection in the Method section reporting 5-fold cross-validation results, including accuracy, AUC-ROC, precision, recall, F1-score, and Brier score for calibration, for both Naive Bayes and Logistic Regression. We will also discuss the implications of these metrics for the reliability of the P(high familiarity) inputs to the optimization framework. revision: yes
Referee: [Method] Method: The procedure for deriving or assigning concrete implementation cost values to the 19 factors (including governance safeguards) is not described. Without explicit cost quantification, weighting scheme, or sensitivity analysis, the GA results that prioritize integrity/ethics under cost constraints rest on unspecified assumptions and cannot be reproduced or stress-tested from the stakeholder data alone.

Authors: This comment correctly highlights a transparency gap. The current text describes the integration of costs into the GA but does not specify their derivation. We will revise the Method section to include an explicit description of the cost assignment process: the numerical values assigned to each of the 19 factors, the basis for those values (expert estimation informed by educational resource literature), the weighting scheme applied at global and category levels, and a sensitivity analysis that varies cost parameters to demonstrate the robustness of the prioritization of integrity and ethical safeguards. These additions will enable full reproducibility and stress-testing. revision: yes

Circularity Check

1 steps flagged

Self-citation for factor taxonomy but independent survey data and modeling provide new content

specific steps

self citation load bearing [Abstract / Method]
"Based on our previously developed literature survey taxonomies [1], we operationalized 19 validated factors (9 motivators and 10 demotivators) into a structured survey completed by 126 stakeholders from multiple countries. Likert-scale responses were encoded and used to train probabilistic models (Naive Bayes and Logistic Regression) to estimate the likelihood of high LLM familiarity."

The selection and operationalization of the 19 factors that define the entire input space for the probabilistic models and GA optimization rests solely on the authors' overlapping prior publication [1]; while new survey responses are collected, the factor structure itself is not re-derived or externally validated within this paper and therefore carries the self-citation into the central modeling pipeline.

full rationale

The paper selects its 19 motivators and demotivators from the authors' own prior literature survey taxonomy [1] and then collects fresh Likert-scale responses from 126 stakeholders to train Naive Bayes and Logistic Regression models whose outputs feed a Genetic Algorithm. This self-citation defines the input structure but does not make the subsequent probability estimates or optimization results equivalent to the prior taxonomy by construction; the empirical data and fitted models introduce independent content. No equations reduce predictions to inputs, no uniqueness claims are imported, and no ansatz is smuggled. The central decision-support framework therefore retains non-circular empirical grounding despite the self-referential starting taxonomy.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The framework rests on the assumption that the prior taxonomy factors are complete and that survey data can be directly mapped to cost-benefit optimization without additional validation steps.

free parameters (2)

Logistic regression and Naive Bayes model parameters
Fitted to the 126 survey responses to predict high LLM familiarity likelihood.
Genetic algorithm hyperparameters and cost weights
Chosen to balance predicted familiarity against unspecified implementation costs at global and category levels.

axioms (2)

domain assumption The 19 factors from the authors' prior literature survey accurately represent stakeholder motivators and demotivators.
Invoked when operationalizing factors into the survey instrument.
standard math Likert-scale responses provide valid quantitative input for probabilistic modeling.
Standard assumption when encoding survey answers for Naive Bayes and logistic regression.

pith-pipeline@v0.9.0 · 5819 in / 1495 out tokens · 34970 ms · 2026-05-20T23:12:32.883003+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

The probability estimates were integrated into a Genetic Algorithm (GA)-based optimization framework to model trade-offs between predicted familiarity and implementation cost at global and category levels.
IndisputableMonolith/Foundation/AbsoluteFloorClosure.lean absolute_floor_iff_bare_distinguishability unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Optimization results indicate that governance-related mechanisms, particularly integrity and ethical safeguards, should be prioritized under cost constraints.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

57 extracted references · 57 canonical work pages · 2 internal anchors

[1]

M. Khan, M. A. Akbar, J. Kasurinen, Integrating llms in software engineering education: motiva- tors, demotivators, and a roadmap towards a framework for finnish higher education institutes, in: Proceedings of the 2025 29th International Conference on Evaluation and Assessment in Software Engineering Companion, 2025, pp. 182–191

work page 2025
[2]

M. Y. Shaheen, Applications of artificial intelligence (ai) in healthcare: A review, ScienceOpen Preprints (2021)

work page 2021
[3]

Cao, Ai in finance: challenges, techniques, and opportunities, ACM Computing Surveys (CSUR) 55 (3) (2022) 1–38

L. Cao, Ai in finance: challenges, techniques, and opportunities, ACM Computing Surveys (CSUR) 55 (3) (2022) 1–38

work page 2022
[4]

K. D. Forbus, J. Laird, Guest editors’ introduction: Ai and the entertainment industry, IEEE Intelligent Systems 17 (04) (2002) 15–16

work page 2002
[5]

Y.Jin, L.Yan, V.Echeverria, D.Gašević, R.Martinez-Maldonado, Generativeaiinhighereducation: A global perspective of institutional adoption policies and guidelines, Computers and Education: Artificial Intelligence 8 (2025) 100348

work page 2025
[6]

X. Zhai, X. Chu, C. S. Chai, M. S. Y. Jong, A. Istenic, M. Spector, J.-B. Liu, J. Yuan, Y. Li, A review of artificial intelligence (ai) in education from 2010 to 2020, Complexity 2021 (1) (2021) 8812542

work page 2010
[7]

W. X. Zhao, K. Zhou, J. Li, T. Tang, X. Wang, Y. Hou, Y. Min, B. Zhang, J. Zhang, Z. Dong, et al., A survey of large language models, arXiv preprint arXiv:2303.18223 1 (2) (2023) 1–124

work page internal anchor Pith review Pith/arXiv arXiv 2023
[8]

M. A. Akbar, A. A. Khan, P. Liang, Ethical aspects of chatgpt in software engineering research, IEEE Transactions on Artificial Intelligence 6 (2) (2023) 254–267

work page 2023
[9]

J. He, C. Treude, D. Lo, Llm-based multi-agent systems for software engineering: Literature review, vision, and the road ahead, ACM Transactions on Software Engineering and Methodology 34 (5) (2025) 1–30

work page 2025
[10]

Kharrufa, S

A. Kharrufa, S. Alghamdi, A. Aziz, C. Bull, Llms integration in software engineering team projects: Roles, impact, and a pedagogical design space for ai tools in computing education, ACM Transactions on Computing Education 26 (2) (2026) 1–27

work page 2026
[11]

A. Fan, B. Gokkaya, M. Harman, M. Lyubarskiy, S. Sengupta, S. Yoo, J. M. Zhang, Large language models for software engineering: Survey and open problems, in: 2023 IEEE/ACM International Conference on Software Engineering: Future of Software Engineering (ICSE-FoSE), IEEE, 2023, pp. 31–53

work page 2023
[12]

V. D. Kirova, C. S. Ku, J. R. Laracy, T. J. Marlowe, Software engineering education must adapt and evolve for an llm environment, in: Proceedings of the 55th ACM technical symposium on computer science education v. 1, 2024, pp. 666–672

work page 2024
[13]

M. Daun, J. Brings, How chatgpt will change software engineering education, in: Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education V. 1, 2023, pp. 110–116

work page 2023
[14]

Banerjee, A

P. Banerjee, A. K. Srivastava, D. A. Adjeroh, R. Reddy, N. Karimian, Understanding chatgpt: Impact analysis and path forward for teaching computer science and engineering, IEEE Access 13 (2025) 11049–11069

work page 2025
[15]

Feldt, F

R. Feldt, F. G. de Oliveira Neto, R. Torkar, Ways of applying artificial intelligence in software engineering, in: Proceedings of the 6th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering, 2018, pp. 35–41

work page 2018
[16]

C. K. Lo, What is the impact of chatgpt on education? a rapid review of the literature, Education sciences 13 (4) (2023) 410

work page 2023
[17]

On the Opportunities and Risks of Foundation Models

R. Bommasani, D. A. Hudson, E. Adeli, R. Altman, S. Arora, S. von Arx, M. S. Bernstein, J. Bohg, A. Bosselut, E. Brunskill, et al., On the opportunities and risks of foundation models, arXiv preprint arXiv:2108.07258 (2021). 24

work page internal anchor Pith review Pith/arXiv arXiv 2021
[18]

B. A. Becker, P. Denny, J. Finnie-Ansley, A. Luxton-Reilly, J. Prather, E. A. Santos, Programming is hard-or at least it used to be: Educational opportunities and challenges of ai code generation, in: Proceedings of the 54th ACM Technical Symposium on Computer Science Education V. 1, 2023, pp. 500–506

work page 2023
[19]

R. FattahiBavandpour, Advancing education with large language models: A systematic review of potential, limitations, and business opportunities, Master’s thesis, LUT University, Lappeenranta, Finland (2024)

work page 2024
[20]

M. V. Macias, L. Kharlashkin, L. E. Huovinen, M. Hämäläinen, Empowering teachers with usability- oriented llm-based tools for digital pedagogy, in: Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities, 2024, pp. 549–557

work page 2024
[21]

E. M. Bender, T. Gebru, A. McMillan-Major, S. Shmitchell, On the dangers of stochastic parrots: Can language models be too big?, in: Proceedings of the 2021 ACM conference on fairness, account- ability, and transparency, 2021, pp. 610–623

work page 2021
[22]

A. A. Khan, M. A. Akbar, M. Fahmideh, P. Liang, M. Waseem, A. Ahmad, M. Niazi, P. Abrahams- son, Ai ethics: an empirical study on the views of practitioners and lawmakers, IEEE Transactions on Computational Social Systems 10 (6) (2023) 2971–2984

work page 2023
[23]

A. A. Khan, S. Badshah, P. Liang, M. Waseem, B. Khan, A. Ahmad, M. Fahmideh, M. Niazi, M. A. Akbar, Ethics of ai: A systematic literature review of principles and challenges, in: Proceedings of the 26th international conference on evaluation and assessment in software engineering, 2022, pp. 383–392

work page 2022
[24]

D. E. Goldberg, Genetic Algorithms in Search, Optimization, and Machine Learning, Addison- Wesley, Reading, MA, 1989

work page 1989
[25]

Kumar, M

A. Kumar, M. Nadeem, M. Shameem, Machine learning based predictive modeling to effectively implement devops practices in software organizations, Automated Software Engineering 30 (2) (2023) 21

work page 2023
[26]

Kumar, M

A. Kumar, M. Nadeem, M. Shameem, Metaheuristic-based cost-effective predictive modeling for devops project success, Applied Soft Computing 163 (2024) 111834

work page 2024
[27]

A. A. Khan, M. A. Akbar, V. Lahtinen, M. Paavola, M. Niazi, M. N. Alatawi, S. D. Alotaibi, Agile meets quantum: a novel genetic algorithm model for predicting the success of quantum software development project, Automated Software Engineering 31 (1) (2024) 34

work page 2024
[28]

Pereira, J.-M

J. Pereira, J.-M. López, X. Garmendia, M. Azanza, Leveraging open source llms for software en- gineering education and training, in: 2024 36th International Conference on Software Engineering Education and Training (CSEE&T), IEEE, 2024, pp. 1–10

work page 2024
[29]

Washizaki, Guide to the software engineering body of knowledge, IEEE Computer Society (2024)

H. Washizaki, Guide to the software engineering body of knowledge, IEEE Computer Society (2024)

work page 2024
[30]

T. Song, H. Zhang, Y. Xiao, A high-quality generation approach for educational programming projects using llm, IEEE Transactions on Learning Technologies 17 (2024) 2242–2255

work page 2024
[31]

A. T. Neumann, Y. Yin, S. Sowe, S. Decker, M. Jarke, An llm-driven chatbot in higher education for databases and information systems, IEEE Transactions on Education 68 (1) (2024) 103–116

work page 2024
[32]

Finnie-Ansley, P

J. Finnie-Ansley, P. Denny, B. A. Becker, A. Luxton-Reilly, J. Prather, The robots are coming: Exploring the implications of openai codex on introductory programming, in: Proceedings of the 24th Australasian computing education conference, 2022, pp. 10–19

work page 2022
[33]

W. Lyu, Y. Wang, T. Chung, Y. Sun, Y. Zhang, Evaluating the effectiveness of llms in introductory computer science education: A semester-long field study, in: Proceedings of the eleventh ACM conference on learning@ scale, 2024, pp. 63–74

work page 2024
[34]

Kazemitabaar, J

M. Kazemitabaar, J. Chow, C. K. T. Ma, B. J. Ericson, D. Weintrop, T. Grossman, Studying the effect of ai code generators on supporting novice learners in introductory programming, in: Proceedings of the 2023 CHI conference on human factors in computing systems, 2023, pp. 1–23. 25

work page 2023
[35]

Zönnchen, V

B. Zönnchen, V. Thurner, A. Böttcher, On the impact of chatgpt on teaching and studying software engineering, in: 2024 IEEE Global Engineering Education Conference (EDUCON), IEEE, 2024, pp. 1–10

work page 2024
[36]

Denny, J

P. Denny, J. Prather, B. A. Becker, J. Finnie-Ansley, A. Hellas, J. Leinonen, A. Luxton-Reilly, B. N. Reeves, E. A. Santos, S. Sarsa, Computing education in the era of generative ai, Communications of the ACM 67 (2) (2024) 56–67

work page 2024
[37]

M. A. Akbar, A. A. Khan, M. Shameem, M. Nadeem, Genetic model-based success probability prediction of quantum software development projects, Information and Software Technology 165 (2024) 107352

work page 2024
[38]

Shameem, M

M. Shameem, M. Nadeem, A. T. Zamani, Genetic algorithm based probabilistic model for agile project success in global software development, Applied Soft Computing 135 (2023) 109998

work page 2023
[39]

A. A. Khan, J. Keung, M. Niazi, S. Hussain, A. Ahmad, Systematic literature review and empirical investigation of barriers to process improvement in global software development: Client–vendor perspective, Information and Software Technology 87 (2017) 180–205

work page 2017
[40]

B. A. Kitchenham, S. L. Pfleeger, L. M. Pickard, P. W. Jones, D. C. Hoaglin, K. El Emam, J. Rosen- berg, Preliminary guidelines for empirical research in software engineering, IEEE Transactions on software engineering 28 (8) (2002) 721–734

work page 2002
[41]

Norman, Likert scales, levels of measurement and the “laws” of statistics, Advances in Health Sciences Education 15 (5) (2010) 625–632

G. Norman, Likert scales, levels of measurement and the “laws” of statistics, Advances in Health Sciences Education 15 (5) (2010) 625–632

work page 2010
[42]

G. M. Sullivan, A. R. Artino Jr, Analyzing and interpreting data from likert-type scales, Journal of graduate medical education 5 (4) (2013) 541–542

work page 2013
[43]

S. E. Harpe, How to analyze likert and other rating scale data, Currents in pharmacy teaching and learning 7 (6) (2015) 836–850

work page 2015
[44]

Hastie, R

T. Hastie, R. Tibshirani, J. Friedman, The Elements of Statistical Learning, Springer, 2009

work page 2009
[45]

M. Khan, M. A. Akbar, J. Kasurinen, Prediction model of motivators and demotivators of integrating large language models in software engineering education: An empirical study (2026).doi:10.5281/ zenodo.18840653

work page 2026
[46]

Wohlin, P

C. Wohlin, P. Runeson, M. Höst, M. C. Ohlsson, B. Regnell, A. Wesslén, et al., Experimentation in software engineering, Vol. 236, Springer, 2012

work page 2012
[47]

Vaithilingam, T

P. Vaithilingam, T. Zhang, E. L. Glassman, Expectation vs. experience: Evaluating the usability of code generation tools powered by large language models, in: Chi conference on human factors in computing systems extended abstracts, 2022, pp. 1–7

work page 2022
[48]

Barke, M

S. Barke, M. B. James, N. Polikarpova, Grounded copilot: How programmers interact with code- generating models, Proceedings of the ACM on Programming Languages 7 (OOPSLA1) (2023) 85–111

work page 2023
[49]

Kasneci, K

E. Kasneci, K. Seßler, S. Küchemann, M. Bannert, D. Dementieva, F. Fischer, U. Gasser, G. Groh, S. Günnemann, E. Hüllermeier, et al., Chatgpt for good? on opportunities and challenges of large language models for education, Learning and individual differences 103 (2023) 102274

work page 2023
[50]

D. R. Cotton, P. A. Cotton, J. R. Shipway, Chatting and cheating: Ensuring academic integrity in the era of chatgpt, Innovations in education and teaching international 61 (2) (2024) 228–239

work page 2024
[51]

so what if chatgpt wrote it?

N. Kshetri, L. Hughes, E. louise Slade, A. Jeyaraj, A. kumar Kar, A. Koohang, V. Raghavan, M. Ahuja, H. Albanna, M. ahmad Albashrawi, et al., “so what if chatgpt wrote it?” multidisciplinary perspectivesonopportunities, challengesandimplicationsofgenerativeconversationalaiforresearch, practice and policy, International Journal of Information Management 71...

work page 2023
[52]

A. Ng, M. Jordan, On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes, Advances in neural information processing systems 14 (2001). 26

work page 2001
[53]

Domingos, M

P. Domingos, M. Pazzani, On the optimality of the simple bayesian classifier under zero-one loss, Machine learning 29 (2) (1997) 103–130

work page 1997
[54]

T. G. Dietterich, Ensemble methods in machine learning, in: International workshop on multiple classifier systems, Springer, 2000, pp. 1–15

work page 2000
[55]

K. Deb, Multi-objective optimisation using evolutionary algorithms: an introduction, in: Multi- objective evolutionary optimisation for product design and manufacturing, Springer, 2011, pp. 3–34

work page 2011
[56]

Zawacki-Richter, V

O. Zawacki-Richter, V. I. Marín, M. Bond, F. Gouverneur, Systematic review of research on artifi- cial intelligence applications in higher education–where are the educators?, International journal of educational technology in higher education 16 (1) (2019) 39

work page 2019
[57]

Venkatesh, M

V. Venkatesh, M. G. Morris, G. B. Davis, F. D. Davis, User acceptance of information technology: Toward a unified view1, MIS quarterly 27 (3) (2003) 425–478. 27

work page 2003

[1] [1]

M. Khan, M. A. Akbar, J. Kasurinen, Integrating llms in software engineering education: motiva- tors, demotivators, and a roadmap towards a framework for finnish higher education institutes, in: Proceedings of the 2025 29th International Conference on Evaluation and Assessment in Software Engineering Companion, 2025, pp. 182–191

work page 2025

[2] [2]

M. Y. Shaheen, Applications of artificial intelligence (ai) in healthcare: A review, ScienceOpen Preprints (2021)

work page 2021

[3] [3]

Cao, Ai in finance: challenges, techniques, and opportunities, ACM Computing Surveys (CSUR) 55 (3) (2022) 1–38

L. Cao, Ai in finance: challenges, techniques, and opportunities, ACM Computing Surveys (CSUR) 55 (3) (2022) 1–38

work page 2022

[4] [4]

K. D. Forbus, J. Laird, Guest editors’ introduction: Ai and the entertainment industry, IEEE Intelligent Systems 17 (04) (2002) 15–16

work page 2002

[5] [5]

Y.Jin, L.Yan, V.Echeverria, D.Gašević, R.Martinez-Maldonado, Generativeaiinhighereducation: A global perspective of institutional adoption policies and guidelines, Computers and Education: Artificial Intelligence 8 (2025) 100348

work page 2025

[6] [6]

X. Zhai, X. Chu, C. S. Chai, M. S. Y. Jong, A. Istenic, M. Spector, J.-B. Liu, J. Yuan, Y. Li, A review of artificial intelligence (ai) in education from 2010 to 2020, Complexity 2021 (1) (2021) 8812542

work page 2010

[7] [7]

W. X. Zhao, K. Zhou, J. Li, T. Tang, X. Wang, Y. Hou, Y. Min, B. Zhang, J. Zhang, Z. Dong, et al., A survey of large language models, arXiv preprint arXiv:2303.18223 1 (2) (2023) 1–124

work page internal anchor Pith review Pith/arXiv arXiv 2023

[8] [8]

M. A. Akbar, A. A. Khan, P. Liang, Ethical aspects of chatgpt in software engineering research, IEEE Transactions on Artificial Intelligence 6 (2) (2023) 254–267

work page 2023

[9] [9]

J. He, C. Treude, D. Lo, Llm-based multi-agent systems for software engineering: Literature review, vision, and the road ahead, ACM Transactions on Software Engineering and Methodology 34 (5) (2025) 1–30

work page 2025

[10] [10]

Kharrufa, S

A. Kharrufa, S. Alghamdi, A. Aziz, C. Bull, Llms integration in software engineering team projects: Roles, impact, and a pedagogical design space for ai tools in computing education, ACM Transactions on Computing Education 26 (2) (2026) 1–27

work page 2026

[11] [11]

A. Fan, B. Gokkaya, M. Harman, M. Lyubarskiy, S. Sengupta, S. Yoo, J. M. Zhang, Large language models for software engineering: Survey and open problems, in: 2023 IEEE/ACM International Conference on Software Engineering: Future of Software Engineering (ICSE-FoSE), IEEE, 2023, pp. 31–53

work page 2023

[12] [12]

V. D. Kirova, C. S. Ku, J. R. Laracy, T. J. Marlowe, Software engineering education must adapt and evolve for an llm environment, in: Proceedings of the 55th ACM technical symposium on computer science education v. 1, 2024, pp. 666–672

work page 2024

[13] [13]

M. Daun, J. Brings, How chatgpt will change software engineering education, in: Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education V. 1, 2023, pp. 110–116

work page 2023

[14] [14]

Banerjee, A

P. Banerjee, A. K. Srivastava, D. A. Adjeroh, R. Reddy, N. Karimian, Understanding chatgpt: Impact analysis and path forward for teaching computer science and engineering, IEEE Access 13 (2025) 11049–11069

work page 2025

[15] [15]

Feldt, F

R. Feldt, F. G. de Oliveira Neto, R. Torkar, Ways of applying artificial intelligence in software engineering, in: Proceedings of the 6th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering, 2018, pp. 35–41

work page 2018

[16] [16]

C. K. Lo, What is the impact of chatgpt on education? a rapid review of the literature, Education sciences 13 (4) (2023) 410

work page 2023

[17] [17]

On the Opportunities and Risks of Foundation Models

R. Bommasani, D. A. Hudson, E. Adeli, R. Altman, S. Arora, S. von Arx, M. S. Bernstein, J. Bohg, A. Bosselut, E. Brunskill, et al., On the opportunities and risks of foundation models, arXiv preprint arXiv:2108.07258 (2021). 24

work page internal anchor Pith review Pith/arXiv arXiv 2021

[18] [18]

B. A. Becker, P. Denny, J. Finnie-Ansley, A. Luxton-Reilly, J. Prather, E. A. Santos, Programming is hard-or at least it used to be: Educational opportunities and challenges of ai code generation, in: Proceedings of the 54th ACM Technical Symposium on Computer Science Education V. 1, 2023, pp. 500–506

work page 2023

[19] [19]

R. FattahiBavandpour, Advancing education with large language models: A systematic review of potential, limitations, and business opportunities, Master’s thesis, LUT University, Lappeenranta, Finland (2024)

work page 2024

[20] [20]

M. V. Macias, L. Kharlashkin, L. E. Huovinen, M. Hämäläinen, Empowering teachers with usability- oriented llm-based tools for digital pedagogy, in: Proceedings of the 4th International Conference on Natural Language Processing for Digital Humanities, 2024, pp. 549–557

work page 2024

[21] [21]

E. M. Bender, T. Gebru, A. McMillan-Major, S. Shmitchell, On the dangers of stochastic parrots: Can language models be too big?, in: Proceedings of the 2021 ACM conference on fairness, account- ability, and transparency, 2021, pp. 610–623

work page 2021

[22] [22]

A. A. Khan, M. A. Akbar, M. Fahmideh, P. Liang, M. Waseem, A. Ahmad, M. Niazi, P. Abrahams- son, Ai ethics: an empirical study on the views of practitioners and lawmakers, IEEE Transactions on Computational Social Systems 10 (6) (2023) 2971–2984

work page 2023

[23] [23]

A. A. Khan, S. Badshah, P. Liang, M. Waseem, B. Khan, A. Ahmad, M. Fahmideh, M. Niazi, M. A. Akbar, Ethics of ai: A systematic literature review of principles and challenges, in: Proceedings of the 26th international conference on evaluation and assessment in software engineering, 2022, pp. 383–392

work page 2022

[24] [24]

D. E. Goldberg, Genetic Algorithms in Search, Optimization, and Machine Learning, Addison- Wesley, Reading, MA, 1989

work page 1989

[25] [25]

Kumar, M

A. Kumar, M. Nadeem, M. Shameem, Machine learning based predictive modeling to effectively implement devops practices in software organizations, Automated Software Engineering 30 (2) (2023) 21

work page 2023

[26] [26]

Kumar, M

A. Kumar, M. Nadeem, M. Shameem, Metaheuristic-based cost-effective predictive modeling for devops project success, Applied Soft Computing 163 (2024) 111834

work page 2024

[27] [27]

A. A. Khan, M. A. Akbar, V. Lahtinen, M. Paavola, M. Niazi, M. N. Alatawi, S. D. Alotaibi, Agile meets quantum: a novel genetic algorithm model for predicting the success of quantum software development project, Automated Software Engineering 31 (1) (2024) 34

work page 2024

[28] [28]

Pereira, J.-M

J. Pereira, J.-M. López, X. Garmendia, M. Azanza, Leveraging open source llms for software en- gineering education and training, in: 2024 36th International Conference on Software Engineering Education and Training (CSEE&T), IEEE, 2024, pp. 1–10

work page 2024

[29] [29]

Washizaki, Guide to the software engineering body of knowledge, IEEE Computer Society (2024)

H. Washizaki, Guide to the software engineering body of knowledge, IEEE Computer Society (2024)

work page 2024

[30] [30]

T. Song, H. Zhang, Y. Xiao, A high-quality generation approach for educational programming projects using llm, IEEE Transactions on Learning Technologies 17 (2024) 2242–2255

work page 2024

[31] [31]

A. T. Neumann, Y. Yin, S. Sowe, S. Decker, M. Jarke, An llm-driven chatbot in higher education for databases and information systems, IEEE Transactions on Education 68 (1) (2024) 103–116

work page 2024

[32] [32]

Finnie-Ansley, P

J. Finnie-Ansley, P. Denny, B. A. Becker, A. Luxton-Reilly, J. Prather, The robots are coming: Exploring the implications of openai codex on introductory programming, in: Proceedings of the 24th Australasian computing education conference, 2022, pp. 10–19

work page 2022

[33] [33]

W. Lyu, Y. Wang, T. Chung, Y. Sun, Y. Zhang, Evaluating the effectiveness of llms in introductory computer science education: A semester-long field study, in: Proceedings of the eleventh ACM conference on learning@ scale, 2024, pp. 63–74

work page 2024

[34] [34]

Kazemitabaar, J

M. Kazemitabaar, J. Chow, C. K. T. Ma, B. J. Ericson, D. Weintrop, T. Grossman, Studying the effect of ai code generators on supporting novice learners in introductory programming, in: Proceedings of the 2023 CHI conference on human factors in computing systems, 2023, pp. 1–23. 25

work page 2023

[35] [35]

Zönnchen, V

B. Zönnchen, V. Thurner, A. Böttcher, On the impact of chatgpt on teaching and studying software engineering, in: 2024 IEEE Global Engineering Education Conference (EDUCON), IEEE, 2024, pp. 1–10

work page 2024

[36] [36]

Denny, J

P. Denny, J. Prather, B. A. Becker, J. Finnie-Ansley, A. Hellas, J. Leinonen, A. Luxton-Reilly, B. N. Reeves, E. A. Santos, S. Sarsa, Computing education in the era of generative ai, Communications of the ACM 67 (2) (2024) 56–67

work page 2024

[37] [37]

M. A. Akbar, A. A. Khan, M. Shameem, M. Nadeem, Genetic model-based success probability prediction of quantum software development projects, Information and Software Technology 165 (2024) 107352

work page 2024

[38] [38]

Shameem, M

M. Shameem, M. Nadeem, A. T. Zamani, Genetic algorithm based probabilistic model for agile project success in global software development, Applied Soft Computing 135 (2023) 109998

work page 2023

[39] [39]

A. A. Khan, J. Keung, M. Niazi, S. Hussain, A. Ahmad, Systematic literature review and empirical investigation of barriers to process improvement in global software development: Client–vendor perspective, Information and Software Technology 87 (2017) 180–205

work page 2017

[40] [40]

B. A. Kitchenham, S. L. Pfleeger, L. M. Pickard, P. W. Jones, D. C. Hoaglin, K. El Emam, J. Rosen- berg, Preliminary guidelines for empirical research in software engineering, IEEE Transactions on software engineering 28 (8) (2002) 721–734

work page 2002

[41] [41]

Norman, Likert scales, levels of measurement and the “laws” of statistics, Advances in Health Sciences Education 15 (5) (2010) 625–632

G. Norman, Likert scales, levels of measurement and the “laws” of statistics, Advances in Health Sciences Education 15 (5) (2010) 625–632

work page 2010

[42] [42]

G. M. Sullivan, A. R. Artino Jr, Analyzing and interpreting data from likert-type scales, Journal of graduate medical education 5 (4) (2013) 541–542

work page 2013

[43] [43]

S. E. Harpe, How to analyze likert and other rating scale data, Currents in pharmacy teaching and learning 7 (6) (2015) 836–850

work page 2015

[44] [44]

Hastie, R

T. Hastie, R. Tibshirani, J. Friedman, The Elements of Statistical Learning, Springer, 2009

work page 2009

[45] [45]

M. Khan, M. A. Akbar, J. Kasurinen, Prediction model of motivators and demotivators of integrating large language models in software engineering education: An empirical study (2026).doi:10.5281/ zenodo.18840653

work page 2026

[46] [46]

Wohlin, P

C. Wohlin, P. Runeson, M. Höst, M. C. Ohlsson, B. Regnell, A. Wesslén, et al., Experimentation in software engineering, Vol. 236, Springer, 2012

work page 2012

[47] [47]

Vaithilingam, T

P. Vaithilingam, T. Zhang, E. L. Glassman, Expectation vs. experience: Evaluating the usability of code generation tools powered by large language models, in: Chi conference on human factors in computing systems extended abstracts, 2022, pp. 1–7

work page 2022

[48] [48]

Barke, M

S. Barke, M. B. James, N. Polikarpova, Grounded copilot: How programmers interact with code- generating models, Proceedings of the ACM on Programming Languages 7 (OOPSLA1) (2023) 85–111

work page 2023

[49] [49]

Kasneci, K

E. Kasneci, K. Seßler, S. Küchemann, M. Bannert, D. Dementieva, F. Fischer, U. Gasser, G. Groh, S. Günnemann, E. Hüllermeier, et al., Chatgpt for good? on opportunities and challenges of large language models for education, Learning and individual differences 103 (2023) 102274

work page 2023

[50] [50]

D. R. Cotton, P. A. Cotton, J. R. Shipway, Chatting and cheating: Ensuring academic integrity in the era of chatgpt, Innovations in education and teaching international 61 (2) (2024) 228–239

work page 2024

[51] [51]

so what if chatgpt wrote it?

N. Kshetri, L. Hughes, E. louise Slade, A. Jeyaraj, A. kumar Kar, A. Koohang, V. Raghavan, M. Ahuja, H. Albanna, M. ahmad Albashrawi, et al., “so what if chatgpt wrote it?” multidisciplinary perspectivesonopportunities, challengesandimplicationsofgenerativeconversationalaiforresearch, practice and policy, International Journal of Information Management 71...

work page 2023

[52] [52]

A. Ng, M. Jordan, On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes, Advances in neural information processing systems 14 (2001). 26

work page 2001

[53] [53]

Domingos, M

P. Domingos, M. Pazzani, On the optimality of the simple bayesian classifier under zero-one loss, Machine learning 29 (2) (1997) 103–130

work page 1997

[54] [54]

T. G. Dietterich, Ensemble methods in machine learning, in: International workshop on multiple classifier systems, Springer, 2000, pp. 1–15

work page 2000

[55] [55]

K. Deb, Multi-objective optimisation using evolutionary algorithms: an introduction, in: Multi- objective evolutionary optimisation for product design and manufacturing, Springer, 2011, pp. 3–34

work page 2011

[56] [56]

Zawacki-Richter, V

O. Zawacki-Richter, V. I. Marín, M. Bond, F. Gouverneur, Systematic review of research on artifi- cial intelligence applications in higher education–where are the educators?, International journal of educational technology in higher education 16 (1) (2019) 39

work page 2019

[57] [57]

Venkatesh, M

V. Venkatesh, M. G. Morris, G. B. Davis, F. D. Davis, User acceptance of information technology: Toward a unified view1, MIS quarterly 27 (3) (2003) 425–478. 27

work page 2003