pith. sign in

arxiv: 2605.17675 · v1 · pith:ZWQ5F4RXnew · submitted 2026-05-17 · 💻 cs.SE · cond-mat.mtrl-sci

Bridging the Gap on AI-Assisted Scientific Software Development Through Transparency and Traceability

Pith reviewed 2026-05-19 22:03 UTC · model grok-4.3

classification 💻 cs.SE cond-mat.mtrl-sci
keywords AI-assisted developmentscientific softwaresoftware quality assuranceNQA-1verification and validationtraceabilitytransparencyTMAP8
0
0 comments X

The pith

A structured framework allows AI-assisted verification and validation in scientific software to meet NQA-1 standards while preserving human accountability.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper notes that researchers already use large language models for code, tests, and documentation in scientific software, but this happens without formal acknowledgment or guidance. It proposes a structured framework for governing AI-assisted development specifically in contexts that require strict Software Quality Assurance such as NQA-1. The framework is demonstrated through AI-assisted verification and validation case development on the TMAP8 tritium migration code, where known solutions make correctness measurable and all artifacts remain fully auditable. The guidance aims to establish disclosure and review standards that keep traceability intact and place final responsibility with humans. A reader would care because ungoverned AI use creates systemic risks to the quality of safety-relevant modeling tools used in fields like fusion energy.

Core claim

The authors claim that a structured framework for AI-assisted V&V case development, built from practical experience with TMAP8, operates within NQA-1 requirements, preserves human accountability, and establishes the disclosure and review standards that responsible AI-assisted scientific software development demands.

What carries the argument

The structured framework for AI-assisted verification and validation case development, which treats cases with known solutions as the proving ground because correctness is objectively measurable, errors are identifiable by design, and all artifacts remain fully auditable.

If this is right

  • AI-assisted code development can be integrated into formal workflows without compromising the traceability and independent verification required by NQA-1.
  • Validation cases with known solutions serve as an effective starting point for establishing governance because errors are identifiable by design and artifacts are fully auditable.
  • Human accountability is maintained by requiring documented procedures, disclosure of AI contributions, and human review at each step.
  • Responsible AI use in scientific software requires specific disclosure and review standards in both internal workflows and published work.
  • The approach demonstrates that strict quality assurance standards need not prohibit AI assistance if governance is put in place first.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same governance principles could be tested on projects outside nuclear applications, such as medical device simulation software, to check whether the framework travels without modification.
  • If the framework succeeds, future standards bodies might incorporate explicit AI disclosure requirements into updated NQA-1 or similar quality guidelines.
  • Researchers could extend the method to AI-generated documentation or test suites and measure whether audit effort increases or decreases compared with fully manual development.
  • A practical test would involve measuring the fraction of AI-generated code that survives human review versus the fraction that requires correction, providing a quantitative check on net benefit.

Load-bearing premise

The structured framework developed from TMAP8 V&V experience can be generalized and implemented across other scientific software projects subject to strict SQA without introducing new compliance gaps.

What would settle it

Applying the framework to a second NQA-1 scientific software project and finding either new compliance gaps or loss of independent verification traceability would falsify the central claim.

read the original abstract

The widespread adoption of AI-assisted development in scientific software is not a future concern -- it is a present reality. Researchers are already using large language models to write code, generate test cases, and draft documentation, yet this practice remains largely unacknowledged and unguided in formal workflows and published work. This ad hoc, ungoverned use of AI represents a systemic risk to scientific software quality, particularly in safety-relevant modeling and simulation tools subject to strict Software Quality Assurance (SQA), or even Nuclear Quality Assurance Level 1 (NQA-1) standards, for which traceability, independent verification, and documented procedures are paramount. The question facing the scientific software community is, therefore, not whether to permit AI-assisted development, but how to govern it responsibly. This paper proposes guidance for AI-assisted code development in the context of strict software quality assurance. Using TMAP8 -- an open-source tritium migration code for fusion energy -- as a demonstration platform, we propose a structured framework for AI-assisted verification and validation (V&V) case development. V&V case development represents the ideal proving ground for establishing that governance: because validation cases have known solutions, correctness is objectively measurable, errors are identifiable by design, and the artifacts are fully auditable. The proposed guidance, developed based on practical experience described herein, operates within NQA-1 requirements, preserves human accountability, and establishes the disclosure and review standards that responsible AI-assisted scientific software development demands.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript argues that ad hoc AI use in scientific software development poses risks to quality and compliance under strict SQA standards such as NQA-1. It proposes a structured framework for AI-assisted verification and validation (V&V) case development, grounded in practical experience with the TMAP8 open-source tritium migration code. The central claim is that this guidance operates within NQA-1 requirements, preserves human accountability, and establishes disclosure and review standards for responsible AI-assisted scientific software development.

Significance. If the framework proves generalizable and explicitly compliant, the work would provide timely, practical guidance for the scientific software community on governing AI tools in safety-relevant modeling and simulation, addressing an emerging gap in traceability and accountability for high-assurance projects.

major comments (2)
  1. [Abstract and TMAP8 demonstration] The claim that the proposed guidance operates within NQA-1 requirements lacks any explicit mapping to specific NQA-1 sections (e.g., 200-series software requirements or 400-series verification and validation). The framework is presented as derived from TMAP8 V&V experience, but without this mapping or an audit of how AI-generated artifacts affect configuration management and independent review, the compliance assertion remains unverified and load-bearing for the central claim.
  2. [Framework proposal and discussion] Generalizability of the TMAP8-derived framework to other NQA-1 projects is asserted without demonstration. The paper does not apply the framework to a second project, consider differing AI integration points (e.g., legacy Fortran vs. modern Python, open-source vs. closed-source), or test for new traceability or procedure gaps, leaving the broad applicability claim unsupported.
minor comments (2)
  1. [Abstract] The abstract would be strengthened by including at least one concrete example of an AI-generated artifact from TMAP8 (e.g., a test case or documentation snippet) along with how it was reviewed and disclosed.
  2. [Framework description] Notation and terminology for the structured framework (e.g., specific steps for disclosure and review) should be defined more explicitly in a dedicated section or table to improve reproducibility for other projects.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which help clarify the need for stronger substantiation of our NQA-1 compliance claims and the generalizability of the proposed framework. We address each major comment in turn and outline the revisions we will make to the manuscript.

read point-by-point responses
  1. Referee: [Abstract and TMAP8 demonstration] The claim that the proposed guidance operates within NQA-1 requirements lacks any explicit mapping to specific NQA-1 sections (e.g., 200-series software requirements or 400-series verification and validation). The framework is presented as derived from TMAP8 V&V experience, but without this mapping or an audit of how AI-generated artifacts affect configuration management and independent review, the compliance assertion remains unverified and load-bearing for the central claim.

    Authors: We agree that the compliance assertion would be strengthened by an explicit mapping to NQA-1 sections. In the revised manuscript we will add a new subsection and accompanying table that directly maps each component of the proposed guidance (disclosure requirements, human accountability mechanisms, traceability protocols, and review standards) to the relevant NQA-1 provisions, including the 200-series software requirements and the 400-series verification and validation requirements. We will also include a brief audit discussion of how AI-generated artifacts are incorporated into configuration management and subjected to independent review, thereby making the compliance claim verifiable. revision: yes

  2. Referee: [Framework proposal and discussion] Generalizability of the TMAP8-derived framework to other NQA-1 projects is asserted without demonstration. The paper does not apply the framework to a second project, consider differing AI integration points (e.g., legacy Fortran vs. modern Python, open-source vs. closed-source), or test for new traceability or procedure gaps, leaving the broad applicability claim unsupported.

    Authors: The framework was intentionally constructed around core principles of transparency, traceability, and preserved human accountability that are intended to be independent of specific implementation details. Nevertheless, we acknowledge that the manuscript asserts broad applicability without demonstrating it on additional projects or across varied integration contexts. In revision we will expand the discussion section to explicitly address adaptations for legacy Fortran versus modern Python codebases, open-source versus closed-source environments, and potential new traceability gaps that could arise. We will also state clearly that while the framework is designed for general use, empirical validation on further projects is identified as future work. revision: partial

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper proposes guidance for AI-assisted scientific software development based on practical experience with TMAP8 V&V case development. No equations, fitted parameters, self-referential definitions, or mathematical derivations are present. The framework is explicitly described as emerging from the experience detailed in the paper rather than being defined in terms of its own outputs or predictions. No self-citation chains, uniqueness theorems, or ansatz smuggling are invoked in a load-bearing manner for the central claim. Assertions about operating within NQA-1 requirements and preserving accountability are presented as outcomes of the described experience, not reductions by construction. This is a proposal paper grounded in external practical work, with generalizability as an empirical claim rather than a circular step.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The proposal assumes existing NQA-1 standards provide a sufficient foundation for AI governance and that human accountability can be maintained through disclosure and review without additional mechanisms.

axioms (1)
  • domain assumption NQA-1 standards mandate traceability, independent verification, and documented procedures for safety-relevant scientific software.
    Invoked throughout as the regulatory context that the guidance must satisfy.

pith-pipeline@v0.9.0 · 5819 in / 1165 out tokens · 33175 ms · 2026-05-19T22:03:53.877335+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

60 extracted references · 60 canonical work pages · 3 internal anchors

  1. [1]

    Vibe coding vs

    Sapkota, R., Roumeliotis, K.I., Karkee, M.: Vibe Coding vs. Agentic Coding: Fundamentals and Practical Implications of Agentic AI (2025). https://arxiv.org/ abs/2505.19443

  2. [2]

    Agentic Much? Adoption of Coding Agents on GitHub

    Robbes, R., Matricon, T., Degueule, T., Hora, A., Zacchiroli, S.: Agentic Much? Adoption of Coding Agents on GitHub (2026). https://arxiv.org/abs/2601.18341

  3. [3]

    The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey

    Masterman, T., Besen, S., Sawtell, M., Chao, A.: The landscape of emerging ai agent architectures for reasoning, planning, and tool calling: A survey. arXiv preprint arXiv:2404.11584 (2024)

  4. [4]

    https://anthropic.com/research/ measuring-agent-autonomy 19

    McCain, M., Millar, T., Huang, S., Eaton, J., Handa, K., Stern, M., Tamkin, A., Kearney, M., Durmus, E., Shen, J., Hong, J., Calvert, B., Chan, J.S., Mosconi, F., Saunders, D., Neylon, T., Nicholas, G., Pollack, S., Clark, J., Ganguli, D.: Measuring AI Agent Autonomy in Practice. https://anthropic.com/research/ measuring-agent-autonomy 19

  5. [5]

    Nature Machine Intelligence8(2), 136–137 (2026) https://doi.org/10.1038/ s42256-026-01178-z

    Ciriello, R.: On the troubling rise of generative ai suspicion in academic publish- ing. Nature Machine Intelligence8(2), 136–137 (2026) https://doi.org/10.1038/ s42256-026-01178-z

  6. [6]

    Kousha, K.: How is chatgpt acknowledged in academic publications? Scientomet- rics129(12), 7959–7969 (2024) https://doi.org/10.1007/s11192-024-05193-y

  7. [7]

    npj Artificial Intelligence 1(1), 14 (2025)

    Zhang, Y., Khan, S.A., Mahmud, A., Yang, H., Lavin, A., Levin, M., Frey, J., Dun- nmon, J., Evans, J., Bundy, A.,et al.: Exploring the role of large language models in the scientific method: from hypothesis to discovery. npj Artificial Intelligence 1(1), 14 (2025)

  8. [8]

    arXiv preprint arXiv:2410.23166 (2024)

    Wang, W., Gu, L., Zhang, L., Luo, Y., Dai, Y., Shen, C., Xie, L., Lin, B., He, X., Ye, J.: Scipip: An llm-based scientific paper idea proposer. arXiv preprint arXiv:2410.23166 (2024)

  9. [9]

    Xie, Q., Weng, Y., Zhu, M., Shen, F., Huang, S., Lin, Z., Zhou, J., Mao, Z., Yang, Z., Yang, L., et al.: How far are ai scientists from changing the world? arXiv preprint arXiv:2507.23276 (2025)

  10. [10]

    The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

    Yamada, Y., Lange, R.T., Lu, C., Hu, S., Lu, C., Foerster, J., Clune, J., Ha, D.: The ai scientist-v2: Workshop-level automated scientific discovery via agentic tree search. arXiv preprint arXiv:2504.08066 (2025)

  11. [11]

    arXiv preprint arXiv:2509.26603 , year=

    Weng, Y., Zhu, M., Xie, Q., Sun, Q., Lin, Z., Liu, S., Zhang, Y.: Deepscien- tist: Advancing frontier-pushing scientific findings progressively. arXiv preprint arXiv:2509.26603 (2025)

  12. [12]

    arXiv preprint arXiv:2603.08127 (2026)

    Lyu, Y., Zhang, X., Yi, X., Zhao, Y., Guo, S., Hu, W., Piotrowski, J., Kaliski, J., Urbani, J., Meng, Z., et al.: Evoscientist: Towards multi-agent evolving ai scientists for end-to-end scientific discovery. arXiv preprint arXiv:2603.08127 (2026)

  13. [13]

    https://gptzero.me/news/neurips/

    Shmatko, N.: GPTZero finds 100 new hallucinations in NeurIPS 2025 accepted papers (2026). https://gptzero.me/news/neurips/

  14. [14]

    https://arxiv.org/abs/2504.20799

    Lee, Y., Song, J.Y., Kim, D., Kim, J., Kim, M., Nam, J.: Hallucination by Code Generation LLMs: Taxonomy, Benchmarks, Mitigation, and Challenges (2025). https://arxiv.org/abs/2504.20799

  15. [15]

    https://arxiv.org/abs/2409.20550

    Zhang, Z., Wang, Y., Wang, C., Chen, J., Zheng, Z.: LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation (2025). https://arxiv.org/abs/2409.20550

  16. [16]

    AI-Generated Code: A Large-Scale Study of Defects, Vulnerabilities, and Complexity (2025)

    Cotroneo, D., Improta, C., Liguori, P.: Human-Written vs. AI-Generated Code: A Large-Scale Study of Defects, Vulnerabilities, and Complexity (2025). https: //arxiv.org/abs/2508.21634 20

  17. [17]

    Automated Software Engineering33(2), 70 (2026)

    Huang, T., Ren, Z., Huang, Y., Chen, X., Liu, Y., Zheng, Z.: Hallucination detec- tion in llm code generation: A sampling-based consensus verification approach. Automated Software Engineering33(2), 70 (2026)

  18. [18]

    Nature Communications 16(1) (2025) https://doi.org/10.1038/s41467-025-63913-1

    Tang, X., Jin, Q., Zhu, K., Yuan, T., Zhang, Y., Zhou, W., Qu, M., Zhao, Y., Tang, J., Zhang, Z., Cohan, A., Greenbaum, D., Lu, Z., Gerstein, M.: Risks of ai scientists: prioritizing safeguarding over autonomy. Nature Communications 16(1) (2025) https://doi.org/10.1038/s41467-025-63913-1

  19. [19]

    https://arxiv.org/abs/2502.13767

    Roychoudhury, A., Pasareanu, C., Pradel, M., Ray, B.: Agentic AI Software Engineers: Programming with Trust (2025). https://arxiv.org/abs/2502.13767

  20. [20]

    Standard ASME NQA-1-2017, American Society of Mechanical Engineers, New York, NY (Jan- uary 2017)

    American Society of Mechanical Engineers: Quality assurance require- ments for nuclear facility applications. Standard ASME NQA-1-2017, American Society of Mechanical Engineers, New York, NY (Jan- uary 2017). https://www.asme.org/codes-standards/find-codes-standards/ quality-assurance-requirements-for-nuclear-facility-applications

  21. [21]

    Cambridge University Press, Cambridge (2010)

    Oberkampf, W.L., Roy, C.J.: Verification and Validation in Scientific Computing. Cambridge University Press, Cambridge (2010)

  22. [22]

    https://doi.org/10.1109/IEEESTD.2017.7916712

    IEEE Standard for System, Software, and Hardware Verification and Validation. https://doi.org/10.1109/IEEESTD.2017.7916712 . IEEE. Revision of IEEE Std 1012-2012

  23. [23]

    arXiv (2025)

    Jakeman, J.D., Barba, L.A., Martins, J.R.R.A., O’Leary-Roseberry, T.: Ver- ification and Validation for Trustworthy Scientific Machine Learning. arXiv (2025). https://doi.org/10.48550/ARXIV.2502.15496 . https://arxiv.org/abs/ 2502.15496

  24. [24]

    Fusion Engineering and Design 214, 114874 (2025) https://doi.org/10.1016/j.fusengdes.2025.114874

    Simon, P.-C.A., Icenhour, C.T., Singh, G., Lindsay, A.D., Bhave, C.V., Yang, L., Riet, A.A., Che, Y., Humrickhouse, P., Shimada, M., Calderoni, P.: MOOSE- based tritium migration analysis program, version 8 (TMAP8) for advanced open- source tritium transport and fuel cycle modeling. Fusion Engineering and Design 214, 114874 (2025) https://doi.org/10.1016/...

  25. [25]

    SoftwareX31, 102264 (2025) https://doi.org/10.1016/j.softx.2025.102264

    Harbour, L., Giudicelli, G., Lindsay, A.D., German, P., Hansel, J., Icenhour, C., Li, M., Miller, J.M., Stogner, R.H., Behne, P., Yankura, D., Prince, Z.M., DeChant, C., Schwen, D., Spencer, B.W., Tano, M., Choi, N., Wang, Y., Nezdyur, M., Miao, Y., Hu, T., Kumar, S., Matthews, C., Langley, B., Nobre, N., Blair, A., MacMackin, C., Rocha, H.B., Palmer, E.,...

  26. [26]

    Cambridge University Press, Cambridge, UK (2007) 21

    Freidberg, J.P.: Plasma Physics and Fusion Energy. Cambridge University Press, Cambridge, UK (2007) 21

  27. [27]

    Nature Physics12, 398–410 (2016) https://doi.org/10.1038/nphys3745

    Ongena, J., Koch, R., Wolf, R., Zohm, H.: Magnetic-confinement fusion. Nature Physics12, 398–410 (2016) https://doi.org/10.1038/nphys3745

  28. [28]

    Journal of Research of the National Institute of Standards and Technology105(4), 541–549 (2000) https://doi.org/10.6028/jres.105.043

    Lucas, L.L., Unterweger, M.P.: Comprehensive review and critical evaluation of the half-life of tritium. Journal of Research of the National Institute of Standards and Technology105(4), 541–549 (2000) https://doi.org/10.6028/jres.105.043

  29. [29]

    Fusion Engineering and Design136, 1140–1148 (2018) https://doi.org/10.1016/j.fusengdes.2018.04.090

    Pearson, R.J., Antoniazzi, A.B., Nuttall, W.J.: Tritium supply and use: a key issue for the development of nuclear fusion energy. Fusion Engineering and Design136, 1140–1148 (2018) https://doi.org/10.1016/j.fusengdes.2018.04.090 . Special Issue: Proceedings of the 13th International Symposium on Fusion Nuclear Technology (ISFNT-13)

  30. [30]

    National Academies of Sciences, Engineering, and Medicine: Bringing Fusion to the U.S. Grid. The National Academies Press, Washington, DC (2021). https: //doi.org/10.17226/25991

  31. [31]

    Nuclear Fusion61(1), 013001 (2021) https://doi.org/10.1088/1741-4326/abbf35

    Abdou, M., Riva, M., Ying, A., Day, C., Loarte, A., Baylor, L., Humrick- house, P., Fuerst, T.F., Cho, S.-K.: Physics and technology considerations for the deuterium–tritium fuel cycle and conditions for tritium fuel self sufficiency. Nuclear Fusion61(1), 013001 (2021) https://doi.org/10.1088/1741-4326/abbf35

  32. [32]

    Nuclear Fusion58(2), 026010 (2018) https://doi.org/10.1088/ 1741-4326/aa9d25

    Kovari, M., Coleman, M., Cristescu, I., Smith, R.: Tritium resources available for fusion reactors. Nuclear Fusion58(2), 026010 (2018) https://doi.org/10.1088/ 1741-4326/aa9d25

  33. [33]

    Journal of Nuclear Materials300(2–3), 91–117 (2002) https: //doi.org/10.1016/S0022-3115(01)00732-2

    Causey, R.A.: Hydrogen isotope retention and recycling in fusion reactor plasma- facing components. Journal of Nuclear Materials300(2–3), 91–117 (2002) https: //doi.org/10.1016/S0022-3115(01)00732-2

  34. [34]

    In: Konings, R.J.M., Stoller, R.E

    Shimada, M.: 6.08 - tritium transport in fusion reactor materials. In: Konings, R.J.M., Stoller, R.E. (eds.) Comprehensive Nuclear Mate- rials (Second Edition), Second edition edn., pp. 251–273. Elsevier, Oxford (2020). https://doi.org/10.1016/B978-0-12-803581-8.11754-0 . https://www.sciencedirect.com/science/article/pii/B9780128035818117540

  35. [35]

    Technical report, U.S

    Fusion Energy Sciences Advisory Committee: Transformative enabling capabili- ties for efficient advance toward fusion energy. Technical report, U.S. Department of Energy, Office of Science, Office of Fusion Energy Sciences (February 2018). https://science.osti.gov/-/media/fes/fesac/pdf/2018/TEC Report Final.pdf

  36. [36]

    Technical report, U.S

    Fusion Energy Sciences Advisory Committee: Powering the future: Fusion & plas- mas. Technical report, U.S. Department of Energy, Office of Science, Office of Fusion Energy Sciences (December 2020). https://science.osti.gov/-/media/fes/ fesac/pdf/2020/202012/FESAC Report 2020 Powering the Future.pdf

  37. [37]

    Department of Energy, Office of Science: Fusion energy sciences: 22 A ten-year perspective (2015–2025)

    U.S. Department of Energy, Office of Science: Fusion energy sciences: 22 A ten-year perspective (2015–2025). Technical report, U.S. Department of Energy (2015). https://science.osti.gov/-/media/fes/pdf/2015/FES A Ten-Year Perspective 2015-2025.pdf

  38. [38]

    Department of Energy, Office of Science: Fusion energy sciences: Sci- ence and technology roadmap

    U.S. Department of Energy, Office of Science: Fusion energy sciences: Sci- ence and technology roadmap. Technical report, U.S. Department of Energy, Office of Science, Office of Fusion Energy Sciences (October 2025). Released October 16, 2025. https://www.energy.gov/sites/default/files/2025-10/fusion-s% 26t-roadmap-101625.pdf

  39. [39]

    Nuclear Fusion

    Simon, P.-C.A., Icenhour, C.T., Singh, G., Lindsay, A.D., Bhave, C.V., Yang, L., Riet, A.A., Che, Y., Humrickhouse, P., Shimada, M., Calderoni, P.: Advanced tritium transport modeling and verification, validation, and benchmarking from component level to fuel cycle scale using TMAP8. Nuclear Fusion. Submitted (2026)

  40. [40]

    Fusion Engineering and Design218, 115128 (2025) https://doi.org/10.1016/j.fusengdes.2025.115128

    Franklin, T., Icenhour, C., Simon, P.-C.A., Humrickhouse, P., Kong, F., Carasik, L.B.: New developments and verification of fusion blanket simulation capabilities in the MOOSE framework. Fusion Engineering and Design218, 115128 (2025) https://doi.org/10.1016/j.fusengdes.2025.115128

  41. [41]

    Fusion Engineering and Design203, 114438 (2024) https://doi.org/10.1016/j.fusengdes

    Shimada, M., Simon, P.-C.A., Icenhour, C.T., Singh, G.: Toward a high-fidelity tritium transport modeling for retention and permeation experiments. Fusion Engineering and Design203, 114438 (2024) https://doi.org/10.1016/j.fusengdes. 2024.114438

  42. [42]

    International Journal of Hydrogen Energy 210, 153551 (2026) https://doi.org/10.1016/j.ijhydene.2026.153551

    Yang, L., Simon, P.-C.A., Tang, W., Li, M., Zhao, Z., Ding, D., Fuerst, T.: Elu- cidating hydrogen isotope transport mechanisms in proton-conducting ceramics with trapping effects using TMAP8. International Journal of Hydrogen Energy 210, 153551 (2026) https://doi.org/10.1016/j.ijhydene.2026.153551

  43. [43]

    Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?

    Gloaguen, T., M¨ undler, N., M¨ uller, M., Raychev, V., Vechev, M.: Evaluating agents. md: Are repository-level context files helpful for coding agents? arXiv preprint arXiv:2602.11988 (2026)

  44. [44]

    Technical Report PLN-4005, Idaho National Laboratory, Idaho Falls, ID (March 2025)

    Battelle Energy Alliance, LLC: Software Quality Assurance Plan (SQAP) for MOOSE and MOOSE-Based Applications. Technical Report PLN-4005, Idaho National Laboratory, Idaho Falls, ID (March 2025). Available to authorized users via the INL Electronic Document Management System (EDMS)

  45. [45]

    Journal of Nuclear Materials 458, 22–28 (2015) https://doi.org/10.1016/j.jnucmat.2014.11.047

    Kobayashi, M., Oya, Y., Munakata, K., Okuno, K.: Developing a tritium release model for Li2TiO3 with irradiation-induced defects. Journal of Nuclear Materials 458, 22–28 (2015) https://doi.org/10.1016/j.jnucmat.2014.11.047

  46. [46]

    Nuclear Materials and Energy30, 101137 (2022) https://doi.org/10.1016/j.nme.2022

    Kremer, K., Brucker, M., Jacob, W., Schwarz-Selinger, T.: Influence of thin sur- face oxide films on hydrogen isotope release from ion-irradiated tungsten. Nuclear Materials and Energy30, 101137 (2022) https://doi.org/10.1016/j.nme.2022. 23 101137

  47. [47]

    GitHub pull request #397,idaholab/TMAP8

    Bhave, C.: Add HIT format pre-commit check and agentic workflow docs. GitHub pull request #397,idaholab/TMAP8. Merged 2026-03-20 by P.-C. Simon (simopier). Accessed: 2026-05-08 (2026). https://github.com/idaholab/TMAP8/ pull/397

  48. [48]

    GitHub pull request #402,idaholab/TMAP8

    Yang, L.: Add validation case val-2j: Tritium TDS from Li 2TiO3 solid breeder. GitHub pull request #402,idaholab/TMAP8. Merged 2026-04-14 by P.-C. Simon (simopier); closes issue #400. Accessed: 2026-05-08 (2026). https://github.com/ idaholab/TMAP8/pull/402

  49. [49]

    GitHub pull request #409,idaholab/TMAP8

    Simon, P.-C.: Add val-2k: deuterium desorption from self-irradiated tungsten with thin film oxide. GitHub pull request #409,idaholab/TMAP8. Merged 2026-05- 07; closes issue #399. Accessed: 2026-05-08 (2026). https://github.com/idaholab/ TMAP8/pull/409

  50. [50]

    TBD (2026

    Kadz, L., Yang, L., Dhulipala, S.L.N., Shimada, M., Simon, P.-C.A.: Investigating the critical role of surface effects on deuterium transport in self-damaged tungsten using TMAP8. TBD (2026. In preparation)

  51. [51]

    GitHub issue #408,idaholab/TMAP8

    Idaho National Laboratory: Improve AGENTS.md and agent workflow. GitHub issue #408,idaholab/TMAP8. Accessed: 2026-05-08 (2025). https://github.com/ idaholab/TMAP8/issues/408

  52. [52]

    Nature631(8022), 755–759 (2024)

    Shumailov, I., Shumaylov, Z., Zhao, Y., Papernot, N., Anderson, R., Gal, Y.: Ai models collapse when trained on recursively generated data. Nature631(8022), 755–759 (2024)

  53. [53]

    Nature632(8026), 715–716 (2024)

    Gibney, E.: Has your paper been used to train an ai model? almost certainly. Nature632(8026), 715–716 (2024)

  54. [54]

    NeuroImage124, 1155–1167 (2016) https://doi.org/10.1016/j

    Dhulipala, S.L.N., German, P., Che, Y., Prince, Z.M., Xie, X., Simon, P.-C.A., Labour´ e, V.M., Yan, H.: MOOSE ProbML: Parallelized probabilistic machine learning and uncertainty quantification for computational energy applications. Journal of Computational Science94, 102776 (2026) https://doi.org/10.1016/j. jocs.2025.102776

  55. [55]

    Nuclear Fusion64(8), 086026 (2024)

    Dark, J., Delaporte-Mathurin, R., Schwarz-Selinger, T., Hodille, E.A., Mougenot, J., Charles, Y., Grisolia, C.: Modelling neutron damage effects on tritium transport in tungsten. Nuclear Fusion64(8), 086026 (2024)

  56. [56]

    Journal of Nuclear Materials393(3), 508–512 (2009) https://doi.org/10.1016/j.jnucmat.2009.07.012

    Jiang, D.-e., Carter, E.A.: First-principles investigation on dissolution and diffu- sion of oxygen in tungsten. Journal of Nuclear Materials393(3), 508–512 (2009) https://doi.org/10.1016/j.jnucmat.2009.07.012

  57. [57]

    Nuclear Materials and Energy27, 100991 (2021) https: //doi.org/10.1016/j.nme.2021.100991

    Kremer, K., Schwarz-Selinger, T., Jacob, W.: Influence of thin tungsten oxide 24 films on hydrogen isotope uptake and retention in tungsten – evidence for per- meation barrier effect. Nuclear Materials and Energy27, 100991 (2021) https: //doi.org/10.1016/j.nme.2021.100991

  58. [58]

    IEEE Transactions on Plasma Science, 1–7 (2022) https://doi.org/10.1109/TPS.2022.3183525

    Simon, P.-C.A., Humrickhouse, P.W., Lindsay, A.D.: Tritium Transport Modeling at the Pore Scale in Ceramic Breeder Materials Using TMAP8. IEEE Transactions on Plasma Science, 1–7 (2022) https://doi.org/10.1109/TPS.2022.3183525

  59. [59]

    Journal of Nuclear Materials610, 155795 (2025) https://doi.org/10.1016/j.jnucmat.2025

    Dhulipala, S.L.N., Simon, P.-C.A., Demkowicz, P.A., Hirschhorn, J.A., Novas- cone, S.R.: Unpacking model inadequacy: The quantification of silver release from TRISO fuel by considering empirical and mechanistic approaches. Journal of Nuclear Materials610, 155795 (2025) https://doi.org/10.1016/j.jnucmat.2025. 155795

  60. [60]

    Slaughter, A.E., Prince, Z.M., German, P., Halvic, I., Jiang, W., Spencer, B.W., Dhulipala, S.L., Gaston, D.R.: Moose stochastic tools: A module for performing parallel, memory-efficient in situ stochastic simulations. SoftwareX22, 101345 (2023) 25 Supplementary material Validation case 1: Tritium release from neutron-irradiated Li2TiO3 This validation ca...