GlyTwin: Digital Twin for Glucose Control in Type 1 Diabetes Through Optimal Behavioral Modifications Using Patient-Centric Counterfactuals

Asiful Arefeen; Bithika Thompson; Hassan Ghasemzadeh; Maria Adela Grando; Saman Khamesian

arxiv: 2504.09846 · v2 · pith:7DCJQYRZnew · submitted 2025-04-14 · 💻 cs.LG · cs.AI· cs.HC

GlyTwin: Digital Twin for Glucose Control in Type 1 Diabetes Through Optimal Behavioral Modifications Using Patient-Centric Counterfactuals

Asiful Arefeen , Saman Khamesian , Maria Adela Grando , Bithika Thompson , Hassan Ghasemzadeh This is my paper

Pith reviewed 2026-05-25 08:45 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.HC

keywords digital twincounterfactual explanationstype 1 diabetesglucose controlbehavioral modificationshyperglycemiapatient-centric interventionsautomated insulin delivery

0 comments

The pith

GlyTwin generates patient-centric counterfactuals inside a digital twin to recommend behavioral changes that cut hyperglycemia in type 1 diabetes simulations.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents GlyTwin as a framework that adds counterfactual explanation generation to digital twin models so that they can propose specific adjustments to carbohydrate intake and insulin dosing. These adjustments are meant to produce alternative future glucose trajectories that avoid or shorten hyperglycemic periods while respecting individual preferences. The authors built a new longitudinal dataset from 50 automated-insulin-delivery users and report that the resulting counterfactuals are valid 85.8 percent of the time and reduce hyperglycemia 87.3 percent more effectively than the patients' own historical records. The central premise is that current predictive tools stop short of offering actionable, preference-aware behavioral scenarios that could be used for proactive management.

Core claim

GlyTwin generates counterfactual treatments by recommending adjustments to behavioral choices, such as carbohydrate intake and insulin dosing, to significantly reduce the occurrence and duration of hyperglycemic events. In addition, GlyTwin incorporates stakeholder preferences into its intervention-generation process, ensuring that the tool is personalized and user-centric. When evaluated on the AZT1D dataset of 50 type 1 diabetes patients monitored for 26 days each, the method produced 85.8 percent valid explanations and 87.3 percent effectiveness in preventing hyperglycemia relative to historical data, outperforming prior counterfactual methods.

What carries the argument

Patient-centric counterfactual explanations integrated into a digital twin that simulate alternative behavioral treatment scenarios for glucose trajectories.

If this is right

Digital twins for diabetes can move from passive prediction to active generation of preferred behavioral alternatives.
Stakeholder preferences can be directly encoded so that recommended changes remain acceptable to the individual patient.
Hyperglycemia reduction of roughly 87 percent relative to historical traces becomes achievable inside simulation before any real-world change occurs.
The same counterfactual machinery can be applied to other chronic conditions where daily behavioral choices drive long-term outcomes.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the recommendations prove effective in practice, automated insulin delivery systems could embed GlyTwin-style modules that surface daily behavioral nudges rather than only adjusting insulin.
Long-term complication rates could decline if repeated use of the tool steadily lowers cumulative hyperglycemic exposure.
The approach could be tested by measuring whether patients who receive the counterfactual suggestions actually modify their recorded behavior in the expected direction.

Load-bearing premise

Counterfactuals judged valid and effective on past glucose and behavior records will produce the same improvements when patients actually adopt the recommended changes.

What would settle it

A prospective trial in which patients follow the specific carb-intake and insulin-dosing recommendations generated by GlyTwin for several weeks and their measured time-in-hyperglycemia is compared against their own baseline data.

Figures

Figures reproduced from arXiv: 2504.09846 by Asiful Arefeen, Bithika Thompson, Hassan Ghasemzadeh, Maria Adela Grando, Saman Khamesian.

**Figure 1.** Figure 1: Digital twin framework with enhanced capabilities that can model physiological response, simulate treatments, and identify optimal treatment. Our proposition, GlyTwin, is built upon this vision. Following [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗

**Figure 2.** Figure 2: Counterfactual XAI for hyperglycemia prevention. glucose levels (BGL) and insulin bolus dosing). Red squares represent samples that resulted in hyperglycemia, while green squares correspond to normoglycemic outcomes. The black dashed curve denotes the decision boundary of the classifier that separates hyperglycemia from normoglycemia. Assuming the pink square as an observed hyperglycemic event, GlyTwin ide… view at source ↗

**Figure 3.** Figure 3: GlyTwin framework consists of four phases: data acquisition from CGM sensor and insulin logs, model training for glycemic outcome prediction, counterfactual generation for actionable recommendations, and integration into a dynamic, personalized management pipeline [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 4.** Figure 4: Categorizing the performance of GlyTwin based on patient age, sex, A1C and years from diagnosis (YfD). whom are false negatives). Proximity values remain relatively consistent across age groups, with younger patients (27–36 years) showing slightly higher proximity values (0.328). Sparsity, which reflects the simplicity of the CFs, does not vary significantly and stays within a narrow range of 2.0 − 2.43 a… view at source ↗

**Figure 5.** Figure 5: Comparison of feature diversity among CFs produced using different techniques [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗

**Figure 6.** Figure 6: Preference alignment analysis of GlyTwin. 2. Discussion In this section, we will demonstrate some experiments conducted on the GlyTwin algorithm. Basically, we want to answer- what happens when certain parameters of GlyTwin are modified? 2.1 Impact of target probability When a higher target probability (γ) is set for normoglycemia, it takes longer for GlyTwin to reach the target. Therefore, GlyTwin has to… view at source ↗

**Figure 7.** Figure 7: Results of ablations studies performed using GlyTwin. The ablation studies include understanding (a) how the number of required iterations, (b) proximity, and (c) validity changes as we set different target confidence for achieving normoglycemia [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗

**Figure 8.** Figure 8: Runtime comparison between converged and non-converged samples. GlyTwin takes roughly 9.4 seconds on average to produce a behavioral intervention to prevent hyperglycemia. 2.4 Limitations of GlyTwin Although GlyTwin achieves promising results in many performance metrics, we have identified three limitations associated with the current state of the GlyTwin technology. 2.4.1 Clinical validation of the inter… view at source ↗

**Figure 9.** Figure 9: Extracting the basal rates and the device modes from the PDFs using OCR and coordinate system. its timestamp (tmax) have been captured. Therefore, BGLmax = max(BGL[tf b : tf b +120]) and tmax = argmax(BGL[tf b : tf b + 120]). Since, peak time for glucose level after meal is 72±23 minutes, our assumption is that meal timestamp tmeal = tmax − 72 [41]. Hence, we calculate ∆t using tmeal and tf b. 3.3.3 Total … view at source ↗

**Figure 10.** Figure 10: Derivation of different features from the data stream [PITH_FULL_IMAGE:figures/full_fig_p010_10.png] view at source ↗

read the original abstract

Frequent and long-term exposure to hyperglycemia increases the risk of chronic complications, including neuropathy, nephropathy, and cardiovascular disease. Existing continuous subcutaneous insulin infusion (CSII) and continuous glucose monitoring (CGM) technologies model only specific aspects of glycemic regulation, such as predicting hypoglycemia and administering small insulin boluses. Similarly, current digital twin approaches in diabetes management primarily focus on predicting glucose responses to human behavior and insulin therapy. As a result, these technologies lack the ability to provide alternative treatment scenarios that could guide proactive behavioral interventions for optimal diabetes management. To address this gap, we propose GlyTwin, a novel computational framework that enhances digital twin technologies by integrating counterfactual explanations to simulate optimal behavioral treatments for glucose control. GlyTwin generates counterfactual treatments by recommending adjustments to behavioral choices, such as carbohydrate intake and insulin dosing, to significantly reduce the occurrence and duration of hyperglycemic events. In addition, GlyTwin incorporates stakeholder preferences into its intervention-generation process, ensuring that the tool is personalized and user-centric. We evaluate GlyTwin on AZT1D, a new dataset constructed by collecting longitudinal data from 50 individuals living with type 1 diabetes (T1D) on automated insulin delivery (AID) systems, each monitored for 26 days. Results show that GlyTwin outperforms state-of-the-art methods for generating counterfactual explanations, with 85.8\% valid explanations and 87.3\% effectiveness in preventing hyperglycemia compared with historical data.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

GlyTwin adds counterfactuals to digital twins for suggesting T1D behavioral changes, but the 87.3% effectiveness claim rests only on retrospective historical comparisons without any test of adoption or closed-loop outcomes.

read the letter

The core contribution is a framework that layers counterfactual generation on top of a digital twin to recommend adjustments in carbohydrate intake and insulin dosing aimed at cutting hyperglycemic time. They collected a new dataset (AZT1D) from 50 patients on automated insulin delivery systems over 26 days each and report 85.8% valid counterfactuals plus 87.3% effectiveness versus the observed traces. The work also tries to fold in stakeholder preferences so the suggestions feel personalized rather than purely algorithmic. That combination of a fresh dataset and the user-centric angle is the clearest advance over prior digital-twin papers that stop at prediction. The comparison to existing counterfactual methods is a reasonable baseline step and shows numerical improvement on their chosen metrics. The main limitation is exactly the one flagged in the stress test. Effectiveness is defined by whether the suggested changes would have produced lower glucose in the historical record, which assumes both perfect adherence and that the underlying glucose model remains accurate under the altered inputs. No closed-loop simulation, no adherence measurement, and no prospective arm appear in the evaluation, so the jump from “valid on past data” to “would improve real outcomes” stays untested. The math and data handling look standard for this domain, with no obvious circularity once the retrospective framing is accepted. This paper is aimed at researchers working at the intersection of explainable AI and chronic-disease digital health. It is coherent on its own terms and engages the relevant literature without obvious internal contradictions, so it deserves a serious referee even though the validation gap will need addressing in revision. I would send it out for review rather than desk-reject.

Referee Report

2 major / 2 minor

Summary. The paper introduces GlyTwin, a digital twin framework that augments glucose prediction models with counterfactual explanations to recommend patient-centric behavioral changes (primarily adjustments to carbohydrate intake and insulin dosing) aimed at reducing hyperglycemic events in type 1 diabetes. It constructs a new AZT1D dataset from 50 AID users monitored for 26 days each and reports that GlyTwin generates 85.8% valid counterfactuals while achieving 87.3% effectiveness in preventing hyperglycemia relative to historical traces, outperforming prior counterfactual methods.

Significance. If the retrospective performance numbers prove robust under prospective validation, the work would represent a meaningful step toward actionable, preference-aware digital twins that move beyond passive prediction to proactive behavioral guidance. The release of the AZT1D dataset is a concrete contribution that could support future reproducibility studies in the field.

major comments (2)

[Evaluation] Evaluation section (and abstract): the 87.3% effectiveness figure is obtained solely by scoring counterfactual traces against the same historical glucose records used to generate them; no closed-loop simulation of the underlying glucose dynamics, no adherence model, and no prospective patient arm are described. This assumption is load-bearing for the central claim that the counterfactuals translate to improved real-world outcomes.
[Methods] Methods section: the abstract and provided description supply no algorithmic details on how counterfactuals are generated, how validity is formally defined, how stakeholder preferences are encoded, or what statistical tests support the reported percentages. Without these, the 85.8% validity claim cannot be independently verified.

minor comments (2)

[Evaluation] The paper should explicitly state the glucose dynamics model used to score counterfactual effectiveness and whether it was fitted on the same AZT1D traces.
[Dataset] Clarify whether the 26-day windows are contiguous or selected, and report any patient-level variability in the validity and effectiveness metrics.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address each major comment below, indicating planned revisions where appropriate.

read point-by-point responses

Referee: [Evaluation] Evaluation section (and abstract): the 87.3% effectiveness figure is obtained solely by scoring counterfactual traces against the same historical glucose records used to generate them; no closed-loop simulation of the underlying glucose dynamics, no adherence model, and no prospective patient arm are described. This assumption is load-bearing for the central claim that the counterfactuals translate to improved real-world outcomes.

Authors: We agree that the evaluation is retrospective and relies on historical traces. The 87.3% effectiveness is computed by applying the generated counterfactual behavioral changes (carbohydrate and insulin adjustments) to the patient's observed data and comparing the resulting simulated glucose trajectory against the original historical record. This quantifies potential improvement under the observed conditions but does not model future adherence or closed-loop dynamics. In the revision we will add an explicit Limitations subsection acknowledging the absence of prospective validation or adherence modeling and outlining plans for future closed-loop simulation studies. The current results remain a necessary first step for demonstrating the framework on real patient data. revision: yes
Referee: [Methods] Methods section: the abstract and provided description supply no algorithmic details on how counterfactuals are generated, how validity is formally defined, how stakeholder preferences are encoded, or what statistical tests support the reported percentages. Without these, the 85.8% validity claim cannot be independently verified.

Authors: The full manuscript contains a Methods section that defines counterfactual generation via constrained optimization minimizing time-in-hyperglycemia subject to physiological and behavioral bounds, with validity defined as counterfactuals that reduce hyperglycemia duration by at least 20% while satisfying all constraints. Stakeholder preferences are encoded as additional soft constraints (e.g., bounds on daily carbohydrate intake derived from patient history). Percentages are supported by paired statistical tests across the 50-patient cohort. To address the concern, we will expand the Methods section with pseudocode, formal mathematical definitions, and explicit statistical test descriptions in the revision. revision: yes

Circularity Check

0 steps flagged

No significant circularity; evaluation is retrospective but independent of input definitions

full rationale

The paper introduces GlyTwin as a framework that generates patient-centric counterfactuals for carbohydrate/insulin adjustments and evaluates them on the newly collected AZT1D dataset (50 patients, 26 days each). The reported metrics (85.8% valid explanations, 87.3% effectiveness) are computed by comparing generated counterfactuals against observed historical glucose traces, not by re-using the same fitted parameters or self-referential definitions as the generation step. No equations or sections are shown that reduce the effectiveness score to a tautology (e.g., no case where the model used for scoring is identical to the model used for generation in a way that forces the result). Self-citations, if present, are not load-bearing for the central performance numbers. The derivation therefore remains self-contained against the external historical benchmark.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract does not specify any free parameters, axioms, or invented entities; assessment limited by lack of full text.

pith-pipeline@v0.9.0 · 5820 in / 1238 out tokens · 49424 ms · 2026-05-25T08:45:46.587671+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

45 extracted references · 45 canonical work pages · 2 internal anchors

[1]

Reynolds, Kara R

Evan L. Reynolds, Kara R. Mizokami-Stout, Nathaniel Putnam, Mousumi Banerjee, Dana Albright, Lynn Ang, Joyce Lee, Rodica Pop-Busui, Eva L. Feldman, and Brian Christopher Callaghan. Cost and utilization of healthcare services for persons with diabetes. Diabetes research and clinical practice, page 110983, 2023

work page 2023
[2]

ElSayed, Grazia Aleppo, Vanita R

Nuha A. ElSayed, Grazia Aleppo, Vanita R. Aroda, Raveendhara R. Bannuru, Florence M. Brown, Dennis Bruemmer, Billy S. Collins, Marisa E. Hilliard, Diana M. Isaacs, Eric L. Johnson, Scott Kahan, Kamlesh K Khunti, Jose Leon, Sarah K. Lyons, Mary Lou Perry, Priya Pra- halad, Richard E. Pratley, J. Seley, Robert C. Stanton, GlyTwin: Digital Twin for Glucose C...

work page 2023
[3]

Sherr, Lori M

Jennifer L. Sherr, Lori M. Laffel, Jingwen Liu, Wendy A. Wolf, Jeoffrey A. Bispham, Katherine S Chapman, Daniel Finan, Lina Titievsky, Tina Liu, Kaitlin Hagan, Jason L. Gaglia, Keval Chandarana, Richard M. Bergenstal, and Jeremy H. Pettus. Severe hypoglycemia and impaired awareness of hypoglycemia persist in people with type 1 diabetes despite use of diab...

work page 2024
[4]

One-year outcomes of a digi- tal twin intervention for type 2 diabetes: a retrospective real-world study

Paramesh Shamanna, Ravi Sankar Erukulapati, Ashutosh Shukla, Lisa Shah, Bree Willis, Mohamed Thajudeen, Ra- jiv Kovil, Rahul Baxi, Mohsin Wali, Suresh Damodharan, and Shashank R Joshi. One-year outcomes of a digi- tal twin intervention for type 2 diabetes: a retrospective real-world study. Scientific Reports, 14, 2024

work page 2024
[5]

Artifi- cial intelligence in diabetes management: Advancements, opportunities, and challenges

Zhouyu Guan, Huating Li, Ruhan Liu, Chun Cai, Yuex- ing Liu, Jiajia Li, Xiangning Wang, Shan Huang, Liang Wu, Dan Liu, Shujie Yu, Zheyuan Wang, Jia Shu, Xuhong Hou, Xiaokang Yang, Weiping Jia, and Bin Sheng. Artifi- cial intelligence in diabetes management: Advancements, opportunities, and challenges. Cell Reports Medicine, 4, 2023

work page 2023
[6]

Re- playbg: A digital twin-based methodology to identify a personalized model from type 1 diabetes data and simu- late glucose concentrations to assess alternative therapies

Giacomo Cappon, Martina Vettoretti, Giovanni Spara- cino, Simone Del Favero, and Andrea Facchinetti. Re- playbg: A digital twin-based methodology to identify a personalized model from type 1 diabetes data and simu- late glucose concentrations to assess alternative therapies. IEEE Transactions on Biomedical Engineering, 70:3227– 3238, 2023

work page 2023
[7]

System architec- ture of twin: A new digital twin-based clinical decision support system for type 1 diabetes management in chil- dren

Giacomo Cappon, Elisa Pellizzari, Luca Cossu, Giovanni Sparacino, Annalisa Deodati, Riccardo Schiaffini, Ste- fano Cianfarani, and Andrea Facchinetti. System architec- ture of twin: A new digital twin-based clinical decision support system for type 1 diabetes management in chil- dren. 2023 IEEE 19th International Conference on Body Sensor Networks (BSN), ...

work page 2023
[8]

Naveenah Udaya Surian, Arsen Batagov, Andrew Wu, W. B. Lai, Yan Sun, Yong Mong Bee, and Rinkoo Dalan. A digital twin model incorporating generalized metabolic fluxes to identify and predict chronic kidney disease in type 2 diabetes mellitus. NPJ Digital Medicine, 7, 2024

work page 2024
[9]

Digital twin predicting diet response before and after long-term fasting

Oscar Silfvergren, Christian Simonsson, Mattias Ekstedt, Peter Lundberg, Peter Gennemark, and Gunnar Ceder- sund. Digital twin predicting diet response before and after long-term fasting. PLoS Computational Biology, 18, 2021

work page 2021
[10]

Glumarker: A novel predictive modeling of glycemic control through digital biomarkers

Ziyi Zhou, Ming Cheng, Xingjian Diao, Yanjun Cui, and Xiangling Li. Glumarker: A novel predictive modeling of glycemic control through digital biomarkers. 2024 46th Annual International Conference of the IEEE Engineer- ing in Medicine and Biology Society (EMBC), pages 1–7, 2024

work page 2024
[11]

Glysim: Mod- eling and simulating glycemic response for behavioral lifestyle interventions

Asiful Arefeen and Hassan Ghasemzadeh. Glysim: Mod- eling and simulating glycemic response for behavioral lifestyle interventions. 2023 IEEE EMBS International Conference on Biomedical and Health Informatics (BHI), pages 1–5, 2023

work page 2023
[12]

Nurse-in-the- loop artificial intelligence for precision management of type 2 diabetes in a clinical trial utilizing transfer-learned predictive digital twin

Syed Hasib Akhter Faruqui, Adel Alaeddini, Yan Du, Shiyu Li, Kumar Sharma, and Jing Wang. Nurse-in-the- loop artificial intelligence for precision management of type 2 diabetes in a clinical trial utilizing transfer-learned predictive digital twin. ArXiv, abs/2401.02661, 2024

work page arXiv 2024
[13]

Forewarning postprandial hyper- glycemia with interpretations using machine learning

Asiful Arefeen, Samantha N Fessler, Carol Johnston, and Hassan Ghasemzadeh. Forewarning postprandial hyper- glycemia with interpretations using machine learning. 2022 IEEE-EMBS International Conference on Wearable and Implantable Body Sensor Networks (BSN) , pages 1–4, 2022

work page 2022
[14]

why should i trust you?

Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. “why should i trust you?”: Explaining the predictions of any classifier. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

work page 2016
[15]

Akshay Sood and Mark W. Craven. Feature importance explanations for temporal black-box models. ArXiv, abs/2102.11934, 2021

work page arXiv 2021
[16]

A Unified Approach to Interpreting Model Predictions

Scott M. Lundberg and Su-In Lee. A unified approach to interpreting model predictions. ArXiv, abs/1705.07874, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[17]

Unbiased measurement of feature importance in tree-based methods

Zhengze Zhou and Giles Hooker. Unbiased measurement of feature importance in tree-based methods. ACM Trans- actions on Knowledge Discovery from Data (TKDD) , 15:1 – 21, 2019

work page 2019
[18]

All models are wrong, but many are useful: Learning a variable’s importance by studying an entire class of prediction models simultaneously

Aaron Fisher, Cynthia Rudin, and Francesca Dominici. All models are wrong, but many are useful: Learning a variable’s importance by studying an entire class of prediction models simultaneously. Journal of machine learning research : JMLR, 20, 2018

work page 2018
[19]

Jo˜ao Bento, Pedro Saleiro, Andre Ferreira Cruz, M ´ario A. T. Figueiredo, and P. Bizarro. Timeshap: Explaining re- current models through sequence perturbations. Proceed- ings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, 2020

work page 2020
[20]

What went wrong and when? instance-wise feature im- portance for time-series black-box models

Sana Tonekaboni, Shalmali Joshi, Kieran Campbell, David Kristjanson Duvenaud, and Anna Goldenberg. What went wrong and when? instance-wise feature im- portance for time-series black-box models. In Neural Information Processing Systems, 2020

work page 2020
[21]

and Subhashini R

Saranya A. and Subhashini R. A systematic review of ex- plainable artificial intelligence models and applications: GlyTwin: Digital Twin for Glucose Control in Type 1 Diabetes Through Optimal Behavioral Modifications Using Patient-Centric Counterfactuals — 14/15 Recent developments and future trends. Decision Analyt- ics Journal, 2023

work page 2023
[22]

Entropy- based logic explanations of neural networks

Pietro Barbiero, Gabriele Ciravegna, Francesco Giannini, Pietro Li’o, Marco Gori, and Stefano Melacci. Entropy- based logic explanations of neural networks. ArXiv, abs/2106.06804, 2021

work page arXiv 2021
[23]

Sturm, and No´emie Elhadad

Rich Caruana, Yin Lou, Johannes Gehrke, Paul Koch, M. Sturm, and No´emie Elhadad. Intelligible models for healthcare: Predicting pneumonia risk and hospital 30- day readmission. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

work page 2015
[24]

Nice: an algo- rithm for nearest instance counterfactual explanations

Dieter Brughmans and David Martens. Nice: an algo- rithm for nearest instance counterfactual explanations. Data Mining and Knowledge Discovery , pages 1–39, 2021

work page 2021
[25]

Use of What-if Scenarios to Help Explain Artificial Intelligence Models for Neonatal Health

Abdullah Mamun, Lawrence D. Devoe, Mark I. Evans, David W. Britt, Judith Klein-Seetharaman, and Hassan Ghasemzadeh. Use of what-if scenarios to help explain artificial intelligence models for neonatal health. ArXiv, abs/2410.09635, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[26]

Gen- erating interpretable counterfactual explanations by im- plicit minimisation of epistemic and aleatoric uncertain- ties

Lisa Schut, Oscar Key, Rory McGrath, Luca Costabello, Bogdan Sacaleanu, Medb Corcoran, and Yarin Gal. Gen- erating interpretable counterfactual explanations by im- plicit minimisation of epistemic and aleatoric uncertain- ties. In International Conference on Artificial Intelligence and Statistics, 2021

work page 2021
[27]

Designing user-centric behavioral interventions to prevent dysg- lycemia with novel counterfactual explanations

Asiful Arefeen and Hassan Ghasemzadeh. Designing user-centric behavioral interventions to prevent dysg- lycemia with novel counterfactual explanations. ArXiv, abs/2310.01684, 2023

work page arXiv 2023
[28]

Glyman: Glycemic management using patient-centric counterfac- tuals

Asiful Arefeen, Saman Khamesian, Mar´ıa Adela Grando, Bithika Thompson, and Hassan Ghasemzadeh. Glyman: Glycemic management using patient-centric counterfac- tuals. 2024 IEEE EMBS International Conference on Biomedical and Health Informatics (BHI) , pages 1–5, 2024

work page 2024
[29]

A novel method to derive personalized minimum viable recommendations for type 2 diabetes prevention based on counterfactual explanations

Marta Lenatti, Alberto Carlevaro, Aziz Guergachi, Karim Keshavjee, Maurizio Mongelli, and Alessia Paglialonga. A novel method to derive personalized minimum viable recommendations for type 2 diabetes prevention based on counterfactual explanations. PLOS ONE, 17, 2022

work page 2022
[30]

Characterization of type 2 diabetes using counterfactuals and explainable ai

Marta Lenatti, Alberto Carlevaro, Karim Keshavjee, Aziz Guergachi, Alessia Paglialonga, and Maurizio Mongelli. Characterization of type 2 diabetes using counterfactuals and explainable ai. Studies in health technology and informatics, 294:98–103, 2022

work page 2022
[31]

Realistic coun- terfactual explanations by learned relations

Xintao Xiang and Artem Lenskiy. Realistic coun- terfactual explanations by learned relations. ArXiv, abs/2202.07356, 2022

work page arXiv 2022
[32]

Enhancing metabolic syndrome prediction with hybrid data balanc- ing and counterfactuals

Sanyam Paresh Shah, Abdullah Mamun, Shovito Barua Soumma, and Hassan Ghasemzadeh. Enhancing metabolic syndrome prediction with hybrid data balanc- ing and counterfactuals. volume abs/2504.06987, 2025

work page arXiv 2025
[33]

Directive explanations for monitoring the risk of diabetes onset: Introducing directive data- centric explanations and combinations to support what- if explorations

Aditya Bhattacharya, Jeroen Ooge, Gregor Stiglic, and Katrien Verbert. Directive explanations for monitoring the risk of diabetes onset: Introducing directive data- centric explanations and combinations to support what- if explorations. Proceedings of the 28th International Conference on Intelligent User Interfaces, 2023

work page 2023
[34]

Explaining machine learning classifiers through diverse counterfactual explanations

Ramaravind K Mothilal, Amit Sharma, and Chenhao Tan. Explaining machine learning classifiers through diverse counterfactual explanations. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, pages 607–617, 2020

work page 2020
[35]

Optimal Counterfac- tual Explanations for Scorecard modelling

Guillermo Navas-Palencia. Optimal Counterfac- tual Explanations for Scorecard modelling. ArXiv, abs/2104.08619, 2021

work page arXiv 2021
[36]

A model-agnostic and data- independent tabu search algorithm to generate counterfac- tuals for tabular, image, and text data

Raphael Mazzine Barbosa de Oliveira, Kenneth S¨orensen, and David Martens. A model-agnostic and data- independent tabu search algorithm to generate counterfac- tuals for tabular, image, and text data. European Journal of Operational Research, 2023

work page 2023
[37]

CounterNet: End-to-End Training of Prediction Aware Counterfactual Explanations

Hangzhi Guo, Thanh Hong Nguyen, and Amulya Yadav. CounterNet: End-to-End Training of Prediction Aware Counterfactual Explanations. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

work page 2021
[38]

Buck- ingham

Fraser Cameron, G¨unter Niemeyer, and Bruce A. Buck- ingham. Probabilistic evolving meal detection and es- timation of meal total glucose appearance. Journal of Diabetes Science and Technology, 3:1022 – 1030, 2009

work page 2009
[39]

Corbett, Marc D

John P. Corbett, Marc D. Breton, and Stephen D. Patek. A multiple hypothesis approach to estimating meal times in individuals with type 1 diabetes. Journal of Diabetes Science and Technology, 15:141 – 146, 2019

work page 2019
[40]

Peters, Michelle A Van Name, Brian Larsen Thorsted, Johanne Spanggaard Piltoft, and William V Tamborlane

Anne L. Peters, Michelle A Van Name, Brian Larsen Thorsted, Johanne Spanggaard Piltoft, and William V Tamborlane. Postprandial dosing of bolus insulin in pa- tients with type 1 diabetes: A cross-sectional study using data from the t1d exchange registry. Endocrine practice : official journal of the American College of Endocrinology and the American Associa...

work page 2017
[41]

Daenen, Agn `es Sola-Gazagnes, Jocelyne M’bemba, C Dorange-Breillard, F Defer, Fabienne Elgrably, Etienne Larger, and G´erard Slama

S. Daenen, Agn `es Sola-Gazagnes, Jocelyne M’bemba, C Dorange-Breillard, F Defer, Fabienne Elgrably, Etienne Larger, and G´erard Slama. Peak-time determination of post-meal glucose excursions in insulin-treated diabetic patients. Diabetes & metabolism, 36 2:165–9, 2010

work page 2010
[42]

Explanation in Artificial Intelligence: In- sights from the Social Sciences

Tim Miller. Explanation in Artificial Intelligence: In- sights from the Social Sciences. Artif. Intell., 267:1–38, 2017

work page 2017
[43]

Sandra Wachter, Brent Daniel Mittelstadt, and Chris Rus- sell. Counterfactual Explanations Without Opening the GlyTwin: Digital Twin for Glucose Control in Type 1 Diabetes Through Optimal Behavioral Modifications Using Patient-Centric Counterfactuals — 15/15 Black Box: Automated Decisions and the GDPR. Cyber- security, 2017

work page 2017
[44]

Counterfactual Fairness through Transforming Data Orthogonal to Bias

Shuyi Chen and Shixiang Zhu. Counterfactual Fairness through Transforming Data Orthogonal to Bias. ArXiv, abs/2403.17852, 2024

work page arXiv 2024
[45]

Counterfactual explanations and how to find them: literature review and benchmarking

Riccardo Guidotti. Counterfactual explanations and how to find them: literature review and benchmarking. Data Mining and Knowledge Discovery, pages 1–55, 2022

work page 2022

[1] [1]

Reynolds, Kara R

Evan L. Reynolds, Kara R. Mizokami-Stout, Nathaniel Putnam, Mousumi Banerjee, Dana Albright, Lynn Ang, Joyce Lee, Rodica Pop-Busui, Eva L. Feldman, and Brian Christopher Callaghan. Cost and utilization of healthcare services for persons with diabetes. Diabetes research and clinical practice, page 110983, 2023

work page 2023

[2] [2]

ElSayed, Grazia Aleppo, Vanita R

Nuha A. ElSayed, Grazia Aleppo, Vanita R. Aroda, Raveendhara R. Bannuru, Florence M. Brown, Dennis Bruemmer, Billy S. Collins, Marisa E. Hilliard, Diana M. Isaacs, Eric L. Johnson, Scott Kahan, Kamlesh K Khunti, Jose Leon, Sarah K. Lyons, Mary Lou Perry, Priya Pra- halad, Richard E. Pratley, J. Seley, Robert C. Stanton, GlyTwin: Digital Twin for Glucose C...

work page 2023

[3] [3]

Sherr, Lori M

Jennifer L. Sherr, Lori M. Laffel, Jingwen Liu, Wendy A. Wolf, Jeoffrey A. Bispham, Katherine S Chapman, Daniel Finan, Lina Titievsky, Tina Liu, Kaitlin Hagan, Jason L. Gaglia, Keval Chandarana, Richard M. Bergenstal, and Jeremy H. Pettus. Severe hypoglycemia and impaired awareness of hypoglycemia persist in people with type 1 diabetes despite use of diab...

work page 2024

[4] [4]

One-year outcomes of a digi- tal twin intervention for type 2 diabetes: a retrospective real-world study

Paramesh Shamanna, Ravi Sankar Erukulapati, Ashutosh Shukla, Lisa Shah, Bree Willis, Mohamed Thajudeen, Ra- jiv Kovil, Rahul Baxi, Mohsin Wali, Suresh Damodharan, and Shashank R Joshi. One-year outcomes of a digi- tal twin intervention for type 2 diabetes: a retrospective real-world study. Scientific Reports, 14, 2024

work page 2024

[5] [5]

Artifi- cial intelligence in diabetes management: Advancements, opportunities, and challenges

Zhouyu Guan, Huating Li, Ruhan Liu, Chun Cai, Yuex- ing Liu, Jiajia Li, Xiangning Wang, Shan Huang, Liang Wu, Dan Liu, Shujie Yu, Zheyuan Wang, Jia Shu, Xuhong Hou, Xiaokang Yang, Weiping Jia, and Bin Sheng. Artifi- cial intelligence in diabetes management: Advancements, opportunities, and challenges. Cell Reports Medicine, 4, 2023

work page 2023

[6] [6]

Re- playbg: A digital twin-based methodology to identify a personalized model from type 1 diabetes data and simu- late glucose concentrations to assess alternative therapies

Giacomo Cappon, Martina Vettoretti, Giovanni Spara- cino, Simone Del Favero, and Andrea Facchinetti. Re- playbg: A digital twin-based methodology to identify a personalized model from type 1 diabetes data and simu- late glucose concentrations to assess alternative therapies. IEEE Transactions on Biomedical Engineering, 70:3227– 3238, 2023

work page 2023

[7] [7]

System architec- ture of twin: A new digital twin-based clinical decision support system for type 1 diabetes management in chil- dren

Giacomo Cappon, Elisa Pellizzari, Luca Cossu, Giovanni Sparacino, Annalisa Deodati, Riccardo Schiaffini, Ste- fano Cianfarani, and Andrea Facchinetti. System architec- ture of twin: A new digital twin-based clinical decision support system for type 1 diabetes management in chil- dren. 2023 IEEE 19th International Conference on Body Sensor Networks (BSN), ...

work page 2023

[8] [8]

Naveenah Udaya Surian, Arsen Batagov, Andrew Wu, W. B. Lai, Yan Sun, Yong Mong Bee, and Rinkoo Dalan. A digital twin model incorporating generalized metabolic fluxes to identify and predict chronic kidney disease in type 2 diabetes mellitus. NPJ Digital Medicine, 7, 2024

work page 2024

[9] [9]

Digital twin predicting diet response before and after long-term fasting

Oscar Silfvergren, Christian Simonsson, Mattias Ekstedt, Peter Lundberg, Peter Gennemark, and Gunnar Ceder- sund. Digital twin predicting diet response before and after long-term fasting. PLoS Computational Biology, 18, 2021

work page 2021

[10] [10]

Glumarker: A novel predictive modeling of glycemic control through digital biomarkers

Ziyi Zhou, Ming Cheng, Xingjian Diao, Yanjun Cui, and Xiangling Li. Glumarker: A novel predictive modeling of glycemic control through digital biomarkers. 2024 46th Annual International Conference of the IEEE Engineer- ing in Medicine and Biology Society (EMBC), pages 1–7, 2024

work page 2024

[11] [11]

Glysim: Mod- eling and simulating glycemic response for behavioral lifestyle interventions

Asiful Arefeen and Hassan Ghasemzadeh. Glysim: Mod- eling and simulating glycemic response for behavioral lifestyle interventions. 2023 IEEE EMBS International Conference on Biomedical and Health Informatics (BHI), pages 1–5, 2023

work page 2023

[12] [12]

Nurse-in-the- loop artificial intelligence for precision management of type 2 diabetes in a clinical trial utilizing transfer-learned predictive digital twin

Syed Hasib Akhter Faruqui, Adel Alaeddini, Yan Du, Shiyu Li, Kumar Sharma, and Jing Wang. Nurse-in-the- loop artificial intelligence for precision management of type 2 diabetes in a clinical trial utilizing transfer-learned predictive digital twin. ArXiv, abs/2401.02661, 2024

work page arXiv 2024

[13] [13]

Forewarning postprandial hyper- glycemia with interpretations using machine learning

Asiful Arefeen, Samantha N Fessler, Carol Johnston, and Hassan Ghasemzadeh. Forewarning postprandial hyper- glycemia with interpretations using machine learning. 2022 IEEE-EMBS International Conference on Wearable and Implantable Body Sensor Networks (BSN) , pages 1–4, 2022

work page 2022

[14] [14]

why should i trust you?

Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. “why should i trust you?”: Explaining the predictions of any classifier. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016

work page 2016

[15] [15]

Akshay Sood and Mark W. Craven. Feature importance explanations for temporal black-box models. ArXiv, abs/2102.11934, 2021

work page arXiv 2021

[16] [16]

A Unified Approach to Interpreting Model Predictions

Scott M. Lundberg and Su-In Lee. A unified approach to interpreting model predictions. ArXiv, abs/1705.07874, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[17] [17]

Unbiased measurement of feature importance in tree-based methods

Zhengze Zhou and Giles Hooker. Unbiased measurement of feature importance in tree-based methods. ACM Trans- actions on Knowledge Discovery from Data (TKDD) , 15:1 – 21, 2019

work page 2019

[18] [18]

All models are wrong, but many are useful: Learning a variable’s importance by studying an entire class of prediction models simultaneously

Aaron Fisher, Cynthia Rudin, and Francesca Dominici. All models are wrong, but many are useful: Learning a variable’s importance by studying an entire class of prediction models simultaneously. Journal of machine learning research : JMLR, 20, 2018

work page 2018

[19] [19]

Jo˜ao Bento, Pedro Saleiro, Andre Ferreira Cruz, M ´ario A. T. Figueiredo, and P. Bizarro. Timeshap: Explaining re- current models through sequence perturbations. Proceed- ings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, 2020

work page 2020

[20] [20]

What went wrong and when? instance-wise feature im- portance for time-series black-box models

Sana Tonekaboni, Shalmali Joshi, Kieran Campbell, David Kristjanson Duvenaud, and Anna Goldenberg. What went wrong and when? instance-wise feature im- portance for time-series black-box models. In Neural Information Processing Systems, 2020

work page 2020

[21] [21]

and Subhashini R

Saranya A. and Subhashini R. A systematic review of ex- plainable artificial intelligence models and applications: GlyTwin: Digital Twin for Glucose Control in Type 1 Diabetes Through Optimal Behavioral Modifications Using Patient-Centric Counterfactuals — 14/15 Recent developments and future trends. Decision Analyt- ics Journal, 2023

work page 2023

[22] [22]

Entropy- based logic explanations of neural networks

Pietro Barbiero, Gabriele Ciravegna, Francesco Giannini, Pietro Li’o, Marco Gori, and Stefano Melacci. Entropy- based logic explanations of neural networks. ArXiv, abs/2106.06804, 2021

work page arXiv 2021

[23] [23]

Sturm, and No´emie Elhadad

Rich Caruana, Yin Lou, Johannes Gehrke, Paul Koch, M. Sturm, and No´emie Elhadad. Intelligible models for healthcare: Predicting pneumonia risk and hospital 30- day readmission. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015

work page 2015

[24] [24]

Nice: an algo- rithm for nearest instance counterfactual explanations

Dieter Brughmans and David Martens. Nice: an algo- rithm for nearest instance counterfactual explanations. Data Mining and Knowledge Discovery , pages 1–39, 2021

work page 2021

[25] [25]

Use of What-if Scenarios to Help Explain Artificial Intelligence Models for Neonatal Health

Abdullah Mamun, Lawrence D. Devoe, Mark I. Evans, David W. Britt, Judith Klein-Seetharaman, and Hassan Ghasemzadeh. Use of what-if scenarios to help explain artificial intelligence models for neonatal health. ArXiv, abs/2410.09635, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[26] [26]

Gen- erating interpretable counterfactual explanations by im- plicit minimisation of epistemic and aleatoric uncertain- ties

Lisa Schut, Oscar Key, Rory McGrath, Luca Costabello, Bogdan Sacaleanu, Medb Corcoran, and Yarin Gal. Gen- erating interpretable counterfactual explanations by im- plicit minimisation of epistemic and aleatoric uncertain- ties. In International Conference on Artificial Intelligence and Statistics, 2021

work page 2021

[27] [27]

Designing user-centric behavioral interventions to prevent dysg- lycemia with novel counterfactual explanations

Asiful Arefeen and Hassan Ghasemzadeh. Designing user-centric behavioral interventions to prevent dysg- lycemia with novel counterfactual explanations. ArXiv, abs/2310.01684, 2023

work page arXiv 2023

[28] [28]

Glyman: Glycemic management using patient-centric counterfac- tuals

Asiful Arefeen, Saman Khamesian, Mar´ıa Adela Grando, Bithika Thompson, and Hassan Ghasemzadeh. Glyman: Glycemic management using patient-centric counterfac- tuals. 2024 IEEE EMBS International Conference on Biomedical and Health Informatics (BHI) , pages 1–5, 2024

work page 2024

[29] [29]

A novel method to derive personalized minimum viable recommendations for type 2 diabetes prevention based on counterfactual explanations

Marta Lenatti, Alberto Carlevaro, Aziz Guergachi, Karim Keshavjee, Maurizio Mongelli, and Alessia Paglialonga. A novel method to derive personalized minimum viable recommendations for type 2 diabetes prevention based on counterfactual explanations. PLOS ONE, 17, 2022

work page 2022

[30] [30]

Characterization of type 2 diabetes using counterfactuals and explainable ai

Marta Lenatti, Alberto Carlevaro, Karim Keshavjee, Aziz Guergachi, Alessia Paglialonga, and Maurizio Mongelli. Characterization of type 2 diabetes using counterfactuals and explainable ai. Studies in health technology and informatics, 294:98–103, 2022

work page 2022

[31] [31]

Realistic coun- terfactual explanations by learned relations

Xintao Xiang and Artem Lenskiy. Realistic coun- terfactual explanations by learned relations. ArXiv, abs/2202.07356, 2022

work page arXiv 2022

[32] [32]

Enhancing metabolic syndrome prediction with hybrid data balanc- ing and counterfactuals

Sanyam Paresh Shah, Abdullah Mamun, Shovito Barua Soumma, and Hassan Ghasemzadeh. Enhancing metabolic syndrome prediction with hybrid data balanc- ing and counterfactuals. volume abs/2504.06987, 2025

work page arXiv 2025

[33] [33]

Directive explanations for monitoring the risk of diabetes onset: Introducing directive data- centric explanations and combinations to support what- if explorations

Aditya Bhattacharya, Jeroen Ooge, Gregor Stiglic, and Katrien Verbert. Directive explanations for monitoring the risk of diabetes onset: Introducing directive data- centric explanations and combinations to support what- if explorations. Proceedings of the 28th International Conference on Intelligent User Interfaces, 2023

work page 2023

[34] [34]

Explaining machine learning classifiers through diverse counterfactual explanations

Ramaravind K Mothilal, Amit Sharma, and Chenhao Tan. Explaining machine learning classifiers through diverse counterfactual explanations. In Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency, pages 607–617, 2020

work page 2020

[35] [35]

Optimal Counterfac- tual Explanations for Scorecard modelling

Guillermo Navas-Palencia. Optimal Counterfac- tual Explanations for Scorecard modelling. ArXiv, abs/2104.08619, 2021

work page arXiv 2021

[36] [36]

A model-agnostic and data- independent tabu search algorithm to generate counterfac- tuals for tabular, image, and text data

Raphael Mazzine Barbosa de Oliveira, Kenneth S¨orensen, and David Martens. A model-agnostic and data- independent tabu search algorithm to generate counterfac- tuals for tabular, image, and text data. European Journal of Operational Research, 2023

work page 2023

[37] [37]

CounterNet: End-to-End Training of Prediction Aware Counterfactual Explanations

Hangzhi Guo, Thanh Hong Nguyen, and Amulya Yadav. CounterNet: End-to-End Training of Prediction Aware Counterfactual Explanations. Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

work page 2021

[38] [38]

Buck- ingham

Fraser Cameron, G¨unter Niemeyer, and Bruce A. Buck- ingham. Probabilistic evolving meal detection and es- timation of meal total glucose appearance. Journal of Diabetes Science and Technology, 3:1022 – 1030, 2009

work page 2009

[39] [39]

Corbett, Marc D

John P. Corbett, Marc D. Breton, and Stephen D. Patek. A multiple hypothesis approach to estimating meal times in individuals with type 1 diabetes. Journal of Diabetes Science and Technology, 15:141 – 146, 2019

work page 2019

[40] [40]

Peters, Michelle A Van Name, Brian Larsen Thorsted, Johanne Spanggaard Piltoft, and William V Tamborlane

Anne L. Peters, Michelle A Van Name, Brian Larsen Thorsted, Johanne Spanggaard Piltoft, and William V Tamborlane. Postprandial dosing of bolus insulin in pa- tients with type 1 diabetes: A cross-sectional study using data from the t1d exchange registry. Endocrine practice : official journal of the American College of Endocrinology and the American Associa...

work page 2017

[41] [41]

Daenen, Agn `es Sola-Gazagnes, Jocelyne M’bemba, C Dorange-Breillard, F Defer, Fabienne Elgrably, Etienne Larger, and G´erard Slama

S. Daenen, Agn `es Sola-Gazagnes, Jocelyne M’bemba, C Dorange-Breillard, F Defer, Fabienne Elgrably, Etienne Larger, and G´erard Slama. Peak-time determination of post-meal glucose excursions in insulin-treated diabetic patients. Diabetes & metabolism, 36 2:165–9, 2010

work page 2010

[42] [42]

Explanation in Artificial Intelligence: In- sights from the Social Sciences

Tim Miller. Explanation in Artificial Intelligence: In- sights from the Social Sciences. Artif. Intell., 267:1–38, 2017

work page 2017

[43] [43]

Sandra Wachter, Brent Daniel Mittelstadt, and Chris Rus- sell. Counterfactual Explanations Without Opening the GlyTwin: Digital Twin for Glucose Control in Type 1 Diabetes Through Optimal Behavioral Modifications Using Patient-Centric Counterfactuals — 15/15 Black Box: Automated Decisions and the GDPR. Cyber- security, 2017

work page 2017

[44] [44]

Counterfactual Fairness through Transforming Data Orthogonal to Bias

Shuyi Chen and Shixiang Zhu. Counterfactual Fairness through Transforming Data Orthogonal to Bias. ArXiv, abs/2403.17852, 2024

work page arXiv 2024

[45] [45]

Counterfactual explanations and how to find them: literature review and benchmarking

Riccardo Guidotti. Counterfactual explanations and how to find them: literature review and benchmarking. Data Mining and Knowledge Discovery, pages 1–55, 2022

work page 2022