EngageTriBoost: Predictive Modeling of User Engagement in Digital Mental Health Intervention Using Explainable Machine Learning
Pith reviewed 2026-05-13 23:19 UTC · model grok-4.3
The pith
An ensemble model called EngageTriBoost predicts user engagement in a digital mental health program for college students with up to 84 percent accuracy and uses SHAP to link emotional dysregulation and stigma to participation levels.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that the EngageTriBoost ensemble achieves up to 84 percent accuracy when predicting engagement, defined as sign-ins and counselor interactions, and that SHAP analysis applied afterward identifies emotional dysregulation and perceived stigma as the factors with the largest effects on whether users stay with the digital mental health intervention.
What carries the argument
The EngageTriBoost ensemble model paired with Shapley Additive exPlanations (SHAP) to generate interpretable feature rankings from user sign-in and interaction data.
If this is right
- Programs could use early predictions to send targeted reminders or simplified onboarding to users flagged as likely to disengage.
- Design changes that reduce perceived stigma or support emotional regulation could raise overall sign-in and interaction rates.
- Resource allocation in counseling services could shift toward users the model identifies as high-engagement prospects.
- The same pipeline of ensemble prediction plus SHAP ranking could be reused on data from similar motivational-interviewing platforms.
- Higher sustained engagement would increase the fraction of at-risk students who complete the referral to in-person professional care.
Where Pith is reading between the lines
- Testing the model in a randomized trial where one arm receives SHAP-guided adjustments could reveal whether the identified factors are modifiable enough to move engagement numbers.
- Similar explainable models might transfer to other digital health domains such as chronic-disease self-management apps where dropout is also common.
- If emotional dysregulation ranks high across datasets, it suggests screening for that trait at enrollment could become standard for improving retention.
- Longer-term follow-up data would show whether the users the model predicts as engaged actually experience better mental-health outcomes than low-engagement users.
Load-bearing premise
The study assumes that the behavioral patterns and self-reported features observed in the eBridge dataset of college students are representative enough to generalize and that the SHAP rankings point to drivers that can be changed rather than fixed correlations.
What would settle it
Apply the trained model to a fresh, independent set of users from another digital mental health program and measure whether accuracy falls below 70 percent or whether the top SHAP-ranked features shift to different variables.
Figures
read the original abstract
Mental health challenges among young adults, are on the rise, necessitating effective solutions such as digital mental health interventions (DMHIs). Despite their promise, DMHIs face significant adoption barriers, including low initial uptake and high dropout rates. This study leverages machine learning (ML) to analyze behavioral patterns of users of a DMHI, eBridge, designed to increase the utilization of professional mental health services among at-risk college students through motivational interviewing-based online counseling. Our ensemble model, EngageTriBoost, achieved up to 84% accuracy in predicting engagement, measured by sign-ins and counselor interactions. We then applied the Shapley Additive exPlanations (SHAP) analysis which provided clear, interpretable insights into key factors influencing user engagement such as emotional dysregulation and perceived stigma, highlighting their critical effect on DMHI adoption. This study demonstrates the power of explainable ML for better understanding user engagement with DMHI to improve their adoption and achievable impact on mental health outcomes.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents EngageTriBoost, an ensemble machine learning model for predicting user engagement (measured by sign-ins and counselor interactions) in the eBridge digital mental health intervention for at-risk college students. It reports up to 84% accuracy and applies SHAP analysis to identify interpretable factors such as emotional dysregulation and perceived stigma as key influences on engagement and DMHI adoption.
Significance. If the performance and interpretability claims are supported by rigorous validation, the work could aid in identifying barriers to DMHI uptake and informing targeted improvements. The use of an ensemble approach combined with post-hoc explainability is a positive step toward actionable insights in mental health applications. However, missing methodological details and the correlational nature of SHAP limit the strength of conclusions about critical effects or generalizability.
major comments (2)
- [Abstract] Abstract: The central performance claim of 'up to 84% accuracy' is stated without any information on dataset size (e.g., number of users or sessions in eBridge), validation strategy (train-test split, cross-validation folds), baseline comparisons, or handling of class imbalance. This absence makes it impossible to determine whether the result exceeds trivial baselines or supports the predictive modeling contribution.
- [SHAP Analysis] SHAP Analysis section: The manuscript states that SHAP 'provided clear, interpretable insights into key factors influencing user engagement such as emotional dysregulation and perceived stigma, highlighting their critical effect on DMHI adoption.' SHAP values quantify additive feature contributions to model predictions on the observed distribution and do not perform causal inference or counterfactual reasoning. No mention is made of causal methods, randomized elements in the eBridge design, or confounder sensitivity checks, so the leap from correlation to 'critical effect' is unsupported.
minor comments (1)
- [Abstract] The abstract and introduction could more explicitly define the engagement labels (sign-ins vs. counselor interactions) and any preprocessing steps applied to the behavioral and self-reported features.
Simulated Author's Rebuttal
We thank the referee for the constructive comments, which help strengthen the clarity and precision of our work. We address each major comment point by point below.
read point-by-point responses
-
Referee: [Abstract] Abstract: The central performance claim of 'up to 84% accuracy' is stated without any information on dataset size (e.g., number of users or sessions in eBridge), validation strategy (train-test split, cross-validation folds), baseline comparisons, or handling of class imbalance. This absence makes it impossible to determine whether the result exceeds trivial baselines or supports the predictive modeling contribution.
Authors: We agree that the abstract should be self-contained to allow readers to evaluate the performance claim. The full manuscript details the eBridge dataset size, 5-fold cross-validation strategy, baseline comparisons (including logistic regression and random forest), and class imbalance handling via weighted objectives. To address the referee's concern, we will revise the abstract to concisely report the sample size, validation approach, and that accuracy exceeds the majority-class baseline. revision: yes
-
Referee: [SHAP Analysis] SHAP Analysis section: The manuscript states that SHAP 'provided clear, interpretable insights into key factors influencing user engagement such as emotional dysregulation and perceived stigma, highlighting their critical effect on DMHI adoption.' SHAP values quantify additive feature contributions to model predictions on the observed distribution and do not perform causal inference or counterfactual reasoning. No mention is made of causal methods, randomized elements in the eBridge design, or confounder sensitivity checks, so the leap from correlation to 'critical effect' is unsupported.
Authors: We concur that SHAP quantifies correlational feature contributions and does not support causal claims. The eBridge data is observational with no randomized elements or causal methods applied. We will revise the wording in the abstract and SHAP section from 'critical effect' to 'key associations' to accurately reflect the correlational nature of the results. We will also add a limitations paragraph noting that SHAP insights are associative and that causal inference would require additional methods or study designs. revision: yes
Circularity Check
No significant circularity in the modeling or explanation pipeline
full rationale
The paper describes a standard supervised ML workflow: features from the eBridge dataset are used to train the EngageTriBoost ensemble to predict engagement (sign-ins and counselor interactions), with accuracy evaluated on held-out data. SHAP is then applied post-hoc to attribute contributions to the trained model's outputs. No derivation step reduces the target variable or predictions to quantities defined by the fitted parameters themselves, no self-citation chain is load-bearing for the core claims, and no ansatz or uniqueness theorem is smuggled in. The reported accuracy and feature attributions are computed independently of the input definitions, making the chain self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Our ensemble model, EngageTriBoost, achieved up to 84% accuracy in predicting engagement... We then applied the Shapley Additive exPlanations (SHAP) analysis which provided clear, interpretable insights into key factors influencing user engagement such as emotional dysregulation and perceived stigma.
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.leanLogicNat recovery unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
ETB integrates XGBoost, LightGBM, and CatBoost as base learners with logistic regression as a meta-learner, tuned through cross-validation.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
C. A. King, D. Eisenberg, K. Zheng, E. Czyz, A. Kramer, A. Horwitz, and S. Chermack, “Online suicide risk screening and intervention with college students: a pilot randomized controlled trial,” J Consult Clin Psychol, vol. 83, no. 3, pp. 630-6, 2015. [9] C. A. King, D. Eisenberg, and J. Pistorello, “Electronic bridge to mental health for college students:...
work page 2015
-
[2]
K. Aschbacher, L. M. Rivera, S. Hornstein, B. W. Nelson, V. Forman-Hoffman, and N. Peiper, “Longitudinal patterns of engagement and clinical outcomes: results from a therapist-supported digital mental health intervention,” Psychosom Med, 2023. [29] Y. Huang, H. Liu, S. Li, W. Wang, and Z. Zhou, “Effective prediction and important counseling experience for...
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.