pith. sign in

arxiv: 2601.01119 · v2 · submitted 2026-01-03 · 💻 cs.LG

Community-Based Early-Stage Chronic Kidney Disease Screening using Explainable Machine Learning for Low-Resource Settings

Pith reviewed 2026-05-16 18:10 UTC · model grok-4.3

classification 💻 cs.LG
keywords chronic kidney diseaseearly detectionmachine learningcommunity screeninglow-resource settingsfeature selectionexplainable AIBangladesh
0
0 comments X

The pith

Machine learning models detect early-stage chronic kidney disease with over 89 percent balanced accuracy using minimal accessible features in low-resource Bangladeshi communities.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper builds an explainable machine learning system to identify early chronic kidney disease before it advances, using data from community settings in Bangladesh where standard tools fall short. Existing scoring methods come from high-income populations and later disease stages, so they miss interactions among local risk factors and demand inputs that are hard to obtain locally. By testing many feature-selection techniques and classifiers, the authors find that a small set of non-laboratory variables already yields 89.23 percent balanced accuracy and often beats models that include full lab results. These models also show higher sensitivity than current screening tools while needing fewer measurements, and they hold up on separate datasets from India, the UAE, and Bangladesh. The result matters because catching the disease early in places with limited dialysis or transplant access can slow progression and reduce long-term health-system costs.

Core claim

An ML model trained on an RFECV-selected feature subset reached 90.40 percent balanced accuracy for early-stage CKD, while a minimal set of non-pathology-test features alone delivered 89.23 percent balanced accuracy and frequently outperformed larger feature collections. These models exceeded the accuracy and sensitivity of established CKD screening tools while using fewer and more readily available inputs. External validation on independent datasets from India, the UAE, and Bangladesh produced sensitivities between 78 percent and 98 percent.

What carries the argument

The explainable ML framework that applies ten complementary feature-selection methods to identify robust predictor subsets, evaluates twelve classifiers with nested cross-validation, and uses SHAP values to interpret predictions.

Load-bearing premise

The community-based CKD dataset from Bangladesh accurately represents the target population's risk profiles and the selected minimal features remain consistently measurable without major error or bias across varied low-resource settings.

What would settle it

A new community-collected dataset from a different low-resource South Asian region in which the minimal non-pathology model drops below 70 percent balanced accuracy would falsify the claim of strong generalizability.

Figures

Figures reproduced from arXiv: 2601.01119 by Dewan Tasnia Azad, Mohammad Habibur Rahman Sarker, Muhammad Ashad Kabir, Saleh Mohammed Ikram, Sirajam Munira, Syed Manzoor Ahmed Hanifi.

Figure 1
Figure 1. Figure 1: A schematic overview of our methodology 3.1. Datasets The dataset used in this study originates from a community-based CKD screening conducted in the Mirzapur sub-district of Tangail, Bangladesh, a rural and peri-urban region covered by the Mirzapur demographic surveillance system (DSS). Adults aged ≥ 18 years with at least five years of residency were selected using age-stratified random sampling, yieldin… view at source ↗
Figure 2
Figure 2. Figure 2: Performance comparison of machine learning models trained on three di [PITH_FULL_IMAGE:figures/full_fig_p018_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Confusion matrices illustrating the prediction results for CKD and non-CKD cases using three feature configurations: (a) the full feature [PITH_FULL_IMAGE:figures/full_fig_p019_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: SHAP summary plot illustrating the contribution of the best-performing S1 feature set to the Decision Tree model’s predictions for CKD [PITH_FULL_IMAGE:figures/full_fig_p022_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: SHAP waterfall plot for a correctly classified (a) CKD and (b) Non-CKD case, illustrating the feature values contributing to the model [PITH_FULL_IMAGE:figures/full_fig_p022_5.png] view at source ↗
read the original abstract

Early detection of chronic kidney disease (CKD) is essential for preventing progression to end-stage renal disease. However, existing screening tools - primarily developed using populations from high-income countries - often underperform in Bangladesh and South Asia, where risk profiles differ. Most of these tools rely on simple additive scoring functions and are based on data from patients with advanced-stage CKD. Consequently, they fail to capture complex interactions among risk factors and are limited in predicting early-stage CKD. Our objective was to develop and evaluate an explainable machine learning (ML) framework for community-based early-stage CKD screening for low-resource settings, tailored to the Bangladeshi and South Asian population context. A community-based CKD dataset from Bangladesh was used to develop predictive models. Variables were organized into clinically meaningful feature groups, and ten complementary feature selection methods were applied to identify robust predictor subsets. Twelve ML classifiers were evaluated using nested cross-validation. Model performance was benchmarked against established CKD screening tools and externally validated on three independent datasets from India, the UAE, and Bangladesh. SHAP was used to interpret model predictions. An ML model trained on an RFECV-selected feature subset achieved a balanced accuracy of 90.40%, whereas minimal non-pathology-test features demonstrated excellent predictive capability with a balanced accuracy of 89.23%, often outperforming larger or full feature sets. Compared with existing screening tools, the proposed models achieved substantially higher accuracy and sensitivity while requiring fewer and more accessible inputs. External validation confirmed strong generalizability with 78% to 98% sensitivity.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper claims to develop an explainable ML framework for community-based early-stage CKD screening tailored to low-resource Bangladeshi/South Asian settings. Using a Bangladesh community dataset, it organizes variables into feature groups, applies ten feature selection methods including RFECV, evaluates twelve classifiers via nested cross-validation, reports balanced accuracies of 90.40% (RFECV subset) and 89.23% (minimal non-pathology features), shows these outperform existing additive screening tools in accuracy/sensitivity while using fewer accessible inputs, provides SHAP interpretations, and externally validates on three independent datasets (India, UAE, Bangladesh) with 78-98% sensitivity.

Significance. If the performance and generalizability claims hold after addressing missing details, the work would be significant for low-resource CKD screening: it demonstrates that minimal non-lab features can achieve near-full performance, provides SHAP-based explanations for clinical trust, and reports external validation across sites. This directly addresses the documented underperformance of high-income-country tools in South Asia and offers a practical, data-driven alternative to simple scoring systems with potential for community deployment.

major comments (3)
  1. [Abstract/Methods] Abstract and Methods: the reported balanced accuracies (90.40% RFECV, 89.23% minimal features) and superiority claims rest on unspecified data preprocessing, class-imbalance handling, and confounding-variable controls; without these, the nested-CV results cannot be fully evaluated for robustness.
  2. [Results] Results/External validation: only a sensitivity range (78-98%) is provided for the three independent datasets; full metrics (balanced accuracy, specificity, PPV) and explicit confirmation that the identical minimal feature set was measured consistently across sites are required to support the generalizability claim.
  3. [Methods] Methods: RFECV feature selection performed on the full training data before nested CV introduces data-driven dependency; the manuscript must clarify how selection was isolated from final evaluation to avoid over-optimism in the reported outperformance versus full feature sets.
minor comments (2)
  1. [Methods] Clarify the exact definition of 'early-stage CKD' labels and any exclusion criteria in the Bangladesh dataset to aid reproducibility.
  2. [Results] Figure legends and SHAP plots would benefit from explicit mapping of feature names to clinical variables for non-ML readers.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their thoughtful and constructive review. We have carefully addressed each major comment by expanding methodological details, providing additional performance metrics, and clarifying the nested cross-validation procedure. These revisions will improve transparency and strengthen the manuscript without altering our core findings.

read point-by-point responses
  1. Referee: [Abstract/Methods] Abstract and Methods: the reported balanced accuracies (90.40% RFECV, 89.23% minimal features) and superiority claims rest on unspecified data preprocessing, class-imbalance handling, and confounding-variable controls; without these, the nested-CV results cannot be fully evaluated for robustness.

    Authors: We agree that greater detail is required for reproducibility. In the revised Methods section, we will explicitly describe all preprocessing steps (missing-value imputation via median/mode, z-score normalization), class-imbalance handling (stratified k-fold splits plus class-weighting in all classifiers), and confounder controls (age- and sex-stratified folds plus sensitivity analyses excluding hypertension/diabetes). These additions will allow full evaluation of the nested-CV robustness. revision: yes

  2. Referee: [Results] Results/External validation: only a sensitivity range (78-98%) is provided for the three independent datasets; full metrics (balanced accuracy, specificity, PPV) and explicit confirmation that the identical minimal feature set was measured consistently across sites are required to support the generalizability claim.

    Authors: We will add a new supplementary table reporting balanced accuracy, specificity, PPV, and NPV for each external dataset individually. The identical minimal non-pathology feature set (age, sex, BMI, hypertension, diabetes, family history, lifestyle variables) was applied uniformly across all sites; we will state this explicitly in the revised Results and Methods to support the generalizability claim. revision: yes

  3. Referee: [Methods] Methods: RFECV feature selection performed on the full training data before nested CV introduces data-driven dependency; the manuscript must clarify how selection was isolated from final evaluation to avoid over-optimism in the reported outperformance versus full feature sets.

    Authors: We acknowledge the risk of over-optimism. RFECV was executed inside the inner loop of nested CV on each training fold only, with the outer test fold held completely out; the selected features were then used solely for final evaluation on the outer fold. We will insert a detailed description plus pseudocode in the revised Methods to make this isolation explicit and confirm that reported performance reflects truly unseen data. revision: yes

Circularity Check

0 steps flagged

No significant circularity; performance claims rest on nested CV and external validation

full rationale

The paper applies ten feature selection methods including RFECV then evaluates twelve classifiers via nested cross-validation, with additional benchmarking against existing tools and external validation on three independent datasets (India, UAE, Bangladesh). This structure separates feature selection from final performance estimation, preventing the reported balanced accuracies (90.40% and 89.23%) from reducing to the training inputs by construction. No self-citations, uniqueness theorems, or ansatz smuggling appear in the derivation; the results are empirical measurements on held-out and external data rather than definitional or fitted-input predictions.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The central claim rests on the representativeness of the single community dataset and the assumption that ML performance on held-out and external data will translate to real-world screening utility; no new physical entities are introduced.

free parameters (2)
  • ML model hyperparameters
    Standard tuning during nested cross-validation for the twelve classifiers; values not reported in abstract.
  • Feature selection thresholds
    RFECV and ten other methods involve implicit cutoffs chosen to optimize performance on the training data.
axioms (1)
  • domain assumption The collected community variables accurately reflect true risk factor distributions for early-stage CKD in the target population.
    Invoked when claiming generalizability from the Bangladesh dataset to South Asia and external sites.

pith-pipeline@v0.9.0 · 5611 in / 1433 out tokens · 45598 ms · 2026-05-16T18:10:37.788355+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

75 extracted references · 75 canonical work pages · 1 internal anchor

  1. [1]

    K. J. Jager, C. Kovesdy, R. Langham, M. Rosenberg, V . Jha, C. Zoccali, A single number for advocacy and communication—worldwide more than 850 million individuals have kidney diseases, Nephrology Dialysis Transplantation 34 (2019) 1803–1805. doi:10.1093/ndt/gfz174

  2. [2]

    N. R. Hill, S. T. Fatoba, J. L. Oke, J. A. Hirst, C. A. O’Callaghan, D. S. Lasserson, F. R. Hobbs, Global prevalence of chronic kidney disease–a systematic review and meta-analysis, PloS one 11 (2016) e0158765. doi:10.1371/journal.pone.0158765

  3. [3]

    Bikbov, C

    B. Bikbov, C. A. Purcell, A. S. Levey, M. Smith, A. Abdoli, M. Abebe, C. J. Murray, Others, Global, regional, and national burden of chronic kidney disease, 1990–2017: a systematic analysis, The Lancet 395 (2020) 709–733. doi:10.1016/S0140-6736(20)30045-3

  4. [4]

    Abraham, S

    G. Abraham, S. Varughese, T. Thandavan, A. Iyengar, E. Fernando, S. J. Naqvi, R. Sheriff, H. Ur-Rashid, N. Gopalakrishnan, R. K. Kafle, Chronic kidney disease hotspots in developing countries in south asia, Clinical kidney journal 9 (2016) 135–141. doi:10.1093/ckj/sfv109

  5. [5]

    Anand, M

    S. Anand, M. A. Khanam, J. Saquib, N. Saquib, T. Ahmed, D. S. Alam, M. R. Cullen, M. Barry, G. M. Chertow, High prevalence of chronic kidney disease in a community survey of urban bangladeshis: a cross-sectional study, Globalization and health 10 (2014) 9. doi:10.1186/ 1744-8603-10-9

  6. [6]

    M. H. R. Sarker, M. Moriyama, H. U. Rashid, M. J. Chisti, M. M. Rahman, S. K. Das, A. Uddin, S. K. Saha, S. E. Arifeen, T. Ahmed, et al., Community-based screening to determine the prevalence, health and nutritional status of patients with ckd in rural and peri-urban bangladesh, Therapeutic advances in chronic disease 12 (2021) 20406223211035281. doi:10.1...

  7. [7]

    Banik, A

    S. Banik, A. Ghosh, Prevalence of chronic kidney disease in bangladesh: a systematic review and meta-analysis, International urology and nephrology 53 (2021) 713–718. doi:10.1007/s11255-020-02597-6

  8. [8]

    Masset, R

    M. Iqbal, R. Hossain, K. Hossain, M. Faroque, S. Islam, S. Iqbal, M. Chowdhury, Knowledge, attitude, and perception about renal transplantation of ckd patients, caregivers, and general population, Transplantation Proceedings 50 (2018) 2323–2326. doi:10.1016/j. transproceed.2018.04.048

  9. [9]

    Pollock, J.-y

    C. Pollock, J.-y. Moon, P. Gojaseni, C. H. Ching, L. Gomez, T. M. Chan, M.-J. Wu, S. C. Yeo, P. Nugroho, A. K. Bhalla, et al., Framework of guidelines for management of ckd in asia, Kidney International Reports 9 (2024) 752–790. doi:10.1016/j.ekir.2023.12.010

  10. [10]

    L. C. Plantinga, L. E. Boulware, J. Coresh, L. A. Stevens, E. R. Miller, R. Saran, K. L. Messer, A. S. Levey, N. R. Powe, Patient awareness of chronic kidney disease: trends and predictors, Archives of internal medicine 168 (2008) 2268–2275. doi:10.1001/archinte.168.20. xxviii 2268

  11. [11]

    A. S. Levey, J. Coresh, Chronic kidney disease, The Lancet 399 (2022) 129–144. doi:10.1016/S0140-6736(21)00519-5

  12. [12]

    V . A. Luyckx, M. Tonelli, J. W. Stanifer, The global burden of kidney disease and the sustainable development goals, Bulletin of the World Health Organization 96 (2021) 414–422. doi:10.2471/BLT.17.206441

  13. [13]

    Niang, A

    A. Niang, A. Iyengar, V . A. Luyckx, Hemodialysis versus peritoneal dialysis in resource-limited settings, Current opinion in nephrology and hypertension 27 (2018) 463–471. doi:10.1097/MNH.0000000000000455

  14. [14]

    Levin, S

    A. Levin, S. B. Ahmed, J. J. Carrero, B. Foster, A. Francis, R. K. Hall, W. G. Herrington, G. Hill, L. A. Inker, R. Kazancıo ˘glu, et al., Executive summary of the kdigo 2024 clinical practice guideline for the evaluation and management of chronic kidney disease: known knowns and known unknowns, Kidney international 105 (2024) 684–701. doi:10.1016/j.kint....

  15. [15]

    J. W. Stanifer, A. Muiru, T. H. Jafar, U. D. Patel, Chronic kidney disease in low- and middle-income countries, Nephrology Dialysis Transplantation 31 (2016) 868–874. doi:10.1093/ndt/gfv466

  16. [16]

    Kabir, M

    A. Kabir, M. N. Karim, B. Billah, The capacity of primary healthcare facilities in bangladesh to prevent and control non-communicable diseases, BMC Primary Care 24 (2023) 60. doi:10.1186/s12875-023-02016-6

  17. [17]

    Z. Zeba, K. Fatema, A. F. Sumit, R. Zinnat, L. Ali, Early screening of chronic kidney disease patients among the asymptomatic adult population in bangladesh, Journal of Preventive Epidemiology 5 (2020) e10–e10. doi:10.34172/jpe.2020.10

  18. [18]

    T. H. Jafar, C. Ramakrishnan, O. John, A. Tewari, B. Cobb, H. Legido-Quigley, Y . Sungwon, V . Jha, Access to ckd care in rural communities of india: a qualitative study exploring the barriers and potential facilitators, BMC nephrology 21 (2020) 26. doi:10.1186/s12882-020- 1702-6

  19. [19]

    C. M. J. Nazar, T. B. Kindratt, S. M. A. Ahmad, M. Ahmed, J. Anderson, Barriers to the successful practice of chronic kidney diseases at the primary health care level; a systematic review, Journal of renal injury prevention 3 (2014) 61. doi:10.12861/jrip.2014.20

  20. [20]

    L. S. Kahn, B. M. Vest, N. Madurai, R. Singh, T. R. York, C. W. Cipparone, S. Reilly, K. S. Malik, C. H. Fox, Chronic kidney disease (ckd) treatment burden among low-income primary care patients, Chronic illness 11 (2015) 171–183. doi:10.1177/1742395314559751

  21. [21]

    H. Bang, S. Vupputuri, D. A. Shoham, P. J. Klemmer, R. J. Falk, M. Mazumdar, D. Gipson, R. E. Colindres, A. V . Kshirsagar, Screening for occult renal disease (scored): a simple prediction model for chronic kidney disease, Archives of internal medicine 167 (2007) 374–381. doi:10.1001/archinte.167.4.374

  22. [22]

    Stolpe, B

    S. Stolpe, B. Kowall, D. Zwanziger, M. Frank, K.-H. Joeckel, R. Erbel, A. Stang, External validation of six clinical models for prediction of chronic kidney disease in a german population, BMC nephrology 23 (2022) 272. doi:10.1186/s12882-022-02899-0

  23. [23]

    A. V . Kshirsagar, H. Bang, A. S. Bomback, S. Vupputuri, D. A. Shoham, L. M. Kern, P. J. Klemmer, M. Mazumdar, P. A. August, A simple algorithm to predict incident kidney disease, Archives of internal medicine 168 (2008) 2466–2473. doi:10.1001/archinte.168.22.2466

  24. [24]

    Thakkinstian, A

    A. Thakkinstian, A. Ingsathit, A. Chaiprasert, S. Rattanasiri, P. Sangthawan, P. Gojaseni, K. Kiattisunthorn, L. Ongaiyooth, P. Thirakhupt, A simplified clinical prediction score of chronic kidney disease: a cross-sectional-survey study, BMC nephrology 12 (2011) 45. doi:10.1186/ 1471-2369-12-45

  25. [25]

    Kearns, H

    B. Kearns, H. Gallagher, S. de Lusignan, Predicting the prevalence of chronic kidney disease in the english population: a cross-sectional study, BMC nephrology 14 (2013) 49. doi:10.1186/1471-2369-14-49

  26. [26]

    K.-S. Kwon, H. Bang, A. S. Bomback, D.-h. Koh, J.-H. Yum, J.-H. Lee, S. Lee, S. K. Park, K.-Y . Yoo, S. K. Park, et al., A simple prediction score for kidney disease in the korean population, Nephrology 17 (2012) 278–284. doi:10.1111/j.1440-1797.2011.01552.x

  27. [27]

    Sanmarchi, C

    F. Sanmarchi, C. Fanconi, D. Golinelli, D. Gori, T. Hernandez-Boussard, A. Capodici, Predict, diagnose, and treat chronic kidney disease with machine learning: a systematic literature review, Journal of nephrology 36 (2023) 1101–1117

  28. [28]

    Delrue, S

    C. Delrue, S. De Bruyne, M. M. Speeckaert, Application of machine learning in chronic kidney disease: current status and future prospects, Biomedicines 12 (2024) 568. doi:10.3390/biomedicines12030568

  29. [29]

    Gogoi, J

    P. Gogoi, J. A. Valan, Machine learning approaches for predicting and diagnosing chronic kidney disease: current trends, challenges, solutions, and future directions, International Urology and Nephrology 57 (2025) 1245–1268

  30. [30]

    Sabanayagam, R

    C. Sabanayagam, R. Banu, C. Lim, Y . C. Tham, C.-Y . Cheng, G. Tan, E. Ekinci, B. Sheng, G. McKay, J. E. Shaw, et al., Artificial intelligence xxix in chronic kidney disease management: a scoping review, Theranostics 15 (2025) 4566

  31. [31]

    A. Khan, Y . Tayyebi, A comprehensive analysis to detect chronic kidney disease and stage prediction: Using machine learning, Asian Journal of Research in Computer Science 18 (2025) 151–162

  32. [32]

    Rubini, P

    L. Rubini, P. Soundarapandian, P. Eswaran, Chronic Kidney Disease, UCI Machine Learning Repository, 2015. doi:10.24432/C5G020

  33. [33]

    Dharmarathne, M

    G. Dharmarathne, M. Bogahawaththa, M. McAfee, U. Rathnayake, D. Meddage, On the diagnosis of chronic kidney disease using a machine learning-based interface with explainable artificial intelligence, Intelligent Systems with Applications 22 (2024) 200397

  34. [34]

    M. A. Islam, M. Z. H. Majumder, M. A. Hussein, Chronic kidney disease prediction based on machine learning algorithms, Journal of pathology informatics 14 (2023) 100189

  35. [35]

    Pujitha, N

    K. Pujitha, N. B. Soni, L. F. Eram, P. N. Sai, S. Divija, R. S. Supriya, Chronic kidney disease detection using machine learning approach, in: 2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN), IEEE, 2023, pp. 1–5

  36. [36]

    Gogoi, J

    P. Gogoi, J. A. Valan, Chronic kidney disease prediction using machine learning techniques: a comparative study of feature selection methods with smote and shap, Multiscale and Multidisciplinary Modeling, Experiments and Design 8 (2025) 1–23

  37. [37]

    M. H. I. Bijoy, M. J. Mia, M. M. Rahman, M. S. Arefin, P. K. Dhar, T. Shimamura, A robot process automation based mobile application for early prediction of chronic kidney disease using machine learning, Discover Applied Sciences 7 (2025) 528

  38. [38]

    K. T. Jawad, A. Verma, F. Amsaad, L. Ashraf, A study on the application of explainable ai on ensemble models for predictive analysis of chronic kidney disease, IEEE Access (2025)

  39. [39]

    G. U. Nneji, H. N. Monday, V . S. R. Pathapati, S. Nahar, G. T. Mgbejime, E. S. Umana, M. A. Hossin, Ffs-iml: fusion-based statistical feature selection for machine learning-driven interpretability of chronic kidney disease, International Journal of Machine Learning and Cybernetics (2025) 1–34

  40. [40]

    D. A. Debal, T. M. Sitote, Chronic kidney disease prediction using machine learning techniques, Journal of Big Data 9 (2022) 109

  41. [41]

    Zheng, X

    J.-X. Zheng, X. Li, J. Zhu, S.-Y . Guan, S.-X. Zhang, W.-M. Wang, Interpretable machine learning for predicting chronic kidney disease progression risk, Digital Health 10 (2024) 20552076231224225

  42. [42]

    S. K. Ghosh, A. H. Khandoker, Investigation on explainable machine learning models to predict chronic kidney diseases, Scientific Reports 14 (2024) 3687

  43. [43]

    Khalil, K

    W. Khalil, K. Bashir, M. Mosadag, Early detection of chronic kidney disease (ckd) using machine learning algorithms, East Journal of Computer Science 1 (2025) 1–9

  44. [44]

    Iftikhar, M

    H. Iftikhar, M. Khan, Z. Khan, F. Khan, H. M. Alshanbari, Z. Ahmad, A comparative analysis of machine learning models: a case study in predicting chronic kidney disease, Sustainability 15 (2023) 2754

  45. [45]

    Iftikhar, A

    H. Iftikhar, A. F. Hashem, M. Qureshi, P. C. Rodrigues, Clinical application of machine learning models for early-stage chronic kidney disease detection, Diagnostics 15 (2025) 2610

  46. [46]

    Iftikhar, A

    H. Iftikhar, A. F. Hashem, L. A. Mohamud, A. Al-Moisheer, R. I. G. Medina, J. L. L´opez-Gonzales, An intelligent ensemble machine learning model for early detection of chronic kidney disease in aging populations, Scientific Reports (2026)

  47. [47]

    Metherall, A

    B. Metherall, A. K. Berryman, G. S. Brennan, Machine learning for classifying chronic kidney disease and predicting creatinine levels using at-home measurements, Scientific Reports 15 (2025) 4364

  48. [48]

    S. S. Natarajan, E. Balasubramanian, B. K. Raghupathy, M. Ganesan, Using machine learning models with elephant herd feature selection method for diagnosing chronic kidney disease, Computational Journal of Mathematical and Statistical Sciences (2025)

  49. [49]

    M. F. Hossain, S. T. Diya, R. Khan, Acd-ml: Advanced ckd detection using machine learning: A tri-phase ensemble and multi-layered stacking and blending approach, Computer Methods and Programs in Biomedicine Update 7 (2025) 100173

  50. [50]

    M. M. Rahman, M. Al-Amin, J. Hossain, Machine learning models for chronic kidney disease diagnosis and prediction, Biomedical Signal Processing and Control 87 (2024) 105368

  51. [51]

    N. Md. Ashafuddula, B. Islam, R. Islam, An intelligent diagnostic system to analyze early-stage chronic kidney disease for clinical applica- tion, Applied Computational Intelligence and Soft Computing 2023 (2023) 3140270. xxx

  52. [52]

    M. A. Islam, S. Akter, Risk Factor Prediction of Chronic Kidney Disease, UCI Machine Learning Repository, 2020. doi:10.24432/C5WP64

  53. [53]

    Al-Shamsi, D

    S. Al-Shamsi, D. Regmi, R. D. Govender, Chronic kidney disease in patients at high risk of cardiovascular disease in the united arab emirates: A population-based study, PLOS ONE 13 (2018) 1–12. doi:10.1371/journal.pone.0199920

  54. [54]

    L. A. Inker, B. C. Astor, C. H. Fox, T. Isakova, J. P. Lash, C. A. Peralta, M. K. Tamura, H. I. Feldman, Kdoqi us commentary on the 2012 kdigo clinical practice guideline for the evaluation and management of ckd, American Journal of Kidney Diseases 63 (2014) 713–735. doi:10.1053/j.ajkd.2014.01.416

  55. [55]

    doi:10.18637/jss.v045.i03 , abstract =

    S. van Buuren, K. Groothuis-Oudshoorn, mice: Multivariate imputation by chained equations in r, Journal of Statistical Software 45 (2011) 1–67. doi:10.18637/jss.v045.i03

  56. [56]

    Adhikari, W

    D. Adhikari, W. Jiang, J. Zhan, Z. He, D. B. Rawat, U. Aickelin, H. A. Khorshidi, A comprehensive survey on imputation of missing data in internet of things, ACM Computing Surveys 55 (2022) 1–38. doi:10.1145/3533381

  57. [57]

    Pedregosa, G

    F. Pedregosa, G. Varoquaux, A. Gramfort, V . Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V . Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, ´E. Duchesnay, Scikit-learn: Machine learning in python, Journal of Machine Learning Research 12 (2011) 2825–2830

  58. [58]

    H. B. Mann, D. R. Whitney, On a test of whether one of two random variables is stochastically larger than the other, The Annals of Mathematical Statistics 18 (1947) 50–60. doi:10.1214/aoms/1177730491

  59. [59]

    Tibshirani, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society: Series B 58 (1996) 267–288

    R. Tibshirani, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society: Series B 58 (1996) 267–288

  60. [60]

    Guyon, J

    I. Guyon, J. Weston, S. Barnhill, V . Vapnik, Gene selection for cancer classification using support vector machines, in: Machine Learning, MIT Press, 2002, pp. 389–422

  61. [61]

    D. W. Hosmer, S. Lemeshow, R. X. Sturdivant, Applied Logistic Regression, John Wiley & Sons, 2013

  62. [62]

    Breiman, J

    L. Breiman, J. Friedman, R. Olshen, C. Stone, Classification and Regression Trees, Wadsworth International Group, 1984

  63. [63]

    Machine Learning 45(1), 5–32 (Oct 2001)

    L. Breiman, Random forests, Machine Learning 45 (2001) 5–32. doi:10.1023/A:1010933404324

  64. [64]

    J. H. Friedman, Greedy function approximation: A gradient boosting machine, Annals of Statistics 29 (2001) 1189–1232

  65. [65]

    Freund, R

    Y . Freund, R. E. Schapire, A decision-theoretic generalization of on-line learning and an application to boosting, Journal of Computer and System Sciences 55 (1997) 119–139

  66. [66]

    Geurts, D

    P. Geurts, D. Ernst, L. Wehenkel, Extremely randomized trees, Machine Learning 63 (2006) 3–42

  67. [67]

    T. Chen, C. Guestrin, Xgboost: A scalable tree boosting system, in: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016, pp. 785–794. doi:10.1145/2939672.2939785

  68. [68]

    A. V . Dorogush, V . Ershov, A. Gulin, Catboost: Gradient boosting with categorical features support, 2018.arXiv:1810.11363

  69. [69]

    Cover, P

    T. Cover, P. Hart, Nearest neighbor pattern classification, IEEE Transactions on Information Theory 13 (1967) 21–27

  70. [70]

    Cortes, V

    C. Cortes, V . Vapnik, Support vector networks, Machine Learning 20 (1995) 273–297

  71. [71]

    G. Ke, Q. Meng, T. Finley, T. Wang, W. Chen, W. Ma, Q. Ye, T.-Y . Liu, Lightgbm: A highly efficient gradient boosting decision tree, in: Advances in Neural Information Processing Systems, 2017, pp. 3149–3157

  72. [72]

    D. E. Rumelhart, G. E. Hinton, R. J. Williams, Learning representations by back-propagating errors, Nature 323 (1986) 533–536

  73. [73]

    R. Kohavi, A study of cross-validation and bootstrap for accuracy estimation and model selection, in: Proceedings of the 14th International Joint Conference on Artificial Intelligence, volume 2, Morgan Kaufmann Publishers, 1995, pp. 1137–1143

  74. [74]

    Robust anomaly detection for multivariate time series through stochastic recurrent neural network,

    T. Akiba, S. Sano, T. Yanase, T. Ohta, M. Koyama, Optuna: A next-generation hyperparameter optimization framework, in: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2019, pp. 2623–2631. doi:10.1145/3292500. 3330701

  75. [75]

    S. M. Lundberg, S.-I. Lee, A unified approach to interpreting model predictions, in: Advances in Neural Information Processing Systems, volume 30, 2017, pp. 4765–4774. xxxi