Community-Based Early-Stage Chronic Kidney Disease Screening using Explainable Machine Learning for Low-Resource Settings

Dewan Tasnia Azad; Mohammad Habibur Rahman Sarker; Muhammad Ashad Kabir; Saleh Mohammed Ikram; Sirajam Munira; Syed Manzoor Ahmed Hanifi

arxiv: 2601.01119 · v2 · submitted 2026-01-03 · 💻 cs.LG

Community-Based Early-Stage Chronic Kidney Disease Screening using Explainable Machine Learning for Low-Resource Settings

Muhammad Ashad Kabir , Sirajam Munira , Dewan Tasnia Azad , Saleh Mohammed Ikram , Mohammad Habibur Rahman Sarker , Syed Manzoor Ahmed Hanifi This is my paper

Pith reviewed 2026-05-16 18:10 UTC · model grok-4.3

classification 💻 cs.LG

keywords chronic kidney diseaseearly detectionmachine learningcommunity screeninglow-resource settingsfeature selectionexplainable AIBangladesh

0 comments

The pith

Machine learning models detect early-stage chronic kidney disease with over 89 percent balanced accuracy using minimal accessible features in low-resource Bangladeshi communities.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper builds an explainable machine learning system to identify early chronic kidney disease before it advances, using data from community settings in Bangladesh where standard tools fall short. Existing scoring methods come from high-income populations and later disease stages, so they miss interactions among local risk factors and demand inputs that are hard to obtain locally. By testing many feature-selection techniques and classifiers, the authors find that a small set of non-laboratory variables already yields 89.23 percent balanced accuracy and often beats models that include full lab results. These models also show higher sensitivity than current screening tools while needing fewer measurements, and they hold up on separate datasets from India, the UAE, and Bangladesh. The result matters because catching the disease early in places with limited dialysis or transplant access can slow progression and reduce long-term health-system costs.

Core claim

An ML model trained on an RFECV-selected feature subset reached 90.40 percent balanced accuracy for early-stage CKD, while a minimal set of non-pathology-test features alone delivered 89.23 percent balanced accuracy and frequently outperformed larger feature collections. These models exceeded the accuracy and sensitivity of established CKD screening tools while using fewer and more readily available inputs. External validation on independent datasets from India, the UAE, and Bangladesh produced sensitivities between 78 percent and 98 percent.

What carries the argument

The explainable ML framework that applies ten complementary feature-selection methods to identify robust predictor subsets, evaluates twelve classifiers with nested cross-validation, and uses SHAP values to interpret predictions.

Load-bearing premise

The community-based CKD dataset from Bangladesh accurately represents the target population's risk profiles and the selected minimal features remain consistently measurable without major error or bias across varied low-resource settings.

What would settle it

A new community-collected dataset from a different low-resource South Asian region in which the minimal non-pathology model drops below 70 percent balanced accuracy would falsify the claim of strong generalizability.

Figures

Figures reproduced from arXiv: 2601.01119 by Dewan Tasnia Azad, Mohammad Habibur Rahman Sarker, Muhammad Ashad Kabir, Saleh Mohammed Ikram, Sirajam Munira, Syed Manzoor Ahmed Hanifi.

**Figure 1.** Figure 1: A schematic overview of our methodology 3.1. Datasets The dataset used in this study originates from a community-based CKD screening conducted in the Mirzapur sub-district of Tangail, Bangladesh, a rural and peri-urban region covered by the Mirzapur demographic surveillance system (DSS). Adults aged ≥ 18 years with at least five years of residency were selected using age-stratified random sampling, yieldin… view at source ↗

**Figure 2.** Figure 2: Performance comparison of machine learning models trained on three di [PITH_FULL_IMAGE:figures/full_fig_p018_2.png] view at source ↗

**Figure 3.** Figure 3: Confusion matrices illustrating the prediction results for CKD and non-CKD cases using three feature configurations: (a) the full feature [PITH_FULL_IMAGE:figures/full_fig_p019_3.png] view at source ↗

**Figure 4.** Figure 4: SHAP summary plot illustrating the contribution of the best-performing S1 feature set to the Decision Tree model’s predictions for CKD [PITH_FULL_IMAGE:figures/full_fig_p022_4.png] view at source ↗

**Figure 5.** Figure 5: SHAP waterfall plot for a correctly classified (a) CKD and (b) Non-CKD case, illustrating the feature values contributing to the model [PITH_FULL_IMAGE:figures/full_fig_p022_5.png] view at source ↗

read the original abstract

Early detection of chronic kidney disease (CKD) is essential for preventing progression to end-stage renal disease. However, existing screening tools - primarily developed using populations from high-income countries - often underperform in Bangladesh and South Asia, where risk profiles differ. Most of these tools rely on simple additive scoring functions and are based on data from patients with advanced-stage CKD. Consequently, they fail to capture complex interactions among risk factors and are limited in predicting early-stage CKD. Our objective was to develop and evaluate an explainable machine learning (ML) framework for community-based early-stage CKD screening for low-resource settings, tailored to the Bangladeshi and South Asian population context. A community-based CKD dataset from Bangladesh was used to develop predictive models. Variables were organized into clinically meaningful feature groups, and ten complementary feature selection methods were applied to identify robust predictor subsets. Twelve ML classifiers were evaluated using nested cross-validation. Model performance was benchmarked against established CKD screening tools and externally validated on three independent datasets from India, the UAE, and Bangladesh. SHAP was used to interpret model predictions. An ML model trained on an RFECV-selected feature subset achieved a balanced accuracy of 90.40%, whereas minimal non-pathology-test features demonstrated excellent predictive capability with a balanced accuracy of 89.23%, often outperforming larger or full feature sets. Compared with existing screening tools, the proposed models achieved substantially higher accuracy and sensitivity while requiring fewer and more accessible inputs. External validation confirmed strong generalizability with 78% to 98% sensitivity.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Standard ML on a Bangladeshi community cohort delivers usable early CKD prediction with minimal non-lab features and external checks, but the high numbers rest on thin method details that need verification.

read the letter

The paper's main contribution is showing that off-the-shelf classifiers plus RFECV feature selection can reach 89-90% balanced accuracy on early-stage CKD using mostly accessible variables from a Bangladeshi sample, and that this holds up reasonably on three external sets from India, UAE, and Bangladesh. The practical angle is the minimal feature set performing almost as well as fuller ones, which matters for places without routine labs. They also run SHAP and compare against existing additive scores, which is the right way to frame the work for low-resource screening. That combination of population focus and external validation is what makes it worth looking at rather than another generic CKD model. What they do well is the nested cross-validation, the multiple selection methods, and the explicit check against current tools. The external sensitivity range of 78-98% is better than nothing, and the emphasis on early detection rather than advanced cases aligns with the stated goal. The soft spots are the missing pieces that make the numbers hard to evaluate fully. The abstract gives no information on missing-data handling, class imbalance correction, or how confounders like age and hypertension were distributed across sites. External validation reports only a sensitivity band without balanced accuracy, specificity, or calibration plots, so it is unclear whether performance drops on other metrics or whether feature definitions matched exactly across countries. Feature selection on the same data introduces the usual optimism risk even with nesting. These are fixable but they matter for a claim of superiority in new settings. This is for applied ML groups working on global health screening or for public-health researchers who need concrete feature lists for community programs. A reader who wants to adapt the minimal set to their own data would get something usable. It deserves peer review because the application is real and the external validation exists, even if the methods section will need expansion and the metrics will need fuller reporting before publication.

Referee Report

3 major / 2 minor

Summary. The paper claims to develop an explainable ML framework for community-based early-stage CKD screening tailored to low-resource Bangladeshi/South Asian settings. Using a Bangladesh community dataset, it organizes variables into feature groups, applies ten feature selection methods including RFECV, evaluates twelve classifiers via nested cross-validation, reports balanced accuracies of 90.40% (RFECV subset) and 89.23% (minimal non-pathology features), shows these outperform existing additive screening tools in accuracy/sensitivity while using fewer accessible inputs, provides SHAP interpretations, and externally validates on three independent datasets (India, UAE, Bangladesh) with 78-98% sensitivity.

Significance. If the performance and generalizability claims hold after addressing missing details, the work would be significant for low-resource CKD screening: it demonstrates that minimal non-lab features can achieve near-full performance, provides SHAP-based explanations for clinical trust, and reports external validation across sites. This directly addresses the documented underperformance of high-income-country tools in South Asia and offers a practical, data-driven alternative to simple scoring systems with potential for community deployment.

major comments (3)

[Abstract/Methods] Abstract and Methods: the reported balanced accuracies (90.40% RFECV, 89.23% minimal features) and superiority claims rest on unspecified data preprocessing, class-imbalance handling, and confounding-variable controls; without these, the nested-CV results cannot be fully evaluated for robustness.
[Results] Results/External validation: only a sensitivity range (78-98%) is provided for the three independent datasets; full metrics (balanced accuracy, specificity, PPV) and explicit confirmation that the identical minimal feature set was measured consistently across sites are required to support the generalizability claim.
[Methods] Methods: RFECV feature selection performed on the full training data before nested CV introduces data-driven dependency; the manuscript must clarify how selection was isolated from final evaluation to avoid over-optimism in the reported outperformance versus full feature sets.

minor comments (2)

[Methods] Clarify the exact definition of 'early-stage CKD' labels and any exclusion criteria in the Bangladesh dataset to aid reproducibility.
[Results] Figure legends and SHAP plots would benefit from explicit mapping of feature names to clinical variables for non-ML readers.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their thoughtful and constructive review. We have carefully addressed each major comment by expanding methodological details, providing additional performance metrics, and clarifying the nested cross-validation procedure. These revisions will improve transparency and strengthen the manuscript without altering our core findings.

read point-by-point responses

Referee: [Abstract/Methods] Abstract and Methods: the reported balanced accuracies (90.40% RFECV, 89.23% minimal features) and superiority claims rest on unspecified data preprocessing, class-imbalance handling, and confounding-variable controls; without these, the nested-CV results cannot be fully evaluated for robustness.

Authors: We agree that greater detail is required for reproducibility. In the revised Methods section, we will explicitly describe all preprocessing steps (missing-value imputation via median/mode, z-score normalization), class-imbalance handling (stratified k-fold splits plus class-weighting in all classifiers), and confounder controls (age- and sex-stratified folds plus sensitivity analyses excluding hypertension/diabetes). These additions will allow full evaluation of the nested-CV robustness. revision: yes
Referee: [Results] Results/External validation: only a sensitivity range (78-98%) is provided for the three independent datasets; full metrics (balanced accuracy, specificity, PPV) and explicit confirmation that the identical minimal feature set was measured consistently across sites are required to support the generalizability claim.

Authors: We will add a new supplementary table reporting balanced accuracy, specificity, PPV, and NPV for each external dataset individually. The identical minimal non-pathology feature set (age, sex, BMI, hypertension, diabetes, family history, lifestyle variables) was applied uniformly across all sites; we will state this explicitly in the revised Results and Methods to support the generalizability claim. revision: yes
Referee: [Methods] Methods: RFECV feature selection performed on the full training data before nested CV introduces data-driven dependency; the manuscript must clarify how selection was isolated from final evaluation to avoid over-optimism in the reported outperformance versus full feature sets.

Authors: We acknowledge the risk of over-optimism. RFECV was executed inside the inner loop of nested CV on each training fold only, with the outer test fold held completely out; the selected features were then used solely for final evaluation on the outer fold. We will insert a detailed description plus pseudocode in the revised Methods to make this isolation explicit and confirm that reported performance reflects truly unseen data. revision: yes

Circularity Check

0 steps flagged

No significant circularity; performance claims rest on nested CV and external validation

full rationale

The paper applies ten feature selection methods including RFECV then evaluates twelve classifiers via nested cross-validation, with additional benchmarking against existing tools and external validation on three independent datasets (India, UAE, Bangladesh). This structure separates feature selection from final performance estimation, preventing the reported balanced accuracies (90.40% and 89.23%) from reducing to the training inputs by construction. No self-citations, uniqueness theorems, or ansatz smuggling appear in the derivation; the results are empirical measurements on held-out and external data rather than definitional or fitted-input predictions.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The central claim rests on the representativeness of the single community dataset and the assumption that ML performance on held-out and external data will translate to real-world screening utility; no new physical entities are introduced.

free parameters (2)

ML model hyperparameters
Standard tuning during nested cross-validation for the twelve classifiers; values not reported in abstract.
Feature selection thresholds
RFECV and ten other methods involve implicit cutoffs chosen to optimize performance on the training data.

axioms (1)

domain assumption The collected community variables accurately reflect true risk factor distributions for early-stage CKD in the target population.
Invoked when claiming generalizability from the Bangladesh dataset to South Asia and external sites.

pith-pipeline@v0.9.0 · 5611 in / 1433 out tokens · 45598 ms · 2026-05-16T18:10:37.788355+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

75 extracted references · 75 canonical work pages · 1 internal anchor

[1]

K. J. Jager, C. Kovesdy, R. Langham, M. Rosenberg, V . Jha, C. Zoccali, A single number for advocacy and communication—worldwide more than 850 million individuals have kidney diseases, Nephrology Dialysis Transplantation 34 (2019) 1803–1805. doi:10.1093/ndt/gfz174

work page doi:10.1093/ndt/gfz174 2019
[2]

N. R. Hill, S. T. Fatoba, J. L. Oke, J. A. Hirst, C. A. O’Callaghan, D. S. Lasserson, F. R. Hobbs, Global prevalence of chronic kidney disease–a systematic review and meta-analysis, PloS one 11 (2016) e0158765. doi:10.1371/journal.pone.0158765

work page doi:10.1371/journal.pone.0158765 2016
[3]

Bikbov, C

B. Bikbov, C. A. Purcell, A. S. Levey, M. Smith, A. Abdoli, M. Abebe, C. J. Murray, Others, Global, regional, and national burden of chronic kidney disease, 1990–2017: a systematic analysis, The Lancet 395 (2020) 709–733. doi:10.1016/S0140-6736(20)30045-3

work page doi:10.1016/s0140-6736(20)30045-3 1990
[4]

Abraham, S

G. Abraham, S. Varughese, T. Thandavan, A. Iyengar, E. Fernando, S. J. Naqvi, R. Sheriff, H. Ur-Rashid, N. Gopalakrishnan, R. K. Kafle, Chronic kidney disease hotspots in developing countries in south asia, Clinical kidney journal 9 (2016) 135–141. doi:10.1093/ckj/sfv109

work page doi:10.1093/ckj/sfv109 2016
[5]

Anand, M

S. Anand, M. A. Khanam, J. Saquib, N. Saquib, T. Ahmed, D. S. Alam, M. R. Cullen, M. Barry, G. M. Chertow, High prevalence of chronic kidney disease in a community survey of urban bangladeshis: a cross-sectional study, Globalization and health 10 (2014) 9. doi:10.1186/ 1744-8603-10-9

work page 2014
[6]

M. H. R. Sarker, M. Moriyama, H. U. Rashid, M. J. Chisti, M. M. Rahman, S. K. Das, A. Uddin, S. K. Saha, S. E. Arifeen, T. Ahmed, et al., Community-based screening to determine the prevalence, health and nutritional status of patients with ckd in rural and peri-urban bangladesh, Therapeutic advances in chronic disease 12 (2021) 20406223211035281. doi:10.1...

work page doi:10.1177/20406223211035281 2021
[7]

Banik, A

S. Banik, A. Ghosh, Prevalence of chronic kidney disease in bangladesh: a systematic review and meta-analysis, International urology and nephrology 53 (2021) 713–718. doi:10.1007/s11255-020-02597-6

work page doi:10.1007/s11255-020-02597-6 2021
[8]

Masset, R

M. Iqbal, R. Hossain, K. Hossain, M. Faroque, S. Islam, S. Iqbal, M. Chowdhury, Knowledge, attitude, and perception about renal transplantation of ckd patients, caregivers, and general population, Transplantation Proceedings 50 (2018) 2323–2326. doi:10.1016/j. transproceed.2018.04.048

work page doi:10.1016/j 2018
[9]

Pollock, J.-y

C. Pollock, J.-y. Moon, P. Gojaseni, C. H. Ching, L. Gomez, T. M. Chan, M.-J. Wu, S. C. Yeo, P. Nugroho, A. K. Bhalla, et al., Framework of guidelines for management of ckd in asia, Kidney International Reports 9 (2024) 752–790. doi:10.1016/j.ekir.2023.12.010

work page doi:10.1016/j.ekir.2023.12.010 2024
[10]

L. C. Plantinga, L. E. Boulware, J. Coresh, L. A. Stevens, E. R. Miller, R. Saran, K. L. Messer, A. S. Levey, N. R. Powe, Patient awareness of chronic kidney disease: trends and predictors, Archives of internal medicine 168 (2008) 2268–2275. doi:10.1001/archinte.168.20. xxviii 2268

work page doi:10.1001/archinte.168.20 2008
[11]

A. S. Levey, J. Coresh, Chronic kidney disease, The Lancet 399 (2022) 129–144. doi:10.1016/S0140-6736(21)00519-5

work page doi:10.1016/s0140-6736(21)00519-5 2022
[12]

V . A. Luyckx, M. Tonelli, J. W. Stanifer, The global burden of kidney disease and the sustainable development goals, Bulletin of the World Health Organization 96 (2021) 414–422. doi:10.2471/BLT.17.206441

work page doi:10.2471/blt.17.206441 2021
[13]

Niang, A

A. Niang, A. Iyengar, V . A. Luyckx, Hemodialysis versus peritoneal dialysis in resource-limited settings, Current opinion in nephrology and hypertension 27 (2018) 463–471. doi:10.1097/MNH.0000000000000455

work page doi:10.1097/mnh.0000000000000455 2018
[14]

Levin, S

A. Levin, S. B. Ahmed, J. J. Carrero, B. Foster, A. Francis, R. K. Hall, W. G. Herrington, G. Hill, L. A. Inker, R. Kazancıo ˘glu, et al., Executive summary of the kdigo 2024 clinical practice guideline for the evaluation and management of chronic kidney disease: known knowns and known unknowns, Kidney international 105 (2024) 684–701. doi:10.1016/j.kint....

work page doi:10.1016/j.kint.2023.10.016 2024
[15]

J. W. Stanifer, A. Muiru, T. H. Jafar, U. D. Patel, Chronic kidney disease in low- and middle-income countries, Nephrology Dialysis Transplantation 31 (2016) 868–874. doi:10.1093/ndt/gfv466

work page doi:10.1093/ndt/gfv466 2016
[16]

Kabir, M

A. Kabir, M. N. Karim, B. Billah, The capacity of primary healthcare facilities in bangladesh to prevent and control non-communicable diseases, BMC Primary Care 24 (2023) 60. doi:10.1186/s12875-023-02016-6

work page doi:10.1186/s12875-023-02016-6 2023
[17]

Z. Zeba, K. Fatema, A. F. Sumit, R. Zinnat, L. Ali, Early screening of chronic kidney disease patients among the asymptomatic adult population in bangladesh, Journal of Preventive Epidemiology 5 (2020) e10–e10. doi:10.34172/jpe.2020.10

work page doi:10.34172/jpe.2020.10 2020
[18]

T. H. Jafar, C. Ramakrishnan, O. John, A. Tewari, B. Cobb, H. Legido-Quigley, Y . Sungwon, V . Jha, Access to ckd care in rural communities of india: a qualitative study exploring the barriers and potential facilitators, BMC nephrology 21 (2020) 26. doi:10.1186/s12882-020- 1702-6

work page doi:10.1186/s12882-020- 2020
[19]

C. M. J. Nazar, T. B. Kindratt, S. M. A. Ahmad, M. Ahmed, J. Anderson, Barriers to the successful practice of chronic kidney diseases at the primary health care level; a systematic review, Journal of renal injury prevention 3 (2014) 61. doi:10.12861/jrip.2014.20

work page doi:10.12861/jrip.2014.20 2014
[20]

L. S. Kahn, B. M. Vest, N. Madurai, R. Singh, T. R. York, C. W. Cipparone, S. Reilly, K. S. Malik, C. H. Fox, Chronic kidney disease (ckd) treatment burden among low-income primary care patients, Chronic illness 11 (2015) 171–183. doi:10.1177/1742395314559751

work page doi:10.1177/1742395314559751 2015
[21]

H. Bang, S. Vupputuri, D. A. Shoham, P. J. Klemmer, R. J. Falk, M. Mazumdar, D. Gipson, R. E. Colindres, A. V . Kshirsagar, Screening for occult renal disease (scored): a simple prediction model for chronic kidney disease, Archives of internal medicine 167 (2007) 374–381. doi:10.1001/archinte.167.4.374

work page doi:10.1001/archinte.167.4.374 2007
[22]

Stolpe, B

S. Stolpe, B. Kowall, D. Zwanziger, M. Frank, K.-H. Joeckel, R. Erbel, A. Stang, External validation of six clinical models for prediction of chronic kidney disease in a german population, BMC nephrology 23 (2022) 272. doi:10.1186/s12882-022-02899-0

work page doi:10.1186/s12882-022-02899-0 2022
[23]

A. V . Kshirsagar, H. Bang, A. S. Bomback, S. Vupputuri, D. A. Shoham, L. M. Kern, P. J. Klemmer, M. Mazumdar, P. A. August, A simple algorithm to predict incident kidney disease, Archives of internal medicine 168 (2008) 2466–2473. doi:10.1001/archinte.168.22.2466

work page doi:10.1001/archinte.168.22.2466 2008
[24]

Thakkinstian, A

A. Thakkinstian, A. Ingsathit, A. Chaiprasert, S. Rattanasiri, P. Sangthawan, P. Gojaseni, K. Kiattisunthorn, L. Ongaiyooth, P. Thirakhupt, A simplified clinical prediction score of chronic kidney disease: a cross-sectional-survey study, BMC nephrology 12 (2011) 45. doi:10.1186/ 1471-2369-12-45

work page 2011
[25]

Kearns, H

B. Kearns, H. Gallagher, S. de Lusignan, Predicting the prevalence of chronic kidney disease in the english population: a cross-sectional study, BMC nephrology 14 (2013) 49. doi:10.1186/1471-2369-14-49

work page doi:10.1186/1471-2369-14-49 2013
[26]

K.-S. Kwon, H. Bang, A. S. Bomback, D.-h. Koh, J.-H. Yum, J.-H. Lee, S. Lee, S. K. Park, K.-Y . Yoo, S. K. Park, et al., A simple prediction score for kidney disease in the korean population, Nephrology 17 (2012) 278–284. doi:10.1111/j.1440-1797.2011.01552.x

work page doi:10.1111/j.1440-1797.2011.01552.x 2012
[27]

Sanmarchi, C

F. Sanmarchi, C. Fanconi, D. Golinelli, D. Gori, T. Hernandez-Boussard, A. Capodici, Predict, diagnose, and treat chronic kidney disease with machine learning: a systematic literature review, Journal of nephrology 36 (2023) 1101–1117

work page 2023
[28]

Delrue, S

C. Delrue, S. De Bruyne, M. M. Speeckaert, Application of machine learning in chronic kidney disease: current status and future prospects, Biomedicines 12 (2024) 568. doi:10.3390/biomedicines12030568

work page doi:10.3390/biomedicines12030568 2024
[29]

Gogoi, J

P. Gogoi, J. A. Valan, Machine learning approaches for predicting and diagnosing chronic kidney disease: current trends, challenges, solutions, and future directions, International Urology and Nephrology 57 (2025) 1245–1268

work page 2025
[30]

Sabanayagam, R

C. Sabanayagam, R. Banu, C. Lim, Y . C. Tham, C.-Y . Cheng, G. Tan, E. Ekinci, B. Sheng, G. McKay, J. E. Shaw, et al., Artificial intelligence xxix in chronic kidney disease management: a scoping review, Theranostics 15 (2025) 4566

work page 2025
[31]

A. Khan, Y . Tayyebi, A comprehensive analysis to detect chronic kidney disease and stage prediction: Using machine learning, Asian Journal of Research in Computer Science 18 (2025) 151–162

work page 2025
[32]

Rubini, P

L. Rubini, P. Soundarapandian, P. Eswaran, Chronic Kidney Disease, UCI Machine Learning Repository, 2015. doi:10.24432/C5G020

work page doi:10.24432/c5g020 2015
[33]

Dharmarathne, M

G. Dharmarathne, M. Bogahawaththa, M. McAfee, U. Rathnayake, D. Meddage, On the diagnosis of chronic kidney disease using a machine learning-based interface with explainable artificial intelligence, Intelligent Systems with Applications 22 (2024) 200397

work page 2024
[34]

M. A. Islam, M. Z. H. Majumder, M. A. Hussein, Chronic kidney disease prediction based on machine learning algorithms, Journal of pathology informatics 14 (2023) 100189

work page 2023
[35]

Pujitha, N

K. Pujitha, N. B. Soni, L. F. Eram, P. N. Sai, S. Divija, R. S. Supriya, Chronic kidney disease detection using machine learning approach, in: 2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN), IEEE, 2023, pp. 1–5

work page 2023
[36]

Gogoi, J

P. Gogoi, J. A. Valan, Chronic kidney disease prediction using machine learning techniques: a comparative study of feature selection methods with smote and shap, Multiscale and Multidisciplinary Modeling, Experiments and Design 8 (2025) 1–23

work page 2025
[37]

M. H. I. Bijoy, M. J. Mia, M. M. Rahman, M. S. Arefin, P. K. Dhar, T. Shimamura, A robot process automation based mobile application for early prediction of chronic kidney disease using machine learning, Discover Applied Sciences 7 (2025) 528

work page 2025
[38]

K. T. Jawad, A. Verma, F. Amsaad, L. Ashraf, A study on the application of explainable ai on ensemble models for predictive analysis of chronic kidney disease, IEEE Access (2025)

work page 2025
[39]

G. U. Nneji, H. N. Monday, V . S. R. Pathapati, S. Nahar, G. T. Mgbejime, E. S. Umana, M. A. Hossin, Ffs-iml: fusion-based statistical feature selection for machine learning-driven interpretability of chronic kidney disease, International Journal of Machine Learning and Cybernetics (2025) 1–34

work page 2025
[40]

D. A. Debal, T. M. Sitote, Chronic kidney disease prediction using machine learning techniques, Journal of Big Data 9 (2022) 109

work page 2022
[41]

Zheng, X

J.-X. Zheng, X. Li, J. Zhu, S.-Y . Guan, S.-X. Zhang, W.-M. Wang, Interpretable machine learning for predicting chronic kidney disease progression risk, Digital Health 10 (2024) 20552076231224225

work page 2024
[42]

S. K. Ghosh, A. H. Khandoker, Investigation on explainable machine learning models to predict chronic kidney diseases, Scientific Reports 14 (2024) 3687

work page 2024
[43]

Khalil, K

W. Khalil, K. Bashir, M. Mosadag, Early detection of chronic kidney disease (ckd) using machine learning algorithms, East Journal of Computer Science 1 (2025) 1–9

work page 2025
[44]

Iftikhar, M

H. Iftikhar, M. Khan, Z. Khan, F. Khan, H. M. Alshanbari, Z. Ahmad, A comparative analysis of machine learning models: a case study in predicting chronic kidney disease, Sustainability 15 (2023) 2754

work page 2023
[45]

Iftikhar, A

H. Iftikhar, A. F. Hashem, M. Qureshi, P. C. Rodrigues, Clinical application of machine learning models for early-stage chronic kidney disease detection, Diagnostics 15 (2025) 2610

work page 2025
[46]

Iftikhar, A

H. Iftikhar, A. F. Hashem, L. A. Mohamud, A. Al-Moisheer, R. I. G. Medina, J. L. L´opez-Gonzales, An intelligent ensemble machine learning model for early detection of chronic kidney disease in aging populations, Scientific Reports (2026)

work page 2026
[47]

Metherall, A

B. Metherall, A. K. Berryman, G. S. Brennan, Machine learning for classifying chronic kidney disease and predicting creatinine levels using at-home measurements, Scientific Reports 15 (2025) 4364

work page 2025
[48]

S. S. Natarajan, E. Balasubramanian, B. K. Raghupathy, M. Ganesan, Using machine learning models with elephant herd feature selection method for diagnosing chronic kidney disease, Computational Journal of Mathematical and Statistical Sciences (2025)

work page 2025
[49]

M. F. Hossain, S. T. Diya, R. Khan, Acd-ml: Advanced ckd detection using machine learning: A tri-phase ensemble and multi-layered stacking and blending approach, Computer Methods and Programs in Biomedicine Update 7 (2025) 100173

work page 2025
[50]

M. M. Rahman, M. Al-Amin, J. Hossain, Machine learning models for chronic kidney disease diagnosis and prediction, Biomedical Signal Processing and Control 87 (2024) 105368

work page 2024
[51]

N. Md. Ashafuddula, B. Islam, R. Islam, An intelligent diagnostic system to analyze early-stage chronic kidney disease for clinical applica- tion, Applied Computational Intelligence and Soft Computing 2023 (2023) 3140270. xxx

work page 2023
[52]

M. A. Islam, S. Akter, Risk Factor Prediction of Chronic Kidney Disease, UCI Machine Learning Repository, 2020. doi:10.24432/C5WP64

work page doi:10.24432/c5wp64 2020
[53]

Al-Shamsi, D

S. Al-Shamsi, D. Regmi, R. D. Govender, Chronic kidney disease in patients at high risk of cardiovascular disease in the united arab emirates: A population-based study, PLOS ONE 13 (2018) 1–12. doi:10.1371/journal.pone.0199920

work page doi:10.1371/journal.pone.0199920 2018
[54]

L. A. Inker, B. C. Astor, C. H. Fox, T. Isakova, J. P. Lash, C. A. Peralta, M. K. Tamura, H. I. Feldman, Kdoqi us commentary on the 2012 kdigo clinical practice guideline for the evaluation and management of ckd, American Journal of Kidney Diseases 63 (2014) 713–735. doi:10.1053/j.ajkd.2014.01.416

work page doi:10.1053/j.ajkd.2014.01.416 2012
[55]

doi:10.18637/jss.v045.i03 , abstract =

S. van Buuren, K. Groothuis-Oudshoorn, mice: Multivariate imputation by chained equations in r, Journal of Statistical Software 45 (2011) 1–67. doi:10.18637/jss.v045.i03

work page doi:10.18637/jss.v045.i03 2011
[56]

Adhikari, W

D. Adhikari, W. Jiang, J. Zhan, Z. He, D. B. Rawat, U. Aickelin, H. A. Khorshidi, A comprehensive survey on imputation of missing data in internet of things, ACM Computing Surveys 55 (2022) 1–38. doi:10.1145/3533381

work page doi:10.1145/3533381 2022
[57]

Pedregosa, G

F. Pedregosa, G. Varoquaux, A. Gramfort, V . Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V . Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, ´E. Duchesnay, Scikit-learn: Machine learning in python, Journal of Machine Learning Research 12 (2011) 2825–2830

work page 2011
[58]

H. B. Mann, D. R. Whitney, On a test of whether one of two random variables is stochastically larger than the other, The Annals of Mathematical Statistics 18 (1947) 50–60. doi:10.1214/aoms/1177730491

work page doi:10.1214/aoms/1177730491 1947
[59]

Tibshirani, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society: Series B 58 (1996) 267–288

R. Tibshirani, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society: Series B 58 (1996) 267–288

work page 1996
[60]

Guyon, J

I. Guyon, J. Weston, S. Barnhill, V . Vapnik, Gene selection for cancer classification using support vector machines, in: Machine Learning, MIT Press, 2002, pp. 389–422

work page 2002
[61]

D. W. Hosmer, S. Lemeshow, R. X. Sturdivant, Applied Logistic Regression, John Wiley & Sons, 2013

work page 2013
[62]

Breiman, J

L. Breiman, J. Friedman, R. Olshen, C. Stone, Classification and Regression Trees, Wadsworth International Group, 1984

work page 1984
[63]

Machine Learning 45(1), 5–32 (Oct 2001)

L. Breiman, Random forests, Machine Learning 45 (2001) 5–32. doi:10.1023/A:1010933404324

work page doi:10.1023/a:1010933404324 2001
[64]

J. H. Friedman, Greedy function approximation: A gradient boosting machine, Annals of Statistics 29 (2001) 1189–1232

work page 2001
[65]

Freund, R

Y . Freund, R. E. Schapire, A decision-theoretic generalization of on-line learning and an application to boosting, Journal of Computer and System Sciences 55 (1997) 119–139

work page 1997
[66]

Geurts, D

P. Geurts, D. Ernst, L. Wehenkel, Extremely randomized trees, Machine Learning 63 (2006) 3–42

work page 2006
[67]

T. Chen, C. Guestrin, Xgboost: A scalable tree boosting system, in: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016, pp. 785–794. doi:10.1145/2939672.2939785

work page doi:10.1145/2939672.2939785 2016
[68]

A. V . Dorogush, V . Ershov, A. Gulin, Catboost: Gradient boosting with categorical features support, 2018.arXiv:1810.11363

work page internal anchor Pith review Pith/arXiv arXiv 2018
[69]

Cover, P

T. Cover, P. Hart, Nearest neighbor pattern classification, IEEE Transactions on Information Theory 13 (1967) 21–27

work page 1967
[70]

Cortes, V

C. Cortes, V . Vapnik, Support vector networks, Machine Learning 20 (1995) 273–297

work page 1995
[71]

G. Ke, Q. Meng, T. Finley, T. Wang, W. Chen, W. Ma, Q. Ye, T.-Y . Liu, Lightgbm: A highly efficient gradient boosting decision tree, in: Advances in Neural Information Processing Systems, 2017, pp. 3149–3157

work page 2017
[72]

D. E. Rumelhart, G. E. Hinton, R. J. Williams, Learning representations by back-propagating errors, Nature 323 (1986) 533–536

work page 1986
[73]

R. Kohavi, A study of cross-validation and bootstrap for accuracy estimation and model selection, in: Proceedings of the 14th International Joint Conference on Artificial Intelligence, volume 2, Morgan Kaufmann Publishers, 1995, pp. 1137–1143

work page 1995
[74]

Robust anomaly detection for multivariate time series through stochastic recurrent neural network,

T. Akiba, S. Sano, T. Yanase, T. Ohta, M. Koyama, Optuna: A next-generation hyperparameter optimization framework, in: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2019, pp. 2623–2631. doi:10.1145/3292500. 3330701

work page doi:10.1145/3292500 2019
[75]

S. M. Lundberg, S.-I. Lee, A unified approach to interpreting model predictions, in: Advances in Neural Information Processing Systems, volume 30, 2017, pp. 4765–4774. xxxi

work page 2017

[1] [1]

K. J. Jager, C. Kovesdy, R. Langham, M. Rosenberg, V . Jha, C. Zoccali, A single number for advocacy and communication—worldwide more than 850 million individuals have kidney diseases, Nephrology Dialysis Transplantation 34 (2019) 1803–1805. doi:10.1093/ndt/gfz174

work page doi:10.1093/ndt/gfz174 2019

[2] [2]

N. R. Hill, S. T. Fatoba, J. L. Oke, J. A. Hirst, C. A. O’Callaghan, D. S. Lasserson, F. R. Hobbs, Global prevalence of chronic kidney disease–a systematic review and meta-analysis, PloS one 11 (2016) e0158765. doi:10.1371/journal.pone.0158765

work page doi:10.1371/journal.pone.0158765 2016

[3] [3]

Bikbov, C

B. Bikbov, C. A. Purcell, A. S. Levey, M. Smith, A. Abdoli, M. Abebe, C. J. Murray, Others, Global, regional, and national burden of chronic kidney disease, 1990–2017: a systematic analysis, The Lancet 395 (2020) 709–733. doi:10.1016/S0140-6736(20)30045-3

work page doi:10.1016/s0140-6736(20)30045-3 1990

[4] [4]

Abraham, S

G. Abraham, S. Varughese, T. Thandavan, A. Iyengar, E. Fernando, S. J. Naqvi, R. Sheriff, H. Ur-Rashid, N. Gopalakrishnan, R. K. Kafle, Chronic kidney disease hotspots in developing countries in south asia, Clinical kidney journal 9 (2016) 135–141. doi:10.1093/ckj/sfv109

work page doi:10.1093/ckj/sfv109 2016

[5] [5]

Anand, M

S. Anand, M. A. Khanam, J. Saquib, N. Saquib, T. Ahmed, D. S. Alam, M. R. Cullen, M. Barry, G. M. Chertow, High prevalence of chronic kidney disease in a community survey of urban bangladeshis: a cross-sectional study, Globalization and health 10 (2014) 9. doi:10.1186/ 1744-8603-10-9

work page 2014

[6] [6]

M. H. R. Sarker, M. Moriyama, H. U. Rashid, M. J. Chisti, M. M. Rahman, S. K. Das, A. Uddin, S. K. Saha, S. E. Arifeen, T. Ahmed, et al., Community-based screening to determine the prevalence, health and nutritional status of patients with ckd in rural and peri-urban bangladesh, Therapeutic advances in chronic disease 12 (2021) 20406223211035281. doi:10.1...

work page doi:10.1177/20406223211035281 2021

[7] [7]

Banik, A

S. Banik, A. Ghosh, Prevalence of chronic kidney disease in bangladesh: a systematic review and meta-analysis, International urology and nephrology 53 (2021) 713–718. doi:10.1007/s11255-020-02597-6

work page doi:10.1007/s11255-020-02597-6 2021

[8] [8]

Masset, R

M. Iqbal, R. Hossain, K. Hossain, M. Faroque, S. Islam, S. Iqbal, M. Chowdhury, Knowledge, attitude, and perception about renal transplantation of ckd patients, caregivers, and general population, Transplantation Proceedings 50 (2018) 2323–2326. doi:10.1016/j. transproceed.2018.04.048

work page doi:10.1016/j 2018

[9] [9]

Pollock, J.-y

C. Pollock, J.-y. Moon, P. Gojaseni, C. H. Ching, L. Gomez, T. M. Chan, M.-J. Wu, S. C. Yeo, P. Nugroho, A. K. Bhalla, et al., Framework of guidelines for management of ckd in asia, Kidney International Reports 9 (2024) 752–790. doi:10.1016/j.ekir.2023.12.010

work page doi:10.1016/j.ekir.2023.12.010 2024

[10] [10]

L. C. Plantinga, L. E. Boulware, J. Coresh, L. A. Stevens, E. R. Miller, R. Saran, K. L. Messer, A. S. Levey, N. R. Powe, Patient awareness of chronic kidney disease: trends and predictors, Archives of internal medicine 168 (2008) 2268–2275. doi:10.1001/archinte.168.20. xxviii 2268

work page doi:10.1001/archinte.168.20 2008

[11] [11]

A. S. Levey, J. Coresh, Chronic kidney disease, The Lancet 399 (2022) 129–144. doi:10.1016/S0140-6736(21)00519-5

work page doi:10.1016/s0140-6736(21)00519-5 2022

[12] [12]

V . A. Luyckx, M. Tonelli, J. W. Stanifer, The global burden of kidney disease and the sustainable development goals, Bulletin of the World Health Organization 96 (2021) 414–422. doi:10.2471/BLT.17.206441

work page doi:10.2471/blt.17.206441 2021

[13] [13]

Niang, A

A. Niang, A. Iyengar, V . A. Luyckx, Hemodialysis versus peritoneal dialysis in resource-limited settings, Current opinion in nephrology and hypertension 27 (2018) 463–471. doi:10.1097/MNH.0000000000000455

work page doi:10.1097/mnh.0000000000000455 2018

[14] [14]

Levin, S

A. Levin, S. B. Ahmed, J. J. Carrero, B. Foster, A. Francis, R. K. Hall, W. G. Herrington, G. Hill, L. A. Inker, R. Kazancıo ˘glu, et al., Executive summary of the kdigo 2024 clinical practice guideline for the evaluation and management of chronic kidney disease: known knowns and known unknowns, Kidney international 105 (2024) 684–701. doi:10.1016/j.kint....

work page doi:10.1016/j.kint.2023.10.016 2024

[15] [15]

J. W. Stanifer, A. Muiru, T. H. Jafar, U. D. Patel, Chronic kidney disease in low- and middle-income countries, Nephrology Dialysis Transplantation 31 (2016) 868–874. doi:10.1093/ndt/gfv466

work page doi:10.1093/ndt/gfv466 2016

[16] [16]

Kabir, M

A. Kabir, M. N. Karim, B. Billah, The capacity of primary healthcare facilities in bangladesh to prevent and control non-communicable diseases, BMC Primary Care 24 (2023) 60. doi:10.1186/s12875-023-02016-6

work page doi:10.1186/s12875-023-02016-6 2023

[17] [17]

Z. Zeba, K. Fatema, A. F. Sumit, R. Zinnat, L. Ali, Early screening of chronic kidney disease patients among the asymptomatic adult population in bangladesh, Journal of Preventive Epidemiology 5 (2020) e10–e10. doi:10.34172/jpe.2020.10

work page doi:10.34172/jpe.2020.10 2020

[18] [18]

T. H. Jafar, C. Ramakrishnan, O. John, A. Tewari, B. Cobb, H. Legido-Quigley, Y . Sungwon, V . Jha, Access to ckd care in rural communities of india: a qualitative study exploring the barriers and potential facilitators, BMC nephrology 21 (2020) 26. doi:10.1186/s12882-020- 1702-6

work page doi:10.1186/s12882-020- 2020

[19] [19]

C. M. J. Nazar, T. B. Kindratt, S. M. A. Ahmad, M. Ahmed, J. Anderson, Barriers to the successful practice of chronic kidney diseases at the primary health care level; a systematic review, Journal of renal injury prevention 3 (2014) 61. doi:10.12861/jrip.2014.20

work page doi:10.12861/jrip.2014.20 2014

[20] [20]

L. S. Kahn, B. M. Vest, N. Madurai, R. Singh, T. R. York, C. W. Cipparone, S. Reilly, K. S. Malik, C. H. Fox, Chronic kidney disease (ckd) treatment burden among low-income primary care patients, Chronic illness 11 (2015) 171–183. doi:10.1177/1742395314559751

work page doi:10.1177/1742395314559751 2015

[21] [21]

H. Bang, S. Vupputuri, D. A. Shoham, P. J. Klemmer, R. J. Falk, M. Mazumdar, D. Gipson, R. E. Colindres, A. V . Kshirsagar, Screening for occult renal disease (scored): a simple prediction model for chronic kidney disease, Archives of internal medicine 167 (2007) 374–381. doi:10.1001/archinte.167.4.374

work page doi:10.1001/archinte.167.4.374 2007

[22] [22]

Stolpe, B

S. Stolpe, B. Kowall, D. Zwanziger, M. Frank, K.-H. Joeckel, R. Erbel, A. Stang, External validation of six clinical models for prediction of chronic kidney disease in a german population, BMC nephrology 23 (2022) 272. doi:10.1186/s12882-022-02899-0

work page doi:10.1186/s12882-022-02899-0 2022

[23] [23]

A. V . Kshirsagar, H. Bang, A. S. Bomback, S. Vupputuri, D. A. Shoham, L. M. Kern, P. J. Klemmer, M. Mazumdar, P. A. August, A simple algorithm to predict incident kidney disease, Archives of internal medicine 168 (2008) 2466–2473. doi:10.1001/archinte.168.22.2466

work page doi:10.1001/archinte.168.22.2466 2008

[24] [24]

Thakkinstian, A

A. Thakkinstian, A. Ingsathit, A. Chaiprasert, S. Rattanasiri, P. Sangthawan, P. Gojaseni, K. Kiattisunthorn, L. Ongaiyooth, P. Thirakhupt, A simplified clinical prediction score of chronic kidney disease: a cross-sectional-survey study, BMC nephrology 12 (2011) 45. doi:10.1186/ 1471-2369-12-45

work page 2011

[25] [25]

Kearns, H

B. Kearns, H. Gallagher, S. de Lusignan, Predicting the prevalence of chronic kidney disease in the english population: a cross-sectional study, BMC nephrology 14 (2013) 49. doi:10.1186/1471-2369-14-49

work page doi:10.1186/1471-2369-14-49 2013

[26] [26]

K.-S. Kwon, H. Bang, A. S. Bomback, D.-h. Koh, J.-H. Yum, J.-H. Lee, S. Lee, S. K. Park, K.-Y . Yoo, S. K. Park, et al., A simple prediction score for kidney disease in the korean population, Nephrology 17 (2012) 278–284. doi:10.1111/j.1440-1797.2011.01552.x

work page doi:10.1111/j.1440-1797.2011.01552.x 2012

[27] [27]

Sanmarchi, C

F. Sanmarchi, C. Fanconi, D. Golinelli, D. Gori, T. Hernandez-Boussard, A. Capodici, Predict, diagnose, and treat chronic kidney disease with machine learning: a systematic literature review, Journal of nephrology 36 (2023) 1101–1117

work page 2023

[28] [28]

Delrue, S

C. Delrue, S. De Bruyne, M. M. Speeckaert, Application of machine learning in chronic kidney disease: current status and future prospects, Biomedicines 12 (2024) 568. doi:10.3390/biomedicines12030568

work page doi:10.3390/biomedicines12030568 2024

[29] [29]

Gogoi, J

P. Gogoi, J. A. Valan, Machine learning approaches for predicting and diagnosing chronic kidney disease: current trends, challenges, solutions, and future directions, International Urology and Nephrology 57 (2025) 1245–1268

work page 2025

[30] [30]

Sabanayagam, R

C. Sabanayagam, R. Banu, C. Lim, Y . C. Tham, C.-Y . Cheng, G. Tan, E. Ekinci, B. Sheng, G. McKay, J. E. Shaw, et al., Artificial intelligence xxix in chronic kidney disease management: a scoping review, Theranostics 15 (2025) 4566

work page 2025

[31] [31]

A. Khan, Y . Tayyebi, A comprehensive analysis to detect chronic kidney disease and stage prediction: Using machine learning, Asian Journal of Research in Computer Science 18 (2025) 151–162

work page 2025

[32] [32]

Rubini, P

L. Rubini, P. Soundarapandian, P. Eswaran, Chronic Kidney Disease, UCI Machine Learning Repository, 2015. doi:10.24432/C5G020

work page doi:10.24432/c5g020 2015

[33] [33]

Dharmarathne, M

G. Dharmarathne, M. Bogahawaththa, M. McAfee, U. Rathnayake, D. Meddage, On the diagnosis of chronic kidney disease using a machine learning-based interface with explainable artificial intelligence, Intelligent Systems with Applications 22 (2024) 200397

work page 2024

[34] [34]

M. A. Islam, M. Z. H. Majumder, M. A. Hussein, Chronic kidney disease prediction based on machine learning algorithms, Journal of pathology informatics 14 (2023) 100189

work page 2023

[35] [35]

Pujitha, N

K. Pujitha, N. B. Soni, L. F. Eram, P. N. Sai, S. Divija, R. S. Supriya, Chronic kidney disease detection using machine learning approach, in: 2023 2nd International Conference on Vision Towards Emerging Trends in Communication and Networking Technologies (ViTECoN), IEEE, 2023, pp. 1–5

work page 2023

[36] [36]

Gogoi, J

P. Gogoi, J. A. Valan, Chronic kidney disease prediction using machine learning techniques: a comparative study of feature selection methods with smote and shap, Multiscale and Multidisciplinary Modeling, Experiments and Design 8 (2025) 1–23

work page 2025

[37] [37]

M. H. I. Bijoy, M. J. Mia, M. M. Rahman, M. S. Arefin, P. K. Dhar, T. Shimamura, A robot process automation based mobile application for early prediction of chronic kidney disease using machine learning, Discover Applied Sciences 7 (2025) 528

work page 2025

[38] [38]

K. T. Jawad, A. Verma, F. Amsaad, L. Ashraf, A study on the application of explainable ai on ensemble models for predictive analysis of chronic kidney disease, IEEE Access (2025)

work page 2025

[39] [39]

G. U. Nneji, H. N. Monday, V . S. R. Pathapati, S. Nahar, G. T. Mgbejime, E. S. Umana, M. A. Hossin, Ffs-iml: fusion-based statistical feature selection for machine learning-driven interpretability of chronic kidney disease, International Journal of Machine Learning and Cybernetics (2025) 1–34

work page 2025

[40] [40]

D. A. Debal, T. M. Sitote, Chronic kidney disease prediction using machine learning techniques, Journal of Big Data 9 (2022) 109

work page 2022

[41] [41]

Zheng, X

J.-X. Zheng, X. Li, J. Zhu, S.-Y . Guan, S.-X. Zhang, W.-M. Wang, Interpretable machine learning for predicting chronic kidney disease progression risk, Digital Health 10 (2024) 20552076231224225

work page 2024

[42] [42]

S. K. Ghosh, A. H. Khandoker, Investigation on explainable machine learning models to predict chronic kidney diseases, Scientific Reports 14 (2024) 3687

work page 2024

[43] [43]

Khalil, K

W. Khalil, K. Bashir, M. Mosadag, Early detection of chronic kidney disease (ckd) using machine learning algorithms, East Journal of Computer Science 1 (2025) 1–9

work page 2025

[44] [44]

Iftikhar, M

H. Iftikhar, M. Khan, Z. Khan, F. Khan, H. M. Alshanbari, Z. Ahmad, A comparative analysis of machine learning models: a case study in predicting chronic kidney disease, Sustainability 15 (2023) 2754

work page 2023

[45] [45]

Iftikhar, A

H. Iftikhar, A. F. Hashem, M. Qureshi, P. C. Rodrigues, Clinical application of machine learning models for early-stage chronic kidney disease detection, Diagnostics 15 (2025) 2610

work page 2025

[46] [46]

Iftikhar, A

H. Iftikhar, A. F. Hashem, L. A. Mohamud, A. Al-Moisheer, R. I. G. Medina, J. L. L´opez-Gonzales, An intelligent ensemble machine learning model for early detection of chronic kidney disease in aging populations, Scientific Reports (2026)

work page 2026

[47] [47]

Metherall, A

B. Metherall, A. K. Berryman, G. S. Brennan, Machine learning for classifying chronic kidney disease and predicting creatinine levels using at-home measurements, Scientific Reports 15 (2025) 4364

work page 2025

[48] [48]

S. S. Natarajan, E. Balasubramanian, B. K. Raghupathy, M. Ganesan, Using machine learning models with elephant herd feature selection method for diagnosing chronic kidney disease, Computational Journal of Mathematical and Statistical Sciences (2025)

work page 2025

[49] [49]

M. F. Hossain, S. T. Diya, R. Khan, Acd-ml: Advanced ckd detection using machine learning: A tri-phase ensemble and multi-layered stacking and blending approach, Computer Methods and Programs in Biomedicine Update 7 (2025) 100173

work page 2025

[50] [50]

M. M. Rahman, M. Al-Amin, J. Hossain, Machine learning models for chronic kidney disease diagnosis and prediction, Biomedical Signal Processing and Control 87 (2024) 105368

work page 2024

[51] [51]

N. Md. Ashafuddula, B. Islam, R. Islam, An intelligent diagnostic system to analyze early-stage chronic kidney disease for clinical applica- tion, Applied Computational Intelligence and Soft Computing 2023 (2023) 3140270. xxx

work page 2023

[52] [52]

M. A. Islam, S. Akter, Risk Factor Prediction of Chronic Kidney Disease, UCI Machine Learning Repository, 2020. doi:10.24432/C5WP64

work page doi:10.24432/c5wp64 2020

[53] [53]

Al-Shamsi, D

S. Al-Shamsi, D. Regmi, R. D. Govender, Chronic kidney disease in patients at high risk of cardiovascular disease in the united arab emirates: A population-based study, PLOS ONE 13 (2018) 1–12. doi:10.1371/journal.pone.0199920

work page doi:10.1371/journal.pone.0199920 2018

[54] [54]

L. A. Inker, B. C. Astor, C. H. Fox, T. Isakova, J. P. Lash, C. A. Peralta, M. K. Tamura, H. I. Feldman, Kdoqi us commentary on the 2012 kdigo clinical practice guideline for the evaluation and management of ckd, American Journal of Kidney Diseases 63 (2014) 713–735. doi:10.1053/j.ajkd.2014.01.416

work page doi:10.1053/j.ajkd.2014.01.416 2012

[55] [55]

doi:10.18637/jss.v045.i03 , abstract =

S. van Buuren, K. Groothuis-Oudshoorn, mice: Multivariate imputation by chained equations in r, Journal of Statistical Software 45 (2011) 1–67. doi:10.18637/jss.v045.i03

work page doi:10.18637/jss.v045.i03 2011

[56] [56]

Adhikari, W

D. Adhikari, W. Jiang, J. Zhan, Z. He, D. B. Rawat, U. Aickelin, H. A. Khorshidi, A comprehensive survey on imputation of missing data in internet of things, ACM Computing Surveys 55 (2022) 1–38. doi:10.1145/3533381

work page doi:10.1145/3533381 2022

[57] [57]

Pedregosa, G

F. Pedregosa, G. Varoquaux, A. Gramfort, V . Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V . Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, ´E. Duchesnay, Scikit-learn: Machine learning in python, Journal of Machine Learning Research 12 (2011) 2825–2830

work page 2011

[58] [58]

H. B. Mann, D. R. Whitney, On a test of whether one of two random variables is stochastically larger than the other, The Annals of Mathematical Statistics 18 (1947) 50–60. doi:10.1214/aoms/1177730491

work page doi:10.1214/aoms/1177730491 1947

[59] [59]

Tibshirani, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society: Series B 58 (1996) 267–288

R. Tibshirani, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society: Series B 58 (1996) 267–288

work page 1996

[60] [60]

Guyon, J

I. Guyon, J. Weston, S. Barnhill, V . Vapnik, Gene selection for cancer classification using support vector machines, in: Machine Learning, MIT Press, 2002, pp. 389–422

work page 2002

[61] [61]

D. W. Hosmer, S. Lemeshow, R. X. Sturdivant, Applied Logistic Regression, John Wiley & Sons, 2013

work page 2013

[62] [62]

Breiman, J

L. Breiman, J. Friedman, R. Olshen, C. Stone, Classification and Regression Trees, Wadsworth International Group, 1984

work page 1984

[63] [63]

Machine Learning 45(1), 5–32 (Oct 2001)

L. Breiman, Random forests, Machine Learning 45 (2001) 5–32. doi:10.1023/A:1010933404324

work page doi:10.1023/a:1010933404324 2001

[64] [64]

J. H. Friedman, Greedy function approximation: A gradient boosting machine, Annals of Statistics 29 (2001) 1189–1232

work page 2001

[65] [65]

Freund, R

Y . Freund, R. E. Schapire, A decision-theoretic generalization of on-line learning and an application to boosting, Journal of Computer and System Sciences 55 (1997) 119–139

work page 1997

[66] [66]

Geurts, D

P. Geurts, D. Ernst, L. Wehenkel, Extremely randomized trees, Machine Learning 63 (2006) 3–42

work page 2006

[67] [67]

T. Chen, C. Guestrin, Xgboost: A scalable tree boosting system, in: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016, pp. 785–794. doi:10.1145/2939672.2939785

work page doi:10.1145/2939672.2939785 2016

[68] [68]

A. V . Dorogush, V . Ershov, A. Gulin, Catboost: Gradient boosting with categorical features support, 2018.arXiv:1810.11363

work page internal anchor Pith review Pith/arXiv arXiv 2018

[69] [69]

Cover, P

T. Cover, P. Hart, Nearest neighbor pattern classification, IEEE Transactions on Information Theory 13 (1967) 21–27

work page 1967

[70] [70]

Cortes, V

C. Cortes, V . Vapnik, Support vector networks, Machine Learning 20 (1995) 273–297

work page 1995

[71] [71]

G. Ke, Q. Meng, T. Finley, T. Wang, W. Chen, W. Ma, Q. Ye, T.-Y . Liu, Lightgbm: A highly efficient gradient boosting decision tree, in: Advances in Neural Information Processing Systems, 2017, pp. 3149–3157

work page 2017

[72] [72]

D. E. Rumelhart, G. E. Hinton, R. J. Williams, Learning representations by back-propagating errors, Nature 323 (1986) 533–536

work page 1986

[73] [73]

R. Kohavi, A study of cross-validation and bootstrap for accuracy estimation and model selection, in: Proceedings of the 14th International Joint Conference on Artificial Intelligence, volume 2, Morgan Kaufmann Publishers, 1995, pp. 1137–1143

work page 1995

[74] [74]

Robust anomaly detection for multivariate time series through stochastic recurrent neural network,

T. Akiba, S. Sano, T. Yanase, T. Ohta, M. Koyama, Optuna: A next-generation hyperparameter optimization framework, in: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2019, pp. 2623–2631. doi:10.1145/3292500. 3330701

work page doi:10.1145/3292500 2019

[75] [75]

S. M. Lundberg, S.-I. Lee, A unified approach to interpreting model predictions, in: Advances in Neural Information Processing Systems, volume 30, 2017, pp. 4765–4774. xxxi

work page 2017