Leveraging Stacked Machine Learning Models to Advance Diagnostic Precision and Predictive Insights in Chronic Kidney Disease
Published 2025-06-26
Keywords
- Chronic Kidney Disease (CKD),
- Machine Learning,
- Stacking Ensemble,
- AdaBoost,
- Classification
- Clinical Data,
- Model Evaluation ...More
How to Cite
Abstract
A considerable number of people worldwide suffer from chronic kidney disease (CKD), a progressive illness that often goes undiagnosed until it has advanced to a severe degree. Timely action and better patient outcomes rely on early diagnosis. However, traditional diagnostic methods are time-consuming and may lack consistency, especially in resource-constrained settings. To address this, our study explores the application of machine learning (ML) for early-stage CKD prediction. We used the publicly available CKD dataset from the UCI Machine Learning Repository, which includes 400 patient records across 14 clinical features. After performing thorough preprocessing—including handling missing values and converting categorical data—we applied multiple ML classifiers: Naïve Bayes, Decision Tree, Gradient Boosting, AdaBoost, and XGBoost. Each model was evaluated using 10-fold cross-validation to ensure reliability. The core of our approach lies in a stacking ensemble model, which combines predictions from three base learners—Naïve Bayes, Decision Tree, and Gradient Boosting—and passes them to a meta-learner based on AdaBoost. This layered learning framework was specifically designed to mitigate overfitting, which is evident in some individual models during evaluation. The proposed stacking model demonstrated the best performance among all tested models, achieving a training accuracy of 99.2% and a testing accuracy of 98.93%. Furthermore, it yielded a precision of 98.70%, a recall of 98.10%, and an F1-score of 98.40%, outperforming other standalone algorithms in accuracy and stability.
References
- Pathak, A., Singh, O. P., & Biswas, U. (2025). Predicting chronic kidney disease using machine learning methodologies. In 2025 8th International Conference on Electronics, Materials Engineering & Nano-Technology (IEMENTech) (pp. 1–6). IEEE.
- Ahmed, K., Dubey, M. K., Dubey, S., Pandey, D. K., et al. (2025). Chronic kidney disease: Causes, treatment, management, and future scope. In Computational Intelligence for Genomics Data (pp. 99–111). Elsevier.
- Kounatidis, D., Vallianou, N. G., Stratigou, T., Voukali, M., Karampela, I., & Dalamaga, M. (2024). The kidney in obesity: Current evidence, perspectives and controversies. Current Obesity Reports, 13(4), 680–702.
- Wang, Y., Lu, Y., Ma, J., Hu, Y., Zheng, R., Liu, L., Lin, K., Zhang, K., & Cai, H. (2024). The global burden of chronic kidney disease (CKD) attributable to high sodium intake: A comprehensive analysis of trends from 1990 to 2021 and burden prediction to 2040. SSRN. https://ssrn.com/abstract=5242625
- Sinha, R., Noh, L., Sethi, S. K., Safadi, R., Smith, S., Düzova, A., Bjornstad, E. C., Antwi, S., Ishikura, K., Salgia, E., et al. (2025). Pediatric kidney replacement therapies in low-to-middle income countries: A review and white paper. Pediatric Nephrology, 1–17.
- Lee, P.-H., Huang, S. M., Tsai, Y.-C., Wang, Y.-T., & Chew, F. Y. (2025). Biomarkers in contrast-induced nephropathy: Advances in early detection, risk assessment, and prevention strategies. International Journal of Molecular Sciences, 26(7), 2869.
- Razzaq, K., & Shah, M. (2025). Machine learning and deep learning paradigms: From techniques to practical applications and research frontiers. Computers, 14(3), 93.
- Metherall, B., Berryman, A. K., & Brennan, G. S. (2025). Machine learning for classifying chronic kidney disease and predicting creatinine levels using at-home measurements. Scientific Reports, 15(1), 4364.
- Ramu, K., Patthi, S., Prajapati, Y. N., Ramesh, J. V. N., Banerjee, S., Rao, K. B., Alzahrani, S. I., et al. (2025). Hybrid CNN-SVM model for enhanced early detection of chronic kidney disease. Biomedical Signal Processing and Control, 100, 107084.
- Pechprasarn, S., Wetchasit, P., & Pongsuwan, S. (2025). Optimizing chronic kidney disease prediction: A machine learning approach with minimal diagnostic predictors. Journal of Current Science and Technology, 15(1), 76–76.
- Rahman, M. M., Al-Amin, M., & Hossain, J. (2024). Machine learning models for chronic kidney disease diagnosis and prediction. Biomedical Signal Processing and Control, 87, 105368.
- Rezk, N. G., Alshathri, S., Sayed, A., & Hemdan, E. E.-D. (2025). Explainable AI for chronic kidney disease prediction in medical IoT: Integrating GANs and few-shot learning. Bioengineering, 12(4), 356.
- Ghosh, S. K., & Khandoker, A. H. (2024). Investigation on explainable machine learning models to predict chronic kidney diseases. Scientific Reports, 14(1), 3687.
- Ghosh, B. P., Imam, T., Anjum, N., Mia, M. T., Siddiqua, C. U., Sharif, K. S., Khan, M. M., Mamun, M. A. I., & Hossain, M. Z. (2024). Advancing chronic kidney disease prediction: Comparative analysis of machine learning algorithms and a hybrid model. Journal of Computer Science and Technology Studies, 6(3), 15–21.
- Prova, N. N. I. (2024). Advanced machine learning techniques for predictive analysis of health insurance. In 2024 Second International Conference on Intelligent Cyber Physical Systems and Internet of Things (ICoICI) (pp. 1166–1170). IEEE.
- Pasha, A., ur Rahman, S. Z., Tauheed, S., Basha, S. M., & Anwar, B. H. (2024). Comparative Analysis of Ranking Machine Learning Classifier Models for Parkinson's Disease (PD) Prediction. In Disruptive Technologies for Sustainable Development (pp. 237-241). CRC Press.
- Kumar, A., Satheesha, T. Y., Salvador, B. B. L., Mithileysh, S., & Ahmed, S. T. (2023). Augmented Intelligence enabled Deep Neural Networking (AuDNN) framework for skin cancer classification and prediction using multi-dimensional datasets on industrial IoT standards. Microprocessors and Microsystems, 97, 104755.