{"title":"Machine Learning-Based Prediction Model for Health-Related Quality of Life in Diabetic Patients.","authors":"Shinhye Ahn, Minjeong An","doi":"10.1177/10547738251367551","DOIUrl":null,"url":null,"abstract":"<p><p>The increasing prevalence of diabetes mellitus (DM) and patients' lack of self-management awareness have led to a decline in health-related quality of life (HRQoL). Studies identifying potential risk factors for HRQoL in DM patients and presenting generalized models are relatively scarce. The study aimed to develop and evaluate a machine learning (ML)-based model to predict the HRQoL in adult diabetic patients and to examine the important factors affecting HRQoL. This study extracted factors from the Korea National Health and Nutrition Examination Survey database (2016-2020) based on situation-specific theory, and using data from 2,501 adult DM patients. We developed five ML-based HRQoL classifiers (logistic regression, naïve Bayes, random forest, support vector machine, and extreme gradient boosting (XGBoost) in DM patients. The developed ML model was evaluated using six evaluation metrics to determine the best model, and feature importance was computed based on Shapley additive explanations (SHAP) value. The XGBoost model showed the best performance, with an accuracy of 0.940, a recall of 0.943, a precision of 0.940, a specificity of 0.919, an F1-score of 0.942, and an area under the curve score of 0.984. Based on SHAP values, the top five significant predictors of HRQoL were self-rated health (1.898), employment (0.822), triglycerides (0.781), education level (0.618), and aspartate transaminase/alanine transaminase ratio (0.611). The findings confirmed that the ML-based prediction model achieved high accuracy (over 90%) in distinguishing stable and at-risk groups in terms of HRQoL among adult DM patients. The XGBoost model's superior performance supports its potential integration into routine diabetes care as a decision-support tool. Identifying high-risk individuals early can help healthcare providers implement targeted interventions to improve long-term health outcomes.</p>","PeriodicalId":50677,"journal":{"name":"Clinical Nursing Research","volume":" ","pages":"340-353"},"PeriodicalIF":1.8000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Nursing Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/10547738251367551","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/10 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"NURSING","Score":null,"Total":0}
引用次数: 0
Abstract
The increasing prevalence of diabetes mellitus (DM) and patients' lack of self-management awareness have led to a decline in health-related quality of life (HRQoL). Studies identifying potential risk factors for HRQoL in DM patients and presenting generalized models are relatively scarce. The study aimed to develop and evaluate a machine learning (ML)-based model to predict the HRQoL in adult diabetic patients and to examine the important factors affecting HRQoL. This study extracted factors from the Korea National Health and Nutrition Examination Survey database (2016-2020) based on situation-specific theory, and using data from 2,501 adult DM patients. We developed five ML-based HRQoL classifiers (logistic regression, naïve Bayes, random forest, support vector machine, and extreme gradient boosting (XGBoost) in DM patients. The developed ML model was evaluated using six evaluation metrics to determine the best model, and feature importance was computed based on Shapley additive explanations (SHAP) value. The XGBoost model showed the best performance, with an accuracy of 0.940, a recall of 0.943, a precision of 0.940, a specificity of 0.919, an F1-score of 0.942, and an area under the curve score of 0.984. Based on SHAP values, the top five significant predictors of HRQoL were self-rated health (1.898), employment (0.822), triglycerides (0.781), education level (0.618), and aspartate transaminase/alanine transaminase ratio (0.611). The findings confirmed that the ML-based prediction model achieved high accuracy (over 90%) in distinguishing stable and at-risk groups in terms of HRQoL among adult DM patients. The XGBoost model's superior performance supports its potential integration into routine diabetes care as a decision-support tool. Identifying high-risk individuals early can help healthcare providers implement targeted interventions to improve long-term health outcomes.
期刊介绍:
Clinical Nursing Research (CNR) is a peer-reviewed quarterly journal that addresses issues of clinical research that are meaningful to practicing nurses, providing an international forum to encourage discussion among clinical practitioners, enhance clinical practice by pinpointing potential clinical applications of the latest scholarly research, and disseminate research findings of particular interest to practicing nurses. This journal is a member of the Committee on Publication Ethics (COPE).