加纳某三级医院T2DM患者死亡率的预测模型和决定因素:机器学习技术的表现如何?

IF 2.8 3区 医学 Q3 ENDOCRINOLOGY & METABOLISM
Godsway Edem Kpene, Sylvester Yao Lokpo, Sandra A Darfour-Oduro
{"title":"加纳某三级医院T2DM患者死亡率的预测模型和决定因素:机器学习技术的表现如何?","authors":"Godsway Edem Kpene, Sylvester Yao Lokpo, Sandra A Darfour-Oduro","doi":"10.1186/s12902-025-01831-5","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The increasing prevalence of type 2 diabetes mellitus (T2DM) in lower and middle - income countries call for preventive public health interventions. Studies from Africa including those from Ghana, consistently reveal high T2DM-related mortality rates. While previous research in the Ho municipality has primarily examined risk factors, comorbidity, and quality of life of T2DM patients, this study specifically investigated mortality predictors among these patients.</p><p><strong>Method: </strong>The study was retrospective involving medical records of T2DM patients. Data extracted included mortality outcome (dead or alive), sociodemographic characteristics (age, sex, marital status, educational level, occupation and location), family history of diseases (diabetes, cardiovascular disease (CVD), or asthma), lifestyle (smoking and alcohol intake), comorbidities (such as skin infections, sickle cell disease, urinary tract infections, and pneumonia) and complications of diabetes (CVD, nephropathy, neuropathy, foot ulcers, and diabetic ketoacidosis) were analyzed using Stata version 16.0 and Python 3.6.1 programming language. Both descriptive and inferential statistics were done to describe and build predictive models respectively. The performance of machine learning (ML) techniques such as support vector machine (SVM), decision tree, k nearest neighbor (kNN), eXtreme Gradient Boosting (XGBoost) and logistic regression were evaluated using the best-fitting predictive model for T2DM mortality.</p><p><strong>Results: </strong>Of the 328 participants, 183 (55.79%) were female, and the percentage of mortality was 11.28%. A 100% mortality was recorded among the T2DM patients with sepsis (p-value = 0.012). T2DM in-patients were 3.83 times as likely to die [AOR = 3.83; 95% CI: (1.53-9.61)] if they had nephropathy compared to T2DM in-patients without nephropathy (p-value = 0.004). The full model which included sociodemographic characteristics, family history, lifestyle variables and complications of T2DM had the best prediction of T2DM mortality outcome (ROC = 72.97%). The accuracy for (test and train datasets) were as follows: (90% and 90%), (100% and 100%), (90% and 90%), (90% and 88%) and (88% and 90%) respectively for the various ML classification techniques: logistic regression, Decision tree classifier, kNN classifier, SVM and XGBoost.</p><p><strong>Conclusion: </strong>This study found that all in-patients with sepsis died. Nephropathy was the identified significant predictor of T2DM mortality. Decision tree classifier provided the best classifying potential.</p>","PeriodicalId":9152,"journal":{"name":"BMC Endocrine Disorders","volume":"25 1","pages":"9"},"PeriodicalIF":2.8000,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11720850/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predictive models and determinants of mortality among T2DM patients in a tertiary hospital in Ghana, how do machine learning techniques perform?\",\"authors\":\"Godsway Edem Kpene, Sylvester Yao Lokpo, Sandra A Darfour-Oduro\",\"doi\":\"10.1186/s12902-025-01831-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The increasing prevalence of type 2 diabetes mellitus (T2DM) in lower and middle - income countries call for preventive public health interventions. Studies from Africa including those from Ghana, consistently reveal high T2DM-related mortality rates. While previous research in the Ho municipality has primarily examined risk factors, comorbidity, and quality of life of T2DM patients, this study specifically investigated mortality predictors among these patients.</p><p><strong>Method: </strong>The study was retrospective involving medical records of T2DM patients. Data extracted included mortality outcome (dead or alive), sociodemographic characteristics (age, sex, marital status, educational level, occupation and location), family history of diseases (diabetes, cardiovascular disease (CVD), or asthma), lifestyle (smoking and alcohol intake), comorbidities (such as skin infections, sickle cell disease, urinary tract infections, and pneumonia) and complications of diabetes (CVD, nephropathy, neuropathy, foot ulcers, and diabetic ketoacidosis) were analyzed using Stata version 16.0 and Python 3.6.1 programming language. Both descriptive and inferential statistics were done to describe and build predictive models respectively. The performance of machine learning (ML) techniques such as support vector machine (SVM), decision tree, k nearest neighbor (kNN), eXtreme Gradient Boosting (XGBoost) and logistic regression were evaluated using the best-fitting predictive model for T2DM mortality.</p><p><strong>Results: </strong>Of the 328 participants, 183 (55.79%) were female, and the percentage of mortality was 11.28%. A 100% mortality was recorded among the T2DM patients with sepsis (p-value = 0.012). T2DM in-patients were 3.83 times as likely to die [AOR = 3.83; 95% CI: (1.53-9.61)] if they had nephropathy compared to T2DM in-patients without nephropathy (p-value = 0.004). The full model which included sociodemographic characteristics, family history, lifestyle variables and complications of T2DM had the best prediction of T2DM mortality outcome (ROC = 72.97%). The accuracy for (test and train datasets) were as follows: (90% and 90%), (100% and 100%), (90% and 90%), (90% and 88%) and (88% and 90%) respectively for the various ML classification techniques: logistic regression, Decision tree classifier, kNN classifier, SVM and XGBoost.</p><p><strong>Conclusion: </strong>This study found that all in-patients with sepsis died. Nephropathy was the identified significant predictor of T2DM mortality. Decision tree classifier provided the best classifying potential.</p>\",\"PeriodicalId\":9152,\"journal\":{\"name\":\"BMC Endocrine Disorders\",\"volume\":\"25 1\",\"pages\":\"9\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-01-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11720850/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Endocrine Disorders\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12902-025-01831-5\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENDOCRINOLOGY & METABOLISM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Endocrine Disorders","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12902-025-01831-5","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

摘要

背景:2型糖尿病(T2DM)在低收入和中等收入国家的患病率不断上升,要求采取预防性公共卫生干预措施。来自非洲的研究,包括来自加纳的研究,一致表明t2dm相关的死亡率很高。先前的研究主要是研究T2DM患者的危险因素、合并症和生活质量,而本研究专门调查了这些患者的死亡率预测因素。方法:回顾性分析2型糖尿病患者的医疗记录。提取的数据包括死亡率结局(死亡或存活)、社会人口统计学特征(年龄、性别、婚姻状况、教育水平、职业和地点)、疾病家族史(糖尿病、心血管疾病(CVD)或哮喘)、生活方式(吸烟和饮酒)、合并症(如皮肤感染、镰状细胞病、尿路感染和肺炎)和糖尿病并发症(CVD、肾病、神经病变、足部溃疡、采用Stata version 16.0和Python 3.6.1编程语言对糖尿病酮症酸中毒患者进行分析。描述统计和推理统计分别用于描述和建立预测模型。机器学习(ML)技术的性能,如支持向量机(SVM)、决策树、k近邻(kNN)、极限梯度提升(XGBoost)和逻辑回归,使用最佳拟合预测模型对T2DM死亡率进行评估。结果:328例患者中,女性183例(55.79%),死亡率11.28%。T2DM合并脓毒症患者的死亡率为100% (p值= 0.012)。T2DM住院患者的死亡率是前者的3.83倍[AOR = 3.83;95% CI:(1.53-9.61)]是否有肾病,与无肾病的T2DM住院患者相比(p值= 0.004)。纳入社会人口学特征、家族史、生活方式变量和T2DM并发症的完整模型对T2DM死亡结局的预测效果最好(ROC = 72.97%)。对于各种ML分类技术:逻辑回归、决策树分类器、kNN分类器、SVM和XGBoost,(测试和训练数据集)的准确率分别为:(90%和90%)、(100%和100%)、(90%和90%)、(90%和88%)和(88%和90%)。结论:本研究发现住院败血症患者全部死亡。肾病是T2DM死亡率的重要预测因素。决策树分类器提供了最好的分类潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predictive models and determinants of mortality among T2DM patients in a tertiary hospital in Ghana, how do machine learning techniques perform?

Background: The increasing prevalence of type 2 diabetes mellitus (T2DM) in lower and middle - income countries call for preventive public health interventions. Studies from Africa including those from Ghana, consistently reveal high T2DM-related mortality rates. While previous research in the Ho municipality has primarily examined risk factors, comorbidity, and quality of life of T2DM patients, this study specifically investigated mortality predictors among these patients.

Method: The study was retrospective involving medical records of T2DM patients. Data extracted included mortality outcome (dead or alive), sociodemographic characteristics (age, sex, marital status, educational level, occupation and location), family history of diseases (diabetes, cardiovascular disease (CVD), or asthma), lifestyle (smoking and alcohol intake), comorbidities (such as skin infections, sickle cell disease, urinary tract infections, and pneumonia) and complications of diabetes (CVD, nephropathy, neuropathy, foot ulcers, and diabetic ketoacidosis) were analyzed using Stata version 16.0 and Python 3.6.1 programming language. Both descriptive and inferential statistics were done to describe and build predictive models respectively. The performance of machine learning (ML) techniques such as support vector machine (SVM), decision tree, k nearest neighbor (kNN), eXtreme Gradient Boosting (XGBoost) and logistic regression were evaluated using the best-fitting predictive model for T2DM mortality.

Results: Of the 328 participants, 183 (55.79%) were female, and the percentage of mortality was 11.28%. A 100% mortality was recorded among the T2DM patients with sepsis (p-value = 0.012). T2DM in-patients were 3.83 times as likely to die [AOR = 3.83; 95% CI: (1.53-9.61)] if they had nephropathy compared to T2DM in-patients without nephropathy (p-value = 0.004). The full model which included sociodemographic characteristics, family history, lifestyle variables and complications of T2DM had the best prediction of T2DM mortality outcome (ROC = 72.97%). The accuracy for (test and train datasets) were as follows: (90% and 90%), (100% and 100%), (90% and 90%), (90% and 88%) and (88% and 90%) respectively for the various ML classification techniques: logistic regression, Decision tree classifier, kNN classifier, SVM and XGBoost.

Conclusion: This study found that all in-patients with sepsis died. Nephropathy was the identified significant predictor of T2DM mortality. Decision tree classifier provided the best classifying potential.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Endocrine Disorders
BMC Endocrine Disorders ENDOCRINOLOGY & METABOLISM-
CiteScore
4.40
自引率
0.00%
发文量
280
审稿时长
>12 weeks
期刊介绍: BMC Endocrine Disorders is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of endocrine disorders, as well as related molecular genetics, pathophysiology, and epidemiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信