使用机器学习预测成年hiv阳性患者抗逆转录病毒治疗依从性状况,西北,埃塞俄比亚,2025。

IF 3.3 3区 医学 Q2 MEDICAL INFORMATICS
Kelemua Aschale Yeneakal, Gizaw Hailiye Teferi, Temesgen T Mihret, Abraham Keffale Mengistu, Sefefe Birhanu Tizie, Maru Meseret Tadele
{"title":"使用机器学习预测成年hiv阳性患者抗逆转录病毒治疗依从性状况,西北,埃塞俄比亚,2025。","authors":"Kelemua Aschale Yeneakal, Gizaw Hailiye Teferi, Temesgen T Mihret, Abraham Keffale Mengistu, Sefefe Birhanu Tizie, Maru Meseret Tadele","doi":"10.1186/s12911-025-03106-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Adherence with Anti-Retroviral Therapy (ART) reduces viral load, as well as HIV-related morbidity and mortality. Despite the expanded availability of ART, non-adherence remains a series problem, leads increased viral load, a decline CD4 cell count, and the development of drug resistance. HIV care is currently showing promise with the use of machine learning algorithms for early prediction of future non-adherence. However, as to researcher's Knowledge, there was limited research supporting this evidence in the country. Therefore, the primary aim of this study was to predict ART adherence status using machine learning models and to identify the most important predictors of Adherence at Debre Markos comprehensive specialized hospital.</p><p><strong>Methods: </strong>Secondary data was collected from ART database of Debre Markos comprehensive specialized hospital, spanning from 2005 to 2024. The dataset was split into training (80%) and testing (20%) sets. To address class imbalance, the Synthetic Minority Oversampling Technique (SMOTE) was applied to the training data. Seven machine learning algorithms: support vector machine, random forest, decision tree, logistic regression, gradient boosting, K-nearest neighbors, and artificial neural network were trained. The model performance was evaluated using ROC-AUC, F1 score, accuracy, precision, and recall. To identify important predictor we employed feature importance technique.</p><p><strong>Result: </strong>Out of 4640 patients, who were on antiretroviral therapy, 63.56% (n = 2949) were females, with mean age of 41.8 years (SD ± 11.50). The majority age group was between 40 and 59 years (n = 2152) 46.38% and 98.1% of patients had good adherence while 1.9% had poor adherence. Among the machine learning models tested, the gradient boosting algorithm performed better than all other algorithms with (Accuracy = 0.78, Sensitivity = 0.76, F1score = 0.78, AUC = 0.76). Age, regimen, WHO clinical stage, nutritional status, address status, sex, weight, recent CD4 cell count, viral load and ART dose per day were identified as the most important predictors for adherence status.</p><p><strong>Conclusion: </strong>The study developed a gradient boosting model for predicting adherence status. Age, regimen, WHO clinical stage, nutritional status, address status, sex, weight, recent CD4 cell count, viral load and ART dose per day were the most important predictors for adherence status.</p><p><strong>Clinical trial number: </strong>Not applicable.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"259"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247312/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting antiretroviral therapy adherence status of adult HIV-positive patients using machine-learning Northwest, Ethiopia, 2025.\",\"authors\":\"Kelemua Aschale Yeneakal, Gizaw Hailiye Teferi, Temesgen T Mihret, Abraham Keffale Mengistu, Sefefe Birhanu Tizie, Maru Meseret Tadele\",\"doi\":\"10.1186/s12911-025-03106-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Adherence with Anti-Retroviral Therapy (ART) reduces viral load, as well as HIV-related morbidity and mortality. Despite the expanded availability of ART, non-adherence remains a series problem, leads increased viral load, a decline CD4 cell count, and the development of drug resistance. HIV care is currently showing promise with the use of machine learning algorithms for early prediction of future non-adherence. However, as to researcher's Knowledge, there was limited research supporting this evidence in the country. Therefore, the primary aim of this study was to predict ART adherence status using machine learning models and to identify the most important predictors of Adherence at Debre Markos comprehensive specialized hospital.</p><p><strong>Methods: </strong>Secondary data was collected from ART database of Debre Markos comprehensive specialized hospital, spanning from 2005 to 2024. The dataset was split into training (80%) and testing (20%) sets. To address class imbalance, the Synthetic Minority Oversampling Technique (SMOTE) was applied to the training data. Seven machine learning algorithms: support vector machine, random forest, decision tree, logistic regression, gradient boosting, K-nearest neighbors, and artificial neural network were trained. The model performance was evaluated using ROC-AUC, F1 score, accuracy, precision, and recall. To identify important predictor we employed feature importance technique.</p><p><strong>Result: </strong>Out of 4640 patients, who were on antiretroviral therapy, 63.56% (n = 2949) were females, with mean age of 41.8 years (SD ± 11.50). The majority age group was between 40 and 59 years (n = 2152) 46.38% and 98.1% of patients had good adherence while 1.9% had poor adherence. Among the machine learning models tested, the gradient boosting algorithm performed better than all other algorithms with (Accuracy = 0.78, Sensitivity = 0.76, F1score = 0.78, AUC = 0.76). Age, regimen, WHO clinical stage, nutritional status, address status, sex, weight, recent CD4 cell count, viral load and ART dose per day were identified as the most important predictors for adherence status.</p><p><strong>Conclusion: </strong>The study developed a gradient boosting model for predicting adherence status. Age, regimen, WHO clinical stage, nutritional status, address status, sex, weight, recent CD4 cell count, viral load and ART dose per day were the most important predictors for adherence status.</p><p><strong>Clinical trial number: </strong>Not applicable.</p>\",\"PeriodicalId\":9340,\"journal\":{\"name\":\"BMC Medical Informatics and Decision Making\",\"volume\":\"25 1\",\"pages\":\"259\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247312/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Informatics and Decision Making\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12911-025-03106-4\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03106-4","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

摘要

背景:坚持抗逆转录病毒治疗(ART)可以降低病毒载量,以及hiv相关的发病率和死亡率。尽管抗逆转录病毒治疗的可获得性有所扩大,但不依从性仍然是一系列问题,导致病毒载量增加、CD4细胞计数下降和耐药性的产生。艾滋病毒护理目前正在使用机器学习算法来早期预测未来的不依从性。然而,就研究人员的知识而言,国内支持这一证据的研究有限。因此,本研究的主要目的是使用机器学习模型预测ART依从性状态,并确定Debre Markos综合专科医院依从性的最重要预测因素。方法:从德布雷马科斯综合专科医院2005 - 2024年ART数据库收集二次资料。数据集分为训练集(80%)和测试集(20%)。为了解决类不平衡问题,对训练数据采用了合成少数派过采样技术(SMOTE)。训练了支持向量机、随机森林、决策树、逻辑回归、梯度增强、k近邻、人工神经网络等7种机器学习算法。使用ROC-AUC、F1评分、准确率、精密度和召回率来评估模型的性能。为了识别重要的预测因子,我们采用了特征重要性技术。结果:4640例接受抗逆转录病毒治疗的患者中,女性占63.56% (n = 2949),平均年龄41.8岁(SD±11.50)。以40 ~ 59岁为主要年龄组(n = 2152), 46.38%和98.1%的患者依从性好,1.9%的患者依从性差。在测试的机器学习模型中,梯度增强算法的表现优于所有其他算法(Accuracy = 0.78, Sensitivity = 0.76, F1score = 0.78, AUC = 0.76)。年龄、治疗方案、世卫组织临床分期、营养状况、住址状况、性别、体重、最近的CD4细胞计数、病毒载量和每天抗逆转录病毒治疗剂量被确定为依从性状况的最重要预测因素。结论:本研究建立了一种预测依循状态的梯度增强模型。年龄、治疗方案、世卫组织临床分期、营养状况、住址状况、性别、体重、最近的CD4细胞计数、病毒载量和每天抗逆转录病毒治疗剂量是依从性状况的最重要预测因素。临床试验号:不适用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting antiretroviral therapy adherence status of adult HIV-positive patients using machine-learning Northwest, Ethiopia, 2025.

Background: Adherence with Anti-Retroviral Therapy (ART) reduces viral load, as well as HIV-related morbidity and mortality. Despite the expanded availability of ART, non-adherence remains a series problem, leads increased viral load, a decline CD4 cell count, and the development of drug resistance. HIV care is currently showing promise with the use of machine learning algorithms for early prediction of future non-adherence. However, as to researcher's Knowledge, there was limited research supporting this evidence in the country. Therefore, the primary aim of this study was to predict ART adherence status using machine learning models and to identify the most important predictors of Adherence at Debre Markos comprehensive specialized hospital.

Methods: Secondary data was collected from ART database of Debre Markos comprehensive specialized hospital, spanning from 2005 to 2024. The dataset was split into training (80%) and testing (20%) sets. To address class imbalance, the Synthetic Minority Oversampling Technique (SMOTE) was applied to the training data. Seven machine learning algorithms: support vector machine, random forest, decision tree, logistic regression, gradient boosting, K-nearest neighbors, and artificial neural network were trained. The model performance was evaluated using ROC-AUC, F1 score, accuracy, precision, and recall. To identify important predictor we employed feature importance technique.

Result: Out of 4640 patients, who were on antiretroviral therapy, 63.56% (n = 2949) were females, with mean age of 41.8 years (SD ± 11.50). The majority age group was between 40 and 59 years (n = 2152) 46.38% and 98.1% of patients had good adherence while 1.9% had poor adherence. Among the machine learning models tested, the gradient boosting algorithm performed better than all other algorithms with (Accuracy = 0.78, Sensitivity = 0.76, F1score = 0.78, AUC = 0.76). Age, regimen, WHO clinical stage, nutritional status, address status, sex, weight, recent CD4 cell count, viral load and ART dose per day were identified as the most important predictors for adherence status.

Conclusion: The study developed a gradient boosting model for predicting adherence status. Age, regimen, WHO clinical stage, nutritional status, address status, sex, weight, recent CD4 cell count, viral load and ART dose per day were the most important predictors for adherence status.

Clinical trial number: Not applicable.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信