Kelemua Aschale Yeneakal, Gizaw Hailiye Teferi, Temesgen T Mihret, Abraham Keffale Mengistu, Sefefe Birhanu Tizie, Maru Meseret Tadele
{"title":"使用机器学习预测成年hiv阳性患者抗逆转录病毒治疗依从性状况,西北,埃塞俄比亚,2025。","authors":"Kelemua Aschale Yeneakal, Gizaw Hailiye Teferi, Temesgen T Mihret, Abraham Keffale Mengistu, Sefefe Birhanu Tizie, Maru Meseret Tadele","doi":"10.1186/s12911-025-03106-4","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Adherence with Anti-Retroviral Therapy (ART) reduces viral load, as well as HIV-related morbidity and mortality. Despite the expanded availability of ART, non-adherence remains a series problem, leads increased viral load, a decline CD4 cell count, and the development of drug resistance. HIV care is currently showing promise with the use of machine learning algorithms for early prediction of future non-adherence. However, as to researcher's Knowledge, there was limited research supporting this evidence in the country. Therefore, the primary aim of this study was to predict ART adherence status using machine learning models and to identify the most important predictors of Adherence at Debre Markos comprehensive specialized hospital.</p><p><strong>Methods: </strong>Secondary data was collected from ART database of Debre Markos comprehensive specialized hospital, spanning from 2005 to 2024. The dataset was split into training (80%) and testing (20%) sets. To address class imbalance, the Synthetic Minority Oversampling Technique (SMOTE) was applied to the training data. Seven machine learning algorithms: support vector machine, random forest, decision tree, logistic regression, gradient boosting, K-nearest neighbors, and artificial neural network were trained. The model performance was evaluated using ROC-AUC, F1 score, accuracy, precision, and recall. To identify important predictor we employed feature importance technique.</p><p><strong>Result: </strong>Out of 4640 patients, who were on antiretroviral therapy, 63.56% (n = 2949) were females, with mean age of 41.8 years (SD ± 11.50). The majority age group was between 40 and 59 years (n = 2152) 46.38% and 98.1% of patients had good adherence while 1.9% had poor adherence. Among the machine learning models tested, the gradient boosting algorithm performed better than all other algorithms with (Accuracy = 0.78, Sensitivity = 0.76, F1score = 0.78, AUC = 0.76). Age, regimen, WHO clinical stage, nutritional status, address status, sex, weight, recent CD4 cell count, viral load and ART dose per day were identified as the most important predictors for adherence status.</p><p><strong>Conclusion: </strong>The study developed a gradient boosting model for predicting adherence status. Age, regimen, WHO clinical stage, nutritional status, address status, sex, weight, recent CD4 cell count, viral load and ART dose per day were the most important predictors for adherence status.</p><p><strong>Clinical trial number: </strong>Not applicable.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"259"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247312/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting antiretroviral therapy adherence status of adult HIV-positive patients using machine-learning Northwest, Ethiopia, 2025.\",\"authors\":\"Kelemua Aschale Yeneakal, Gizaw Hailiye Teferi, Temesgen T Mihret, Abraham Keffale Mengistu, Sefefe Birhanu Tizie, Maru Meseret Tadele\",\"doi\":\"10.1186/s12911-025-03106-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Adherence with Anti-Retroviral Therapy (ART) reduces viral load, as well as HIV-related morbidity and mortality. Despite the expanded availability of ART, non-adherence remains a series problem, leads increased viral load, a decline CD4 cell count, and the development of drug resistance. HIV care is currently showing promise with the use of machine learning algorithms for early prediction of future non-adherence. However, as to researcher's Knowledge, there was limited research supporting this evidence in the country. Therefore, the primary aim of this study was to predict ART adherence status using machine learning models and to identify the most important predictors of Adherence at Debre Markos comprehensive specialized hospital.</p><p><strong>Methods: </strong>Secondary data was collected from ART database of Debre Markos comprehensive specialized hospital, spanning from 2005 to 2024. The dataset was split into training (80%) and testing (20%) sets. To address class imbalance, the Synthetic Minority Oversampling Technique (SMOTE) was applied to the training data. Seven machine learning algorithms: support vector machine, random forest, decision tree, logistic regression, gradient boosting, K-nearest neighbors, and artificial neural network were trained. The model performance was evaluated using ROC-AUC, F1 score, accuracy, precision, and recall. To identify important predictor we employed feature importance technique.</p><p><strong>Result: </strong>Out of 4640 patients, who were on antiretroviral therapy, 63.56% (n = 2949) were females, with mean age of 41.8 years (SD ± 11.50). The majority age group was between 40 and 59 years (n = 2152) 46.38% and 98.1% of patients had good adherence while 1.9% had poor adherence. Among the machine learning models tested, the gradient boosting algorithm performed better than all other algorithms with (Accuracy = 0.78, Sensitivity = 0.76, F1score = 0.78, AUC = 0.76). Age, regimen, WHO clinical stage, nutritional status, address status, sex, weight, recent CD4 cell count, viral load and ART dose per day were identified as the most important predictors for adherence status.</p><p><strong>Conclusion: </strong>The study developed a gradient boosting model for predicting adherence status. Age, regimen, WHO clinical stage, nutritional status, address status, sex, weight, recent CD4 cell count, viral load and ART dose per day were the most important predictors for adherence status.</p><p><strong>Clinical trial number: </strong>Not applicable.</p>\",\"PeriodicalId\":9340,\"journal\":{\"name\":\"BMC Medical Informatics and Decision Making\",\"volume\":\"25 1\",\"pages\":\"259\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12247312/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Informatics and Decision Making\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12911-025-03106-4\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03106-4","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
Predicting antiretroviral therapy adherence status of adult HIV-positive patients using machine-learning Northwest, Ethiopia, 2025.
Background: Adherence with Anti-Retroviral Therapy (ART) reduces viral load, as well as HIV-related morbidity and mortality. Despite the expanded availability of ART, non-adherence remains a series problem, leads increased viral load, a decline CD4 cell count, and the development of drug resistance. HIV care is currently showing promise with the use of machine learning algorithms for early prediction of future non-adherence. However, as to researcher's Knowledge, there was limited research supporting this evidence in the country. Therefore, the primary aim of this study was to predict ART adherence status using machine learning models and to identify the most important predictors of Adherence at Debre Markos comprehensive specialized hospital.
Methods: Secondary data was collected from ART database of Debre Markos comprehensive specialized hospital, spanning from 2005 to 2024. The dataset was split into training (80%) and testing (20%) sets. To address class imbalance, the Synthetic Minority Oversampling Technique (SMOTE) was applied to the training data. Seven machine learning algorithms: support vector machine, random forest, decision tree, logistic regression, gradient boosting, K-nearest neighbors, and artificial neural network were trained. The model performance was evaluated using ROC-AUC, F1 score, accuracy, precision, and recall. To identify important predictor we employed feature importance technique.
Result: Out of 4640 patients, who were on antiretroviral therapy, 63.56% (n = 2949) were females, with mean age of 41.8 years (SD ± 11.50). The majority age group was between 40 and 59 years (n = 2152) 46.38% and 98.1% of patients had good adherence while 1.9% had poor adherence. Among the machine learning models tested, the gradient boosting algorithm performed better than all other algorithms with (Accuracy = 0.78, Sensitivity = 0.76, F1score = 0.78, AUC = 0.76). Age, regimen, WHO clinical stage, nutritional status, address status, sex, weight, recent CD4 cell count, viral load and ART dose per day were identified as the most important predictors for adherence status.
Conclusion: The study developed a gradient boosting model for predicting adherence status. Age, regimen, WHO clinical stage, nutritional status, address status, sex, weight, recent CD4 cell count, viral load and ART dose per day were the most important predictors for adherence status.
期刊介绍:
BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.