{"title":"Using machine learning techniques to predict viral suppression among people with HIV.","authors":"Xueying Yang, Ruilie Cai, Yunqing Ma, Hao H Zhang, XiaoWen Sun, Bankole Olatosi, Sharon Weissman, Xiaoming Li, Jiajia Zhang","doi":"10.1097/QAI.0000000000003561","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>This study aims to develop and examine the performance of machine learning (ML) algorithms in predicting viral suppression among statewide people living with HIV (PWH) in South Carolina (SC).</p><p><strong>Methods: </strong>Extracted through the electronic reporting system in SC, the study population was adult PWH who were diagnosed between 2005-2021. Viral suppression was defined as viral load <200 copies/ml. The predictors, includingsocio-demographics, a historical information of viral load indicators (e.g., viral rebound), comorbidities, healthcare utilization, and annual county-level factors (e.g., social vulnerability) were measured in each 4-month windows. Using historic information in different lag time windows (1-, 3- or 5-lagged time windows with each 4-month as a unit), both traditional and ML approaches (e.g., Long Short-Term Memory network [LSTM]) were applied to predict viral suppression. Comparisons of prediction performance between different models were assessed by area under curve (AUC), recall, precision, F1 score, and Youden index.</p><p><strong>Results: </strong>Machine learning approaches outperformed the generalized linear mixed model. In all the three lagged analysis of a total of 15,580 PWH, the LSTM (lag 1: AUC=0.858; lag 3: AUC=0.877; lag 5: AUC=0.881) algorithm outperformed all the other methods in terms of AUC performance for predicting viral suppression. The top-ranking predictors that were common in different models included historical information of viral suppression, viral rebound, and viral blips in the Lag-1 time window. Inclusion of county level variables did not improve the model prediction accuracy.</p><p><strong>Conclusion: </strong>Supervised machine learning algorithms may offer better performance for risk prediction of viral suppression than traditional statistical methods.</p>","PeriodicalId":14588,"journal":{"name":"JAIDS Journal of Acquired Immune Deficiency Syndromes","volume":" ","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JAIDS Journal of Acquired Immune Deficiency Syndromes","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/QAI.0000000000003561","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: This study aims to develop and examine the performance of machine learning (ML) algorithms in predicting viral suppression among statewide people living with HIV (PWH) in South Carolina (SC).
Methods: Extracted through the electronic reporting system in SC, the study population was adult PWH who were diagnosed between 2005-2021. Viral suppression was defined as viral load <200 copies/ml. The predictors, includingsocio-demographics, a historical information of viral load indicators (e.g., viral rebound), comorbidities, healthcare utilization, and annual county-level factors (e.g., social vulnerability) were measured in each 4-month windows. Using historic information in different lag time windows (1-, 3- or 5-lagged time windows with each 4-month as a unit), both traditional and ML approaches (e.g., Long Short-Term Memory network [LSTM]) were applied to predict viral suppression. Comparisons of prediction performance between different models were assessed by area under curve (AUC), recall, precision, F1 score, and Youden index.
Results: Machine learning approaches outperformed the generalized linear mixed model. In all the three lagged analysis of a total of 15,580 PWH, the LSTM (lag 1: AUC=0.858; lag 3: AUC=0.877; lag 5: AUC=0.881) algorithm outperformed all the other methods in terms of AUC performance for predicting viral suppression. The top-ranking predictors that were common in different models included historical information of viral suppression, viral rebound, and viral blips in the Lag-1 time window. Inclusion of county level variables did not improve the model prediction accuracy.
Conclusion: Supervised machine learning algorithms may offer better performance for risk prediction of viral suppression than traditional statistical methods.
期刊介绍:
JAIDS: Journal of Acquired Immune Deficiency Syndromes seeks to end the HIV epidemic by presenting important new science across all disciplines that advance our understanding of the biology, treatment and prevention of HIV infection worldwide.
JAIDS: Journal of Acquired Immune Deficiency Syndromes is the trusted, interdisciplinary resource for HIV- and AIDS-related information with a strong focus on basic and translational science, clinical science, and epidemiology and prevention. Co-edited by the foremost leaders in clinical virology, molecular biology, and epidemiology, JAIDS publishes vital information on the advances in diagnosis and treatment of HIV infections, as well as the latest research in the development of therapeutics and vaccine approaches. This ground-breaking journal brings together rigorously peer-reviewed articles, reviews of current research, results of clinical trials, and epidemiologic reports from around the world.