Evan Yu, Jingcheng Du, Yang Xiang, Xinyue Hu, Jingna Feng, Xi Luo, John A Schneider, Degui Zhi, Kayo Fujimoto, Cui Tao
{"title":"Explainable artificial intelligence and domain adaptation for predicting HIV infection with graph neural networks.","authors":"Evan Yu, Jingcheng Du, Yang Xiang, Xinyue Hu, Jingna Feng, Xi Luo, John A Schneider, Degui Zhi, Kayo Fujimoto, Cui Tao","doi":"10.1080/07853890.2024.2407063","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Investigation of explainable deep learning methods for graph neural networks to predict HIV infections with social network information and performing domain adaptation to evaluate model transferability across different datasets.</p><p><strong>Methods: </strong>Network data from two cohorts of younger sexual minority men (SMM) from two U.S. cities (Chicago, IL, and Houston, TX) were collected between 2014 and 2016. Feature importance from graph attention network (GAT) models were determined using GNNExplainer. Domain adaptation was performed to examine model transferability from one city dataset to the other dataset, training with 100% of the source dataset with 30% of the target dataset and prediction on the remaining 70% from the target dataset.</p><p><strong>Results: </strong>Domain adaptation showed the ability of GAT to improve prediction over training with single city datasets. Feature importance analysis with GAT models in single city training indicated similar features across different cities, reinforcing potential application of GAT models in predicting HIV infections through domain adaptation.</p><p><strong>Conclusion: </strong>GAT models can be used to address the data sparsity issue in HIV study populations. They are powerful tools for predicting individual risk of HIV that can be further explored for better understanding of HIV transmission.</p>","PeriodicalId":93874,"journal":{"name":"Annals of medicine","volume":"56 1","pages":"2407063"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11488171/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of medicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/07853890.2024.2407063","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/17 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: Investigation of explainable deep learning methods for graph neural networks to predict HIV infections with social network information and performing domain adaptation to evaluate model transferability across different datasets.
Methods: Network data from two cohorts of younger sexual minority men (SMM) from two U.S. cities (Chicago, IL, and Houston, TX) were collected between 2014 and 2016. Feature importance from graph attention network (GAT) models were determined using GNNExplainer. Domain adaptation was performed to examine model transferability from one city dataset to the other dataset, training with 100% of the source dataset with 30% of the target dataset and prediction on the remaining 70% from the target dataset.
Results: Domain adaptation showed the ability of GAT to improve prediction over training with single city datasets. Feature importance analysis with GAT models in single city training indicated similar features across different cities, reinforcing potential application of GAT models in predicting HIV infections through domain adaptation.
Conclusion: GAT models can be used to address the data sparsity issue in HIV study populations. They are powerful tools for predicting individual risk of HIV that can be further explored for better understanding of HIV transmission.
目的研究图神经网络的可解释深度学习方法,利用社交网络信息预测艾滋病感染情况,并进行领域适应性调整,以评估模型在不同数据集之间的可转移性:在 2014 年至 2016 年期间,收集了来自美国两个城市(伊利诺伊州芝加哥市和德克萨斯州休斯敦市)的两组年轻性少数群体男性(SMM)的网络数据。使用 GNNExplainer 确定了图注意网络(GAT)模型的特征重要性。为了检验模型从一个城市数据集到另一个数据集的可移植性,我们进行了领域适应,用 100% 的源数据集和 30% 的目标数据集进行训练,并用目标数据集剩余的 70% 进行预测:结果表明,与使用单一城市数据集进行训练相比,领域适应显示了 GAT 改进预测的能力。在单一城市训练中使用 GAT 模型进行的特征重要性分析表明,不同城市的特征相似,这加强了 GAT 模型通过领域适应在预测艾滋病感染中的潜在应用:结论:GAT 模型可用于解决艾滋病研究人群中的数据稀缺问题。结论:GAT 模型可用于解决 HIV 研究人群中数据稀少的问题,是预测个人感染 HIV 风险的有力工具,可进一步加以探索,以更好地了解 HIV 传播情况。