Threshold sensitivity analysis for HIV-1 transmission cluster detection using different genomic regions and subtypes

IF 2.8 3区 医学 Q3 VIROLOGY
Billal M. Obeng , Roger D. Kouyos , Katharina Kusejko , Luisa Salazar-Vizcaya , Huldrych F. Günthard , Anthony D. Kelleher , Francesca Di Giallonardo , the Swiss HIV Cohort Study
{"title":"Threshold sensitivity analysis for HIV-1 transmission cluster detection using different genomic regions and subtypes","authors":"Billal M. Obeng ,&nbsp;Roger D. Kouyos ,&nbsp;Katharina Kusejko ,&nbsp;Luisa Salazar-Vizcaya ,&nbsp;Huldrych F. Günthard ,&nbsp;Anthony D. Kelleher ,&nbsp;Francesca Di Giallonardo ,&nbsp;the Swiss HIV Cohort Study","doi":"10.1016/j.virol.2025.110558","DOIUrl":null,"url":null,"abstract":"<div><div>HIV-1 cluster analysis has been widely used in characterizing HIV-1 transmission and some countries have implemented such molecular epidemiology as part of their prevention strategy. However, HIV-1 sequences derive from varying genome regions, which affects phylogenetic clustering outputs. Here, we apply different tools to run a sensitivity analysis for assessing which threshold give the most cohesive clustering outputs for different data sources. We used a dataset of 174 full-length sequences of subtype B from the Swiss HIV Cohort Study and publicly available subtype C from South Africa. Each dataset was divided into sub-genomic sub-datasets covering <em>gag</em>, <em>pol</em>, and <em>en</em>v. <em>pol</em> was further subdivided into regions commonly used in HIV-1 genotyping laboratories (<em>pr-rt</em>, <em>rt-int</em>, and <em>pr-rt-int)</em>. Cluster analyses for each sub-genomic region was performed specifying varying distance thresholds of 0.5 %–4.5 % and tree branch support of 70 %, 90 % and 99 % in ClusterPicker. Tree topologies and clustering outputs were compared against each other to assess cluster similarity. Pylogenies using <em>pol</em>, <em>pr-rt-int</em>, or <em>rt-int</em> had more robust tree topologies compared to <em>gag</em> and <em>env</em>. Cluster composition changed with increasing genetic distance threshold but was not affected by branch support. Cluster identity was most similar around genetic distances of 2.5 (±0.5)% for all sub-genomic regions and for both subtype B and C. Our study demonstrated the value of performing a sensitivity analysis before setting a genetic distance threshold for clustering output and that the <em>pol</em> region is appropriate for clustering outputs and can be used for near real-time HIV-1 cluster detection.</div></div>","PeriodicalId":23666,"journal":{"name":"Virology","volume":"608 ","pages":"Article 110558"},"PeriodicalIF":2.8000,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0042682225001710","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"VIROLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

HIV-1 cluster analysis has been widely used in characterizing HIV-1 transmission and some countries have implemented such molecular epidemiology as part of their prevention strategy. However, HIV-1 sequences derive from varying genome regions, which affects phylogenetic clustering outputs. Here, we apply different tools to run a sensitivity analysis for assessing which threshold give the most cohesive clustering outputs for different data sources. We used a dataset of 174 full-length sequences of subtype B from the Swiss HIV Cohort Study and publicly available subtype C from South Africa. Each dataset was divided into sub-genomic sub-datasets covering gag, pol, and env. pol was further subdivided into regions commonly used in HIV-1 genotyping laboratories (pr-rt, rt-int, and pr-rt-int). Cluster analyses for each sub-genomic region was performed specifying varying distance thresholds of 0.5 %–4.5 % and tree branch support of 70 %, 90 % and 99 % in ClusterPicker. Tree topologies and clustering outputs were compared against each other to assess cluster similarity. Pylogenies using pol, pr-rt-int, or rt-int had more robust tree topologies compared to gag and env. Cluster composition changed with increasing genetic distance threshold but was not affected by branch support. Cluster identity was most similar around genetic distances of 2.5 (±0.5)% for all sub-genomic regions and for both subtype B and C. Our study demonstrated the value of performing a sensitivity analysis before setting a genetic distance threshold for clustering output and that the pol region is appropriate for clustering outputs and can be used for near real-time HIV-1 cluster detection.
不同基因组区域和亚型HIV-1传播聚类检测的阈值敏感性分析
HIV-1聚类分析已被广泛用于表征HIV-1传播,一些国家已将这种分子流行病学作为其预防战略的一部分。然而,HIV-1序列来自不同的基因组区域,这影响了系统发育聚类的输出。在这里,我们应用不同的工具来运行灵敏度分析,以评估哪个阈值为不同的数据源提供最具凝聚力的聚类输出。我们使用了来自瑞士HIV队列研究的174个B亚型全长序列和来自南非公开的C亚型序列的数据集。每个数据集被分成亚基因组子数据集,包括gag、pol和env。pol进一步细分为HIV-1基因分型实验室常用的区域(pr-rt、rt-int和pr-rt-int)。对每个亚基因组区域进行聚类分析,在ClusterPicker中指定不同的距离阈值为0.5% - 4.5%,树枝支持度为70%,90%和99%。树拓扑和聚类输出相互比较,以评估聚类相似性。与gag和env相比,使用pol、pr-rt-int或rt-int的Pylogenies具有更健壮的树拓扑结构。聚类组成随遗传距离阈值的增加而变化,但不受枝支持的影响。所有亚基因组区域以及B和c亚型的聚类身份在遗传距离为2.5(±0.5)%时最为相似。我们的研究证明了在为聚类输出设置遗传距离阈值之前进行敏感性分析的价值,并且pol区域适合聚类输出,可用于近实时的HIV-1聚类检测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Virology
Virology 医学-病毒学
CiteScore
6.00
自引率
0.00%
发文量
157
审稿时长
50 days
期刊介绍: Launched in 1955, Virology is a broad and inclusive journal that welcomes submissions on all aspects of virology including plant, animal, microbial and human viruses. The journal publishes basic research as well as pre-clinical and clinical studies of vaccines, anti-viral drugs and their development, anti-viral therapies, and computational studies of virus infections. Any submission that is of broad interest to the community of virologists/vaccinologists and reporting scientifically accurate and valuable research will be considered for publication, including negative findings and multidisciplinary work.Virology is open to reviews, research manuscripts, short communication, registered reports as well as follow-up manuscripts.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信