Billal M. Obeng , Roger D. Kouyos , Katharina Kusejko , Luisa Salazar-Vizcaya , Huldrych F. Günthard , Anthony D. Kelleher , Francesca Di Giallonardo , the Swiss HIV Cohort Study
{"title":"不同基因组区域和亚型HIV-1传播聚类检测的阈值敏感性分析","authors":"Billal M. Obeng , Roger D. Kouyos , Katharina Kusejko , Luisa Salazar-Vizcaya , Huldrych F. Günthard , Anthony D. Kelleher , Francesca Di Giallonardo , the Swiss HIV Cohort Study","doi":"10.1016/j.virol.2025.110558","DOIUrl":null,"url":null,"abstract":"<div><div>HIV-1 cluster analysis has been widely used in characterizing HIV-1 transmission and some countries have implemented such molecular epidemiology as part of their prevention strategy. However, HIV-1 sequences derive from varying genome regions, which affects phylogenetic clustering outputs. Here, we apply different tools to run a sensitivity analysis for assessing which threshold give the most cohesive clustering outputs for different data sources. We used a dataset of 174 full-length sequences of subtype B from the Swiss HIV Cohort Study and publicly available subtype C from South Africa. Each dataset was divided into sub-genomic sub-datasets covering <em>gag</em>, <em>pol</em>, and <em>en</em>v. <em>pol</em> was further subdivided into regions commonly used in HIV-1 genotyping laboratories (<em>pr-rt</em>, <em>rt-int</em>, and <em>pr-rt-int)</em>. Cluster analyses for each sub-genomic region was performed specifying varying distance thresholds of 0.5 %–4.5 % and tree branch support of 70 %, 90 % and 99 % in ClusterPicker. Tree topologies and clustering outputs were compared against each other to assess cluster similarity. Pylogenies using <em>pol</em>, <em>pr-rt-int</em>, or <em>rt-int</em> had more robust tree topologies compared to <em>gag</em> and <em>env</em>. Cluster composition changed with increasing genetic distance threshold but was not affected by branch support. Cluster identity was most similar around genetic distances of 2.5 (±0.5)% for all sub-genomic regions and for both subtype B and C. Our study demonstrated the value of performing a sensitivity analysis before setting a genetic distance threshold for clustering output and that the <em>pol</em> region is appropriate for clustering outputs and can be used for near real-time HIV-1 cluster detection.</div></div>","PeriodicalId":23666,"journal":{"name":"Virology","volume":"608 ","pages":"Article 110558"},"PeriodicalIF":2.8000,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Threshold sensitivity analysis for HIV-1 transmission cluster detection using different genomic regions and subtypes\",\"authors\":\"Billal M. Obeng , Roger D. Kouyos , Katharina Kusejko , Luisa Salazar-Vizcaya , Huldrych F. Günthard , Anthony D. Kelleher , Francesca Di Giallonardo , the Swiss HIV Cohort Study\",\"doi\":\"10.1016/j.virol.2025.110558\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>HIV-1 cluster analysis has been widely used in characterizing HIV-1 transmission and some countries have implemented such molecular epidemiology as part of their prevention strategy. However, HIV-1 sequences derive from varying genome regions, which affects phylogenetic clustering outputs. Here, we apply different tools to run a sensitivity analysis for assessing which threshold give the most cohesive clustering outputs for different data sources. We used a dataset of 174 full-length sequences of subtype B from the Swiss HIV Cohort Study and publicly available subtype C from South Africa. Each dataset was divided into sub-genomic sub-datasets covering <em>gag</em>, <em>pol</em>, and <em>en</em>v. <em>pol</em> was further subdivided into regions commonly used in HIV-1 genotyping laboratories (<em>pr-rt</em>, <em>rt-int</em>, and <em>pr-rt-int)</em>. Cluster analyses for each sub-genomic region was performed specifying varying distance thresholds of 0.5 %–4.5 % and tree branch support of 70 %, 90 % and 99 % in ClusterPicker. Tree topologies and clustering outputs were compared against each other to assess cluster similarity. Pylogenies using <em>pol</em>, <em>pr-rt-int</em>, or <em>rt-int</em> had more robust tree topologies compared to <em>gag</em> and <em>env</em>. Cluster composition changed with increasing genetic distance threshold but was not affected by branch support. Cluster identity was most similar around genetic distances of 2.5 (±0.5)% for all sub-genomic regions and for both subtype B and C. Our study demonstrated the value of performing a sensitivity analysis before setting a genetic distance threshold for clustering output and that the <em>pol</em> region is appropriate for clustering outputs and can be used for near real-time HIV-1 cluster detection.</div></div>\",\"PeriodicalId\":23666,\"journal\":{\"name\":\"Virology\",\"volume\":\"608 \",\"pages\":\"Article 110558\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-04-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Virology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0042682225001710\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"VIROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0042682225001710","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"VIROLOGY","Score":null,"Total":0}
Threshold sensitivity analysis for HIV-1 transmission cluster detection using different genomic regions and subtypes
HIV-1 cluster analysis has been widely used in characterizing HIV-1 transmission and some countries have implemented such molecular epidemiology as part of their prevention strategy. However, HIV-1 sequences derive from varying genome regions, which affects phylogenetic clustering outputs. Here, we apply different tools to run a sensitivity analysis for assessing which threshold give the most cohesive clustering outputs for different data sources. We used a dataset of 174 full-length sequences of subtype B from the Swiss HIV Cohort Study and publicly available subtype C from South Africa. Each dataset was divided into sub-genomic sub-datasets covering gag, pol, and env. pol was further subdivided into regions commonly used in HIV-1 genotyping laboratories (pr-rt, rt-int, and pr-rt-int). Cluster analyses for each sub-genomic region was performed specifying varying distance thresholds of 0.5 %–4.5 % and tree branch support of 70 %, 90 % and 99 % in ClusterPicker. Tree topologies and clustering outputs were compared against each other to assess cluster similarity. Pylogenies using pol, pr-rt-int, or rt-int had more robust tree topologies compared to gag and env. Cluster composition changed with increasing genetic distance threshold but was not affected by branch support. Cluster identity was most similar around genetic distances of 2.5 (±0.5)% for all sub-genomic regions and for both subtype B and C. Our study demonstrated the value of performing a sensitivity analysis before setting a genetic distance threshold for clustering output and that the pol region is appropriate for clustering outputs and can be used for near real-time HIV-1 cluster detection.
期刊介绍:
Launched in 1955, Virology is a broad and inclusive journal that welcomes submissions on all aspects of virology including plant, animal, microbial and human viruses. The journal publishes basic research as well as pre-clinical and clinical studies of vaccines, anti-viral drugs and their development, anti-viral therapies, and computational studies of virus infections. Any submission that is of broad interest to the community of virologists/vaccinologists and reporting scientifically accurate and valuable research will be considered for publication, including negative findings and multidisciplinary work.Virology is open to reviews, research manuscripts, short communication, registered reports as well as follow-up manuscripts.