Optimization of genetic distance threshold for inferring the CRF01_AE molecular network based on next-generation sequencing

Lijuan Hu, Bin Zhao, Mingchen Liu, Yang Gao, Haibo Ding, Qinghai Hu, Ming-hui An, Hong Shang, Xiaoxu Han
{"title":"Optimization of genetic distance threshold for inferring the CRF01_AE molecular network based on next-generation sequencing","authors":"Lijuan Hu, Bin Zhao, Mingchen Liu, Yang Gao, Haibo Ding, Qinghai Hu, Ming-hui An, Hong Shang, Xiaoxu Han","doi":"10.3389/fcimb.2024.1388059","DOIUrl":null,"url":null,"abstract":"HIV molecular network based on genetic distance (GD) has been extensively utilized. However, the GD threshold for the non-B subtype differs from that of subtype B. This study aimed to optimize the GD threshold for inferring the CRF01_AE molecular network.Next-generation sequencing data of partial CRF01_AE pol sequences were obtained for 59 samples from 12 transmission pairs enrolled from a high-risk cohort during 2009 and 2014. The paired GD was calculated using the Tamura-Nei 93 model to infer a GD threshold range for HIV molecular networks.2,019 CRF01_AE pol sequences and information on recent HIV infection (RHI) from newly diagnosed individuals in Shenyang from 2016 to 2019 were collected to construct molecular networks to assess the ability of the inferred GD thresholds to predict recent transmission events. When HIV transmission occurs within a span of 1-4 years, the mean paired GD between the sequences of the donor and recipient within the same transmission pair were as follow: 0.008, 0.011, 0.013, and 0.023 substitutions/site. Using these four GD thresholds, it was found that 98.9%, 96.0%, 88.2%, and 40.4% of all randomly paired GD values from 12 transmission pairs were correctly identified as originating from the same transmission pairs. In the real world, as the GD threshold increased from 0.001 to 0.02 substitutions/site, the proportion of RHI within the molecular network gradually increased from 16.6% to 92.3%. Meanwhile, the proportion of links with RHI gradually decreased from 87.0% to 48.2%. The two curves intersected at a GD of 0.008 substitutions/site.A suitable range of GD thresholds, 0.008-0.013 substitutions/site, was identified to infer the CRF01_AE molecular transmission network and identify HIV transmission events that occurred within the past three years. This finding provides valuable data for selecting an appropriate GD thresholds in constructing molecular networks for non-B subtypes.","PeriodicalId":505894,"journal":{"name":"Frontiers in Cellular and Infection Microbiology","volume":"54 20","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Cellular and Infection Microbiology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fcimb.2024.1388059","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

HIV molecular network based on genetic distance (GD) has been extensively utilized. However, the GD threshold for the non-B subtype differs from that of subtype B. This study aimed to optimize the GD threshold for inferring the CRF01_AE molecular network.Next-generation sequencing data of partial CRF01_AE pol sequences were obtained for 59 samples from 12 transmission pairs enrolled from a high-risk cohort during 2009 and 2014. The paired GD was calculated using the Tamura-Nei 93 model to infer a GD threshold range for HIV molecular networks.2,019 CRF01_AE pol sequences and information on recent HIV infection (RHI) from newly diagnosed individuals in Shenyang from 2016 to 2019 were collected to construct molecular networks to assess the ability of the inferred GD thresholds to predict recent transmission events. When HIV transmission occurs within a span of 1-4 years, the mean paired GD between the sequences of the donor and recipient within the same transmission pair were as follow: 0.008, 0.011, 0.013, and 0.023 substitutions/site. Using these four GD thresholds, it was found that 98.9%, 96.0%, 88.2%, and 40.4% of all randomly paired GD values from 12 transmission pairs were correctly identified as originating from the same transmission pairs. In the real world, as the GD threshold increased from 0.001 to 0.02 substitutions/site, the proportion of RHI within the molecular network gradually increased from 16.6% to 92.3%. Meanwhile, the proportion of links with RHI gradually decreased from 87.0% to 48.2%. The two curves intersected at a GD of 0.008 substitutions/site.A suitable range of GD thresholds, 0.008-0.013 substitutions/site, was identified to infer the CRF01_AE molecular transmission network and identify HIV transmission events that occurred within the past three years. This finding provides valuable data for selecting an appropriate GD thresholds in constructing molecular networks for non-B subtypes.
优化基于新一代测序推断 CRF01_AE 分子网络的遗传距离阈值
基于遗传距离(GD)的HIV分子网络已被广泛应用。本研究旨在优化推断 CRF01_AE 分子网络的 GD 阈值。研究人员从 2009 年至 2014 年期间从一个高危队列中登记的 12 对传播者的 59 个样本中获得了 CRF01_AE pol 部分序列的下一代测序数据。2,019 份 CRF01_AE pol 序列和 2016 年至 2019 年沈阳新诊断个体的近期 HIV 感染(RHI)信息被收集起来构建分子网络,以评估推断出的 GD 阈值预测近期传播事件的能力。当HIV传播发生在1-4年内时,同一传播对中供体和受体序列之间的平均配对GD如下:0.008、0.011、0.013 和 0.023 个取代/位点。使用这四个 GD 阈值后发现,在 12 个传播配对中随机配对的所有 GD 值中,分别有 98.9%、96.0%、88.2% 和 40.4%被正确识别为来自同一传播配对。在现实世界中,随着 GD 阈值从 0.001 提高到 0.02 个取代/位点,分子网络中 RHI 的比例从 16.6% 逐步提高到 92.3%。同时,含有 RHI 的链接比例从 87.0% 逐渐下降到 48.2%。两条曲线在 0.008 个取代/位点的 GD 值处相交。我们确定了一个合适的 GD 阈值范围(0.008-0.013 个取代/位点),以推断 CRF01_AE 分子传播网络并识别过去三年内发生的 HIV 传播事件。这一发现为构建非B亚型的分子网络时选择合适的GD阈值提供了宝贵的数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信