RNA barcode segments for SARS-CoV-2 identification from HCoVs and SARSr-CoV-2 lineages

IF 5.5 3区 医学 Q1 Medicine
Changqiao You , Shuai Jiang , Yunyun Ding , Shunxing Ye , Xiaoxiao Zou , Hongming Zhang , Zeqi Li , Fenglin Chen , Yongliang Li , Xingyi Ge , Xinhong Guo
{"title":"RNA barcode segments for SARS-CoV-2 identification from HCoVs and SARSr-CoV-2 lineages","authors":"Changqiao You ,&nbsp;Shuai Jiang ,&nbsp;Yunyun Ding ,&nbsp;Shunxing Ye ,&nbsp;Xiaoxiao Zou ,&nbsp;Hongming Zhang ,&nbsp;Zeqi Li ,&nbsp;Fenglin Chen ,&nbsp;Yongliang Li ,&nbsp;Xingyi Ge ,&nbsp;Xinhong Guo","doi":"10.1016/j.virs.2024.01.006","DOIUrl":null,"url":null,"abstract":"<div><p>Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the pathogen responsible for coronavirus disease 2019 (COVID-19), continues to evolve, giving rise to more variants and global reinfections. Previous research has demonstrated that barcode segments can effectively and cost-efficiently identify specific species within closely related populations. In this study, we designed and tested RNA barcode segments based on genetic evolutionary relationships to facilitate the efficient and accurate identification of SARS-CoV-2 from extensive virus samples, including human coronaviruses (HCoVs) and SARSr-CoV-2 lineages. Nucleotide sequences sourced from NCBI and GISAID were meticulously selected and curated to construct training sets, encompassing 1733 complete genome sequences of HCoVs and SARSr-CoV-2 lineages. Through genetic-level species testing, we validated the accuracy and reliability of the barcode segments for identifying SARS-CoV-2. Subsequently, 75 main and subordinate species-specific barcode segments for SARS-CoV-2, located in <em>ORF1ab</em>, <em>S</em>, <em>E</em>, <em>ORF7a</em>, and <em>N</em> coding sequences, were intercepted and screened based on single-nucleotide polymorphism sites and weighted scores. Post-testing, these segments exhibited high recall rates (nearly 100%), specificity (almost 30% at the nucleotide level), and precision (100%) performance on identification. They were eventually visualized using one and two-dimensional combined barcodes and deposited in an online database (<span>http://virusbarcodedatabase.top/</span><svg><path></path></svg>). The successful integration of barcoding technology in SARS-CoV-2 identification provides valuable insights for future studies involving complete genome sequence polymorphism analysis. Moreover, this cost-effective and efficient identification approach also provides valuable reference for future research endeavors related to virus surveillance.</p></div>","PeriodicalId":23654,"journal":{"name":"Virologica Sinica","volume":null,"pages":null},"PeriodicalIF":5.5000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S1995820X24000063/pdfft?md5=d17814d2aee23bb4d075d608e8b938aa&pid=1-s2.0-S1995820X24000063-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virologica Sinica","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1995820X24000063","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the pathogen responsible for coronavirus disease 2019 (COVID-19), continues to evolve, giving rise to more variants and global reinfections. Previous research has demonstrated that barcode segments can effectively and cost-efficiently identify specific species within closely related populations. In this study, we designed and tested RNA barcode segments based on genetic evolutionary relationships to facilitate the efficient and accurate identification of SARS-CoV-2 from extensive virus samples, including human coronaviruses (HCoVs) and SARSr-CoV-2 lineages. Nucleotide sequences sourced from NCBI and GISAID were meticulously selected and curated to construct training sets, encompassing 1733 complete genome sequences of HCoVs and SARSr-CoV-2 lineages. Through genetic-level species testing, we validated the accuracy and reliability of the barcode segments for identifying SARS-CoV-2. Subsequently, 75 main and subordinate species-specific barcode segments for SARS-CoV-2, located in ORF1ab, S, E, ORF7a, and N coding sequences, were intercepted and screened based on single-nucleotide polymorphism sites and weighted scores. Post-testing, these segments exhibited high recall rates (nearly 100%), specificity (almost 30% at the nucleotide level), and precision (100%) performance on identification. They were eventually visualized using one and two-dimensional combined barcodes and deposited in an online database (http://virusbarcodedatabase.top/). The successful integration of barcoding technology in SARS-CoV-2 identification provides valuable insights for future studies involving complete genome sequence polymorphism analysis. Moreover, this cost-effective and efficient identification approach also provides valuable reference for future research endeavors related to virus surveillance.

基于完整基因组序列遗传测试的可公开获取的 RNA 条形码片段,用于从 HCoV 和 SARSr-CoV-2 系中识别 SARS-CoV-2
严重急性呼吸系统综合征冠状病毒 2(SARS-CoV-2)是导致 2019 年冠状病毒病(COVID-19)的病原体,它不断进化,产生了更多的变种和全球再感染。以往的研究表明,条形码片段可以有效而经济地识别密切相关种群中的特定物种。在本研究中,我们设计并测试了基于遗传进化关系的 RNA 条形码片段,以便从广泛的病毒样本(包括人类冠状病毒 (HCoV) 和 SARSr-CoV-2 系)中高效、准确地识别 SARS-CoV-2 。我们对来自 NCBI 和 GISAID 的核苷酸序列进行了精心挑选和整理,以构建训练集,其中包括 1,733 个 HCoV 和 SARSr-CoV-2 世系的完整基因组序列。通过基因水平的物种测试,我们验证了条形码片段识别 SARS-CoV-2 的准确性和可靠性。随后,我们根据单核苷酸多态性位点和加权分数,截取并筛选了 75 个 SARS-CoV-2 的主要和次要物种特异性条形码片段,它们分别位于 ORF1ab、S、E、ORF7a 和 N 编码序列中。经过测试,这些片段在识别方面表现出较高的召回率(接近 100%)、特异性(核苷酸水平接近 30%)和精确度(100%)。这些片段最终通过一维和二维组合条形码实现可视化,并存入在线数据库 (http://virusbarcodedatabase.top/)。条形码技术在 SARS-CoV-2 鉴定中的成功应用为今后涉及全基因组序列多态性分析的研究提供了宝贵的启示。此外,这种经济高效的鉴定方法也为今后的病毒监测研究工作提供了宝贵的参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Virologica Sinica
Virologica Sinica Biochemistry, Genetics and Molecular Biology-Molecular Medicine
CiteScore
7.70
自引率
1.80%
发文量
3149
期刊介绍: Virologica Sinica is an international journal which aims at presenting the cutting-edge research on viruses all over the world. The journal publishes peer-reviewed original research articles, reviews, and letters to the editor, to encompass the latest developments in all branches of virology, including research on animal, plant and microbe viruses. The journal welcomes articles on virus discovery and characterization, viral epidemiology, viral pathogenesis, virus-host interaction, vaccine development, antiviral agents and therapies, and virus related bio-techniques. Virologica Sinica, the official journal of Chinese Society for Microbiology, will serve as a platform for the communication and exchange of academic information and ideas in an international context. Electronic ISSN: 1995-820X; Print ISSN: 1674-0769
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信