Phylogeogenomic analysis of the earliest reported sequences of SARS-CoV-2 from 161 countries

IF 2.2 4区 医学 Q4 IMMUNOLOGY
Apmis Pub Date : 2024-11-19 DOI:10.1111/apm.13499
Rezwanuzzaman Laskar, Mehboob Hoque, Safdar Ali
{"title":"Phylogeogenomic analysis of the earliest reported sequences of SARS-CoV-2 from 161 countries","authors":"Rezwanuzzaman Laskar,&nbsp;Mehboob Hoque,&nbsp;Safdar Ali","doi":"10.1111/apm.13499","DOIUrl":null,"url":null,"abstract":"<p>The SARS-CoV-2 is the causative agent of COVID-19 whose evolutionary path with geographical context forms the focus of present study. The first reported sequence from each of the 161 countries was downloaded from the GISAID database. Multiple sequence alignment was performed using MAFFT v.7, and a TCS-based network was constructed using PopART v.1.7. A total of 27 proteins were analyzed including structural and non-structural proteins. NSP3 and NSP12, responsible for viral replication and RNA synthesis, respectively, had the highest mutation incidence and frequency among non-structural proteins. The spike (S) protein, critical for viral attachment and entry, had the highest prevalence and frequency of mutations. ORF3a had the highest mutation incidence and frequency among accessory proteins. The phylogeogenomic network identified six haplogroups containing 35 sequences, while the remaining sequences belonged to different haplotypes. The virus's genetic distinctiveness was higher in European genomes, with four haplogroups dominated by Europe-linked sequences. The triangular-shaped pattern observed in the virus's evolutionary path suggests that it spread to different continents from Asia. Multiple transmission pathways connecting different countries affirm the virus's ability to emerge in multiple countries by early 2020. The possibility of new species emergence through “saltation” due to the pandemic is also discussed.</p>","PeriodicalId":8167,"journal":{"name":"Apmis","volume":"133 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Apmis","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/apm.13499","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The SARS-CoV-2 is the causative agent of COVID-19 whose evolutionary path with geographical context forms the focus of present study. The first reported sequence from each of the 161 countries was downloaded from the GISAID database. Multiple sequence alignment was performed using MAFFT v.7, and a TCS-based network was constructed using PopART v.1.7. A total of 27 proteins were analyzed including structural and non-structural proteins. NSP3 and NSP12, responsible for viral replication and RNA synthesis, respectively, had the highest mutation incidence and frequency among non-structural proteins. The spike (S) protein, critical for viral attachment and entry, had the highest prevalence and frequency of mutations. ORF3a had the highest mutation incidence and frequency among accessory proteins. The phylogeogenomic network identified six haplogroups containing 35 sequences, while the remaining sequences belonged to different haplotypes. The virus's genetic distinctiveness was higher in European genomes, with four haplogroups dominated by Europe-linked sequences. The triangular-shaped pattern observed in the virus's evolutionary path suggests that it spread to different continents from Asia. Multiple transmission pathways connecting different countries affirm the virus's ability to emerge in multiple countries by early 2020. The possibility of new species emergence through “saltation” due to the pandemic is also discussed.

对 161 个国家最早报告的 SARS-CoV-2 序列进行系统发生学分析。
SARS-CoV-2 是 COVID-19 的病原体,其进化路径与地理背景构成了本研究的重点。从 GISAID 数据库中下载了 161 个国家中每个国家首次报告的序列。使用 MAFFT v.7 进行了多重序列比对,并使用 PopART v.1.7 构建了基于 TCS 的网络。共分析了 27 种蛋白质,包括结构蛋白和非结构蛋白。在非结构蛋白中,分别负责病毒复制和 RNA 合成的 NSP3 和 NSP12 的突变发生率和频率最高。对病毒附着和进入至关重要的尖峰(S)蛋白的突变发生率和频率最高。在附属蛋白中,ORF3a的突变发生率和频率最高。系统发生组学网络确定了包含 35 个序列的 6 个单倍群,其余序列属于不同的单倍型。该病毒在欧洲基因组中的遗传独特性较高,有四个单倍群以与欧洲相关的序列为主。在病毒进化路径中观察到的三角形模式表明,它从亚洲传播到了不同的大陆。连接不同国家的多种传播途径肯定了病毒到 2020 年初在多个国家出现的能力。此外,还讨论了由于大流行而通过 "盐化 "出现新物种的可能性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Apmis
Apmis 医学-病理学
CiteScore
5.20
自引率
0.00%
发文量
91
审稿时长
2 months
期刊介绍: APMIS, formerly Acta Pathologica, Microbiologica et Immunologica Scandinavica, has been published since 1924 by the Scandinavian Societies for Medical Microbiology and Pathology as a non-profit-making scientific journal.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信