Mutational Patterns Observed in SARS-CoV-2 Genomes Sampled From Successive Epochs Delimited by Major Public Health Events in Ontario, Canada: Genomic Surveillance Study.

David Chen, Gurjit S Randhawa, Maximillian PM Soltysiak, Camila PE de Souza, Lila Kari, Shiva M Singh, Kathleen A Hill
JMIR Bioinformatics and Biotechnology. Published 2022-12-22. DOI: 10.2196/42243 (https://doi.org/10.2196/42243). Full text: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11135226/pdf/

Abstract

Background: The emergence of SARS-CoV-2 variants with mutations associated with increased transmissibility and virulence is a public health concern in Ontario, Canada. Characterizing how the mutational patterns of the SARS-CoV-2 genome have changed over time can shed light on the driving factors, including selection for increased fitness and host immune response, that may contribute to the emergence of novel variants. Moreover, studying SARS-CoV-2 in the microcosm of Ontario, Canada, as a model system can reveal how province-specific public health policies over time may be associated with the observed mutational patterns.

Objective: This study aimed to perform a comprehensive analysis of single base substitution (SBS) types, counts, and genomic locations observed in SARS-CoV-2 genomic sequences sampled in Ontario, Canada. Comparisons of mutational patterns were conducted between sequences sampled during 4 different epochs delimited by major public health events to track the evolution of the SARS-CoV-2 mutational landscape over 2 years.

Methods: In total, 24,244 SARS-CoV-2 genomic sequences and associated metadata sampled in Ontario, Canada from January 1, 2020, to December 31, 2021, were retrieved from the Global Initiative on Sharing All Influenza Data (GISAID) database. Sequences were assigned to 4 epochs delimited by major public health events based on the sampling date. SBSs in each SARS-CoV-2 sequence were identified relative to the MN996528.1 reference genome. Catalogues of SBS types and counts were generated to estimate the impact of selection in each open reading frame and to identify mutation clusters. Mutational fitness over time was estimated using the Augur pipeline.
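The epoch assignment and SBS cataloguing steps described in the Methods can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the epoch boundary dates are hypothetical placeholders (the abstract does not state them), the function names are invented for illustration, and the sequences are assumed to be already aligned to the reference with no insertions or deletions.

```python
from collections import Counter
from datetime import date

# Hypothetical epoch start dates. The abstract does not give the dates of the
# public health events that delimit the epochs, so these are placeholders.
EPOCH_STARTS = [
    (1, date(2020, 1, 1)),
    (2, date(2020, 7, 1)),
    (3, date(2021, 1, 1)),
    (4, date(2021, 7, 1)),
]

VALID_BASES = set("ACGU")


def assign_epoch(sampling_date):
    """Return the latest epoch that starts on or before the sampling date."""
    epoch = None
    for number, start in EPOCH_STARTS:
        if sampling_date >= start:
            epoch = number
    return epoch


def normalise(sequence):
    """Upper-case the sequence and use the RNA alphabet (T becomes U)."""
    return sequence.upper().replace("T", "U")


def call_sbs(reference, sample):
    """List (1-based position, 'X>Y') substitutions between aligned sequences.

    Positions where either base is a gap or an ambiguity code are skipped.
    """
    ref, alt = normalise(reference), normalise(sample)
    if len(ref) != len(alt):
        raise ValueError("sequences must be aligned to the same length")
    calls = []
    for position, (ref_base, alt_base) in enumerate(zip(ref, alt), start=1):
        if ref_base in VALID_BASES and alt_base in VALID_BASES and ref_base != alt_base:
            calls.append((position, f"{ref_base}>{alt_base}"))
    return calls


def sbs_catalogue(reference, records):
    """Count SBS types per epoch for (sampling_date, aligned_sequence) records."""
    catalogue = {}
    for sampling_date, sequence in records:
        epoch = assign_epoch(sampling_date)
        if epoch is None:
            continue  # sampled before the first epoch
        counts = catalogue.setdefault(epoch, Counter())
        counts.update(sbs_type for _, sbs_type in call_sbs(reference, sequence))
    return catalogue


if __name__ == "__main__":
    toy_reference = "ACGTACGT"  # toy stand-in, not MN996528.1
    toy_records = [
        (date(2020, 3, 1), "ACGCACGT"),
        (date(2021, 12, 1), "ACGCACGA"),
    ]
    print(sbs_catalogue(toy_reference, toy_records))
```

In practice the reference would be the full MN996528.1 genome and each sample would first pass through a multiple sequence alignment, since indels shift positions and would otherwise be miscalled as runs of substitutions.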

Results: The observed biases in SBS types and proportions support previous reports of host antiviral defense activity involving the SARS-CoV-2 genome. An increase in U>C substitutions, which are associated with adenosine deaminase acting on RNA (ADAR) activity, was observed uniquely during Epoch 4. The burden of novel SBSs observed in SARS-CoV-2 genomic sequences was greatest in Epoch 2 (median 5), followed by Epoch 3 (median 4). Clusters of SBSs were observed in the spike protein open reading frame, ORF1a, and ORF3a. The high proportion of nonsynonymous SBSs and the rise of the dN/dS metric (ratio of nonsynonymous to synonymous mutations in a given open reading frame) to above 1 in Epoch 4 indicate positive selection of the spike protein open reading frame.
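The nonsynonymous versus synonymous distinction behind the dN/dS metric can be sketched as follows. This is a simplified illustration under stated assumptions, not the method used in the study: it takes the abstract's plain definition (a ratio of mutation counts within one open reading frame), whereas dN/dS estimators such as Nei-Gojobori also normalise by the number of nonsynonymous and synonymous sites. The function names and the toy sequence are hypothetical.

```python
from collections import Counter

# Standard genetic code, built from the usual compact TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def classify_sbs(ref_orf, offset, alt_base):
    """Classify one substitution in an in-frame ORF.

    `offset` is the 0-based position within the ORF; the ORF is assumed to
    start at a codon boundary and contain only unambiguous bases.
    """
    orf = ref_orf.upper().replace("U", "T")
    alt = alt_base.upper().replace("U", "T")
    codon_start = offset - offset % 3
    ref_codon = orf[codon_start:codon_start + 3]
    alt_codon = list(ref_codon)
    alt_codon[offset % 3] = alt
    alt_codon = "".join(alt_codon)
    if CODON_TABLE[ref_codon] == CODON_TABLE[alt_codon]:
        return "synonymous"
    return "nonsynonymous"


def raw_n_to_s_ratio(ref_orf, substitutions):
    """Ratio of nonsynonymous to synonymous substitution counts.

    `substitutions` is an iterable of (offset, alt_base) pairs. Returns None
    when no synonymous substitution is present. Counts are NOT normalised by
    the number of sites, so this is only a crude proxy for dN/dS.
    """
    counts = Counter(classify_sbs(ref_orf, o, b) for o, b in substitutions)
    if counts["synonymous"] == 0:
        return None
    return counts["nonsynonymous"] / counts["synonymous"]


if __name__ == "__main__":
    toy_orf = "ATGGCTAAA"  # toy ORF encoding Met-Ala-Lys
    toy_subs = [(3, "A"), (4, "T"), (6, "G"), (5, "C"), (8, "G")]
    print(raw_n_to_s_ratio(toy_orf, toy_subs))
```

A value above 1 from a site-normalised estimator is the usual signal of positive selection; the raw count ratio here is sensitive to codon composition and should be read only as an illustration of the classification step.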

Conclusions: Quantitative analysis of the mutational patterns of the SARS-CoV-2 genome in the microcosm of Ontario, Canada within early consecutive epochs of the pandemic tracked the mutational dynamics in the context of public health events that instigate significant shifts in selection and mutagenesis. Continued genomic surveillance of emergent variants will be useful for the design of public health policies in response to the evolving COVID-19 pandemic.
