从同一子午线国家分离 SARS-CoV-2 株系:基因组进化分析。

JMIR bioinformatics and biotechnology Pub Date : 2021-01-22 eCollection Date: 2021-01-01 DOI:10.2196/25995
Emilio Mastriani, Alexey V Rakov, Shu-Lin Liu
{"title":"从同一子午线国家分离 SARS-CoV-2 株系:基因组进化分析。","authors":"Emilio Mastriani, Alexey V Rakov, Shu-Lin Liu","doi":"10.2196/25995","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>COVID-19, caused by the novel SARS-CoV-2, is considered the most threatening respiratory infection in the world, with over 40 million people infected and over 0.934 million related deaths reported worldwide. It is speculated that epidemiological and clinical features of COVID-19 may differ across countries or continents. Genomic comparison of 48,635 SARS-CoV-2 genomes has shown that the average number of mutations per sample was 7.23, and most SARS-CoV-2 strains belong to one of 3 clades characterized by geographic and genomic specificity: Europe, Asia, and North America.</p><p><strong>Objective: </strong>The aim of this study was to compare the genomes of SARS-CoV-2 strains isolated from Italy, Sweden, and Congo, that is, 3 different countries in the same meridian (longitude) but with different climate conditions, and from Brazil (as an outgroup country), to analyze similarities or differences in patterns of possible evolutionary pressure signatures in their genomes.</p><p><strong>Methods: </strong>We obtained data from the Global Initiative on Sharing All Influenza Data repository by sampling all genomes available on that date. Using HyPhy, we achieved the recombination analysis by genetic algorithm recombination detection method, trimming, removal of the stop codons, and phylogenetic tree and mixed effects model of evolution analyses. We also performed secondary structure prediction analysis for both sequences (mutated and wild-type) and \"disorder\" and \"transmembrane\" analyses of the protein. We analyzed both protein structures with an ab initio approach to predict their ontologies and 3D structures.</p><p><strong>Results: </strong>Evolutionary analysis revealed that codon 9628 is under episodic selective pressure for all SARS-CoV-2 strains isolated from the 4 countries, suggesting it is a key site for virus evolution. Codon 9628 encodes the P0DTD3 (Y14_SARS2) uncharacterized protein 14. Further investigation showed that the codon mutation was responsible for helical modification in the secondary structure. The codon was positioned in the more ordered region of the gene (41-59) and near to the area acting as the transmembrane (54-67), suggesting its involvement in the attachment phase of the virus. The predicted protein structures of both wild-type and mutated P0DTD3 confirmed the importance of the codon to define the protein structure. Moreover, ontological analysis of the protein emphasized that the mutation enhances the binding probability.</p><p><strong>Conclusions: </strong>Our results suggest that RNA secondary structure may be affected and, consequently, the protein product changes T (threonine) to G (glycine) in position 50 of the protein. This position is located close to the predicted transmembrane region. Mutation analysis revealed that the change from G (glycine) to D (aspartic acid) may confer a new function to the protein-binding activity, which in turn may be responsible for attaching the virus to human eukaryotic cells. These findings can help design in vitro experiments and possibly facilitate a vaccine design and successful antiviral strategies.</p>","PeriodicalId":73552,"journal":{"name":"JMIR bioinformatics and biotechnology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7837406/pdf/","citationCount":"0","resultStr":"{\"title\":\"Isolating SARS-CoV-2 Strains From Countries in the Same Meridian: Genome Evolutionary Analysis.\",\"authors\":\"Emilio Mastriani, Alexey V Rakov, Shu-Lin Liu\",\"doi\":\"10.2196/25995\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>COVID-19, caused by the novel SARS-CoV-2, is considered the most threatening respiratory infection in the world, with over 40 million people infected and over 0.934 million related deaths reported worldwide. It is speculated that epidemiological and clinical features of COVID-19 may differ across countries or continents. Genomic comparison of 48,635 SARS-CoV-2 genomes has shown that the average number of mutations per sample was 7.23, and most SARS-CoV-2 strains belong to one of 3 clades characterized by geographic and genomic specificity: Europe, Asia, and North America.</p><p><strong>Objective: </strong>The aim of this study was to compare the genomes of SARS-CoV-2 strains isolated from Italy, Sweden, and Congo, that is, 3 different countries in the same meridian (longitude) but with different climate conditions, and from Brazil (as an outgroup country), to analyze similarities or differences in patterns of possible evolutionary pressure signatures in their genomes.</p><p><strong>Methods: </strong>We obtained data from the Global Initiative on Sharing All Influenza Data repository by sampling all genomes available on that date. Using HyPhy, we achieved the recombination analysis by genetic algorithm recombination detection method, trimming, removal of the stop codons, and phylogenetic tree and mixed effects model of evolution analyses. We also performed secondary structure prediction analysis for both sequences (mutated and wild-type) and \\\"disorder\\\" and \\\"transmembrane\\\" analyses of the protein. We analyzed both protein structures with an ab initio approach to predict their ontologies and 3D structures.</p><p><strong>Results: </strong>Evolutionary analysis revealed that codon 9628 is under episodic selective pressure for all SARS-CoV-2 strains isolated from the 4 countries, suggesting it is a key site for virus evolution. Codon 9628 encodes the P0DTD3 (Y14_SARS2) uncharacterized protein 14. Further investigation showed that the codon mutation was responsible for helical modification in the secondary structure. The codon was positioned in the more ordered region of the gene (41-59) and near to the area acting as the transmembrane (54-67), suggesting its involvement in the attachment phase of the virus. The predicted protein structures of both wild-type and mutated P0DTD3 confirmed the importance of the codon to define the protein structure. Moreover, ontological analysis of the protein emphasized that the mutation enhances the binding probability.</p><p><strong>Conclusions: </strong>Our results suggest that RNA secondary structure may be affected and, consequently, the protein product changes T (threonine) to G (glycine) in position 50 of the protein. This position is located close to the predicted transmembrane region. Mutation analysis revealed that the change from G (glycine) to D (aspartic acid) may confer a new function to the protein-binding activity, which in turn may be responsible for attaching the virus to human eukaryotic cells. These findings can help design in vitro experiments and possibly facilitate a vaccine design and successful antiviral strategies.</p>\",\"PeriodicalId\":73552,\"journal\":{\"name\":\"JMIR bioinformatics and biotechnology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7837406/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR bioinformatics and biotechnology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2196/25995\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2021/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR bioinformatics and biotechnology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/25995","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

背景:由新型 SARS-CoV-2 引起的 COVID-19 被认为是世界上最具威胁性的呼吸道传染病,全球有超过 4000 万人感染,超过 93.4 万人因此死亡。据推测,COVID-19 的流行病学和临床特征在不同国家或大陆可能有所不同。对 48,635 个 SARS-CoV-2 基因组进行的基因组比较显示,每个样本的平均变异数为 7.23 个,大多数 SARS-CoV-2 株系属于 3 个具有地理和基因组特异性的支系之一:大多数 SARS-CoV-2 株系属于 3 个具有地理和基因组特异性的支系之一:欧洲、亚洲和北美洲:本研究的目的是比较从意大利、瑞典和刚果(即位于同一子午线(经度)但气候条件不同的 3 个不同国家)分离的 SARS-CoV-2 株系以及从巴西(作为外群国家)分离的 SARS-CoV-2 株系的基因组,分析其基因组中可能存在的进化压力特征模式的异同:我们从全球流感数据共享计划(Global Initiative on Sharing All Influenza Data Repository)中获取数据,对当日可用的所有基因组进行采样。使用 HyPhy,我们通过遗传算法重组检测方法、修剪、移除终止密码子、系统发生树和进化混合效应模型分析实现了重组分析。我们还对两个序列(突变型和野生型)进行了二级结构预测分析,并对蛋白质进行了 "紊乱 "和 "跨膜 "分析。我们用ab initio方法分析了这两种蛋白质的结构,以预测它们的本体和三维结构:进化分析表明,在这 4 个国家分离出的所有 SARS-CoV-2 株系中,密码子 9628 都受到偶发性选择压力,这表明它是病毒进化的一个关键位点。密码子 9628 编码 P0DTD3(Y14_SARS2)未定性蛋白 14。进一步研究表明,该密码子突变导致二级结构发生螺旋状改变。该密码子位于基因较有序的区域(41-59),靠近作为跨膜的区域(54-67),表明它参与了病毒的附着阶段。野生型和突变型 P0DTD3 的预测蛋白质结构证实了密码子对确定蛋白质结构的重要性。此外,对蛋白质的本体分析强调,突变增强了结合概率:我们的研究结果表明,RNA 二级结构可能受到了影响,因此蛋白质产物中第 50 位的 T(苏氨酸)变为了 G(甘氨酸)。该位置靠近预测的跨膜区。突变分析表明,从 G(甘氨酸)到 D(天冬氨酸)的变化可能赋予蛋白质结合活性一种新的功能,而这又可能是病毒附着在人类真核细胞上的原因。这些发现有助于设计体外实验,并有可能促进疫苗设计和成功的抗病毒策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Isolating SARS-CoV-2 Strains From Countries in the Same Meridian: Genome Evolutionary Analysis.

Background: COVID-19, caused by the novel SARS-CoV-2, is considered the most threatening respiratory infection in the world, with over 40 million people infected and over 0.934 million related deaths reported worldwide. It is speculated that epidemiological and clinical features of COVID-19 may differ across countries or continents. Genomic comparison of 48,635 SARS-CoV-2 genomes has shown that the average number of mutations per sample was 7.23, and most SARS-CoV-2 strains belong to one of 3 clades characterized by geographic and genomic specificity: Europe, Asia, and North America.

Objective: The aim of this study was to compare the genomes of SARS-CoV-2 strains isolated from Italy, Sweden, and Congo, that is, 3 different countries in the same meridian (longitude) but with different climate conditions, and from Brazil (as an outgroup country), to analyze similarities or differences in patterns of possible evolutionary pressure signatures in their genomes.

Methods: We obtained data from the Global Initiative on Sharing All Influenza Data repository by sampling all genomes available on that date. Using HyPhy, we achieved the recombination analysis by genetic algorithm recombination detection method, trimming, removal of the stop codons, and phylogenetic tree and mixed effects model of evolution analyses. We also performed secondary structure prediction analysis for both sequences (mutated and wild-type) and "disorder" and "transmembrane" analyses of the protein. We analyzed both protein structures with an ab initio approach to predict their ontologies and 3D structures.

Results: Evolutionary analysis revealed that codon 9628 is under episodic selective pressure for all SARS-CoV-2 strains isolated from the 4 countries, suggesting it is a key site for virus evolution. Codon 9628 encodes the P0DTD3 (Y14_SARS2) uncharacterized protein 14. Further investigation showed that the codon mutation was responsible for helical modification in the secondary structure. The codon was positioned in the more ordered region of the gene (41-59) and near to the area acting as the transmembrane (54-67), suggesting its involvement in the attachment phase of the virus. The predicted protein structures of both wild-type and mutated P0DTD3 confirmed the importance of the codon to define the protein structure. Moreover, ontological analysis of the protein emphasized that the mutation enhances the binding probability.

Conclusions: Our results suggest that RNA secondary structure may be affected and, consequently, the protein product changes T (threonine) to G (glycine) in position 50 of the protein. This position is located close to the predicted transmembrane region. Mutation analysis revealed that the change from G (glycine) to D (aspartic acid) may confer a new function to the protein-binding activity, which in turn may be responsible for attaching the virus to human eukaryotic cells. These findings can help design in vitro experiments and possibly facilitate a vaccine design and successful antiviral strategies.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
2.90
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信