用于跟踪COVID-19流行病学的系统基因组学:非洲大陆的基因组数据缺口

A. Adebowale, Precious K. Letebele
{"title":"用于跟踪COVID-19流行病学的系统基因组学:非洲大陆的基因组数据缺口","authors":"A. Adebowale, Precious K. Letebele","doi":"10.1109/IMITEC50163.2020.9334111","DOIUrl":null,"url":null,"abstract":"The COVID-19 pandemic has disrupted health systems the world over, resulting in the loss of many lives and destabilising economies. Phylogenomic tracking of the pandemic represents one of the ways to monitor its spread in real-time. However, effective phylogenomic monitoring is dependent on the generation and analysis of rich genomic datasets. In this study, we performed phylogenetic analysis on SARS-CoV-2 genome data for the African continent to illustrate the spread of the pandemic. Africa's contribution to the SARS-CoV-2 genome data stands at just under 2% of the global total, with only seven countries currently represented on the NCBI virus database, and 16 countries on the GISAID database, as of 10 August 2020. A large portion of the data (79%) in NCBI is from Egypt, while sequence data from South Africa (48%) dominates the GISAID collection. Although there exist a massive data gap in terms of geographic coverage and scale across both databases, the inferred phylogeny is consistent with Egypt having the first reported case of COVID-19 on the continent, with multiple independent infections in other parts of Africa. However, we identify significant incongruences in the timing of sampling and placement of sequence on the inferred phylogeny. We surmise that the source of incongruence is a probable discrepancy between sample collection and sequence generation, leading to phylogenetic placements that violate basic rule of molecular evolutionary progression. Consequently, we propose the rapid processing of samples destined for sequencing as soon as they are collected, as Africa gradually increases its SARS-CoV-2 genomic footprint. We also advocate for the release of SARS-CoV-2 genomic sequences to the public domain to facilitate quality research around the virus.","PeriodicalId":349926,"journal":{"name":"2020 2nd International Multidisciplinary Information Technology and Engineering Conference (IMITEC)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Phylogenomics for Tracking the Epidemiology of COVID-19: The Genomic Data Gap for the African Continent\",\"authors\":\"A. Adebowale, Precious K. Letebele\",\"doi\":\"10.1109/IMITEC50163.2020.9334111\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The COVID-19 pandemic has disrupted health systems the world over, resulting in the loss of many lives and destabilising economies. Phylogenomic tracking of the pandemic represents one of the ways to monitor its spread in real-time. However, effective phylogenomic monitoring is dependent on the generation and analysis of rich genomic datasets. In this study, we performed phylogenetic analysis on SARS-CoV-2 genome data for the African continent to illustrate the spread of the pandemic. Africa's contribution to the SARS-CoV-2 genome data stands at just under 2% of the global total, with only seven countries currently represented on the NCBI virus database, and 16 countries on the GISAID database, as of 10 August 2020. A large portion of the data (79%) in NCBI is from Egypt, while sequence data from South Africa (48%) dominates the GISAID collection. Although there exist a massive data gap in terms of geographic coverage and scale across both databases, the inferred phylogeny is consistent with Egypt having the first reported case of COVID-19 on the continent, with multiple independent infections in other parts of Africa. However, we identify significant incongruences in the timing of sampling and placement of sequence on the inferred phylogeny. We surmise that the source of incongruence is a probable discrepancy between sample collection and sequence generation, leading to phylogenetic placements that violate basic rule of molecular evolutionary progression. Consequently, we propose the rapid processing of samples destined for sequencing as soon as they are collected, as Africa gradually increases its SARS-CoV-2 genomic footprint. We also advocate for the release of SARS-CoV-2 genomic sequences to the public domain to facilitate quality research around the virus.\",\"PeriodicalId\":349926,\"journal\":{\"name\":\"2020 2nd International Multidisciplinary Information Technology and Engineering Conference (IMITEC)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 2nd International Multidisciplinary Information Technology and Engineering Conference (IMITEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IMITEC50163.2020.9334111\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd International Multidisciplinary Information Technology and Engineering Conference (IMITEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMITEC50163.2020.9334111","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

2019冠状病毒病大流行扰乱了世界各地的卫生系统,导致许多人丧生,并破坏了经济的稳定。大流行的系统基因组追踪是实时监测其传播的方法之一。然而,有效的系统基因组监测依赖于丰富的基因组数据集的生成和分析。在这项研究中,我们对非洲大陆的SARS-CoV-2基因组数据进行了系统发育分析,以说明大流行的传播。截至2020年8月10日,非洲对SARS-CoV-2基因组数据的贡献略低于全球总量的2%,目前NCBI病毒数据库中只有7个国家,GISAID数据库中只有16个国家。NCBI的大部分数据(79%)来自埃及,而GISAID收集的序列数据主要来自南非(48%)。尽管这两个数据库在地理覆盖和规模方面存在巨大的数据缺口,但推断出的系统发育与埃及是非洲大陆第一例报告的COVID-19病例一致,非洲其他地区也有多例独立感染。然而,我们发现在采样的时间和序列的放置在推断的系统发育显著不一致。我们推测,不一致的来源可能是样本收集和序列生成之间的差异,导致违反分子进化进程基本规则的系统发育定位。因此,随着非洲逐渐增加其SARS-CoV-2基因组足迹,我们建议在收集到样本后立即对其进行快速处理。我们还主张向公众发布新冠病毒基因组序列,以促进围绕该病毒的高质量研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Phylogenomics for Tracking the Epidemiology of COVID-19: The Genomic Data Gap for the African Continent
The COVID-19 pandemic has disrupted health systems the world over, resulting in the loss of many lives and destabilising economies. Phylogenomic tracking of the pandemic represents one of the ways to monitor its spread in real-time. However, effective phylogenomic monitoring is dependent on the generation and analysis of rich genomic datasets. In this study, we performed phylogenetic analysis on SARS-CoV-2 genome data for the African continent to illustrate the spread of the pandemic. Africa's contribution to the SARS-CoV-2 genome data stands at just under 2% of the global total, with only seven countries currently represented on the NCBI virus database, and 16 countries on the GISAID database, as of 10 August 2020. A large portion of the data (79%) in NCBI is from Egypt, while sequence data from South Africa (48%) dominates the GISAID collection. Although there exist a massive data gap in terms of geographic coverage and scale across both databases, the inferred phylogeny is consistent with Egypt having the first reported case of COVID-19 on the continent, with multiple independent infections in other parts of Africa. However, we identify significant incongruences in the timing of sampling and placement of sequence on the inferred phylogeny. We surmise that the source of incongruence is a probable discrepancy between sample collection and sequence generation, leading to phylogenetic placements that violate basic rule of molecular evolutionary progression. Consequently, we propose the rapid processing of samples destined for sequencing as soon as they are collected, as Africa gradually increases its SARS-CoV-2 genomic footprint. We also advocate for the release of SARS-CoV-2 genomic sequences to the public domain to facilitate quality research around the virus.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信