对SARS-CoV-2基因组帧移的误检:需要进一步协调和有效监测数据工作流程

IF 5.4
Rok Kogoj, Mauro Petrillo, Samo Zakotnik, Alen Suljič, Miša Korva, Gabriele Leoni
{"title":"对SARS-CoV-2基因组帧移的误检:需要进一步协调和有效监测数据工作流程","authors":"Rok Kogoj, Mauro Petrillo, Samo Zakotnik, Alen Suljič, Miša Korva, Gabriele Leoni","doi":"10.1093/bioinformatics/btaf516","DOIUrl":null,"url":null,"abstract":"<p><p>Five years after the outbreak of the SARS-CoV-2 pandemic in 2020, diagnostic laboratories have moved from massive sequencing of thousands of samples to routine surveillance of SARS-CoV-2 cases as with all other respiratory viruses. Surveillance remains of paramount importance to prevent a further SARS-CoV-2 surge, as the virus has been shown to mutate rapidly and can render available drugs and vaccines ineffective. During the pandemic, several bioinformatics pipelines and workflows have been developed to streamline analysis, shorten turn-around time and ensure reproducibility. As the number of samples decreases, laboratories are moving towards more flexible sequencing strategies and optimising the cost per sample. However, workflow redesigns, even if individual steps have proven successful time and time again, can lead to challenges when changes in a bioinformatics pipeline are introduced (e.g., version updates, implementation of new features, etc.), a new combination of viral mutations emerge or, a change in wet-lab procedures lead to unpredictable results. Here we present a report of misidentified frameshift mutations in the consensus sequence of SARS-CoV-2, which led to an incorrect assumption of mutations in the spike and nucleocapsid viral proteins with the potential to affect PCR detection or even antigen testing. This investigation exemplifies the need for better awareness of the challenges that can occur even when using routinely applied protocols and analytical workflows and highlights the need for cooperation between experts of NGS, bioinformaticians and decision-makers towards more harmonised data workflows.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Misdetection of frameshifts in SARS-CoV-2 genomes: need for additional harmonisation and efficient monitoring of data workflows.\",\"authors\":\"Rok Kogoj, Mauro Petrillo, Samo Zakotnik, Alen Suljič, Miša Korva, Gabriele Leoni\",\"doi\":\"10.1093/bioinformatics/btaf516\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Five years after the outbreak of the SARS-CoV-2 pandemic in 2020, diagnostic laboratories have moved from massive sequencing of thousands of samples to routine surveillance of SARS-CoV-2 cases as with all other respiratory viruses. Surveillance remains of paramount importance to prevent a further SARS-CoV-2 surge, as the virus has been shown to mutate rapidly and can render available drugs and vaccines ineffective. During the pandemic, several bioinformatics pipelines and workflows have been developed to streamline analysis, shorten turn-around time and ensure reproducibility. As the number of samples decreases, laboratories are moving towards more flexible sequencing strategies and optimising the cost per sample. However, workflow redesigns, even if individual steps have proven successful time and time again, can lead to challenges when changes in a bioinformatics pipeline are introduced (e.g., version updates, implementation of new features, etc.), a new combination of viral mutations emerge or, a change in wet-lab procedures lead to unpredictable results. Here we present a report of misidentified frameshift mutations in the consensus sequence of SARS-CoV-2, which led to an incorrect assumption of mutations in the spike and nucleocapsid viral proteins with the potential to affect PCR detection or even antigen testing. This investigation exemplifies the need for better awareness of the challenges that can occur even when using routinely applied protocols and analytical workflows and highlights the need for cooperation between experts of NGS, bioinformaticians and decision-makers towards more harmonised data workflows.</p>\",\"PeriodicalId\":93899,\"journal\":{\"name\":\"Bioinformatics (Oxford, England)\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":5.4000,\"publicationDate\":\"2025-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Bioinformatics (Oxford, England)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/bioinformatics/btaf516\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf516","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在2020年SARS-CoV-2大流行爆发五年后,诊断实验室已经从对数千个样本进行大规模测序转向像对所有其他呼吸道病毒一样对SARS-CoV-2病例进行常规监测。监测对于防止SARS-CoV-2进一步激增仍然至关重要,因为该病毒已被证明会迅速变异,并可能使现有的药物和疫苗失效。在大流行期间,已制定了若干生物信息学管道和工作流程,以简化分析、缩短周转时间并确保可重复性。随着样品数量的减少,实验室正朝着更灵活的测序策略和优化每个样品的成本迈进。然而,工作流程的重新设计,即使个别步骤一次又一次被证明是成功的,当引入生物信息学管道中的变化时(例如,版本更新,新功能的实现等),病毒突变的新组合出现,或者湿实验室程序的变化导致不可预测的结果时,可能会导致挑战。在这里,我们报告了SARS-CoV-2共识序列中被错误识别的移码突变,这导致了对刺突和核衣壳病毒蛋白突变的错误假设,这些突变可能影响PCR检测甚至抗原检测。这项调查表明,需要更好地认识到即使在使用常规应用的协议和分析工作流程时也可能出现的挑战,并突出了NGS专家、生物信息学家和决策者之间合作的必要性,以实现更协调的数据工作流程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Misdetection of frameshifts in SARS-CoV-2 genomes: need for additional harmonisation and efficient monitoring of data workflows.

Five years after the outbreak of the SARS-CoV-2 pandemic in 2020, diagnostic laboratories have moved from massive sequencing of thousands of samples to routine surveillance of SARS-CoV-2 cases as with all other respiratory viruses. Surveillance remains of paramount importance to prevent a further SARS-CoV-2 surge, as the virus has been shown to mutate rapidly and can render available drugs and vaccines ineffective. During the pandemic, several bioinformatics pipelines and workflows have been developed to streamline analysis, shorten turn-around time and ensure reproducibility. As the number of samples decreases, laboratories are moving towards more flexible sequencing strategies and optimising the cost per sample. However, workflow redesigns, even if individual steps have proven successful time and time again, can lead to challenges when changes in a bioinformatics pipeline are introduced (e.g., version updates, implementation of new features, etc.), a new combination of viral mutations emerge or, a change in wet-lab procedures lead to unpredictable results. Here we present a report of misidentified frameshift mutations in the consensus sequence of SARS-CoV-2, which led to an incorrect assumption of mutations in the spike and nucleocapsid viral proteins with the potential to affect PCR detection or even antigen testing. This investigation exemplifies the need for better awareness of the challenges that can occur even when using routinely applied protocols and analytical workflows and highlights the need for cooperation between experts of NGS, bioinformaticians and decision-makers towards more harmonised data workflows.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信