通过将基于 qPCR 的简单直接方案调整与成熟的机器学习算法相结合,加强 SARS-CoV-2 世系监测。

IF 6.7 1区 化学 Q1 CHEMISTRY, ANALYTICAL
Analytical Chemistry Pub Date : 2024-11-19 Epub Date: 2024-11-04 DOI:10.1021/acs.analchem.4c04492
Cleber Furtado Aksenen, Debora Maria Almeida Ferreira, Pedro Miguel Carneiro Jeronimo, Thais de Oliveira Costa, Ticiane Cavalcante de Souza, Bruna Maria Nepomuceno Sousa Lino, Allysson Allan de Farias, Fabio Miyajima
{"title":"通过将基于 qPCR 的简单直接方案调整与成熟的机器学习算法相结合,加强 SARS-CoV-2 世系监测。","authors":"Cleber Furtado Aksenen, Debora Maria Almeida Ferreira, Pedro Miguel Carneiro Jeronimo, Thais de Oliveira Costa, Ticiane Cavalcante de Souza, Bruna Maria Nepomuceno Sousa Lino, Allysson Allan de Farias, Fabio Miyajima","doi":"10.1021/acs.analchem.4c04492","DOIUrl":null,"url":null,"abstract":"<p><p>Emerging and evolving Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) lineages, adapted to changing epidemiological conditions, present unprecedented challenges to global public health systems. Here, we introduce an adapted analytical approach that complements genomic sequencing, applying a cost-effective quantitative polymerase chain reaction (qPCR)-based assay. Viral RNA samples from SARS-CoV-2 positive cases detected by diagnostic laboratories or public health network units in Ceará, Brazil, were tracked for genomic surveillance and analyzed by using paired-end sequencing combined with integrative genomic analysis. Validation of a key structural variation was conducted with gel electrophoresis for the presence of a specific <i>open reading frame 7a</i>(<i>ORF7a</i>) gene deletion within the \"BE.9\" lineages tracked. The analytical innovation of our method is the optimization of a simple intercalating dye-based qPCR assay through repositioning primers from the ARTIC v4.1 amplicon panel to detect large molecular patterns. This assay distinguishes between \"BE.9\" and \"non-BE.9\" lineages, particularly BQ.1, without the need for expensive probes or sequencing. The protocol was validated against lineage predictions from next-generation sequencing (NGS) using 525 paired samples, achieving 93.3% sensitivity, 95.1% specificity, and 92.4% agreement, as measured by Cohen's Kappa coefficient. Machine learning (ML) models were trained using the melting curves from intercalating dye-based qPCR of 1724 samples, enabling highly accurate lineage assignment. Among them, the support vector machine (SVM) model had the best performance and after fine-tuning showed ∼96.52% (333/345) accuracy in comparison to the test data set. Our integrated approach provides an adapted analytical method that is both cost-effective and scalable, suitable for rapid assessment of emerging variants, especially in resource-limited settings. In this work, the protocol is applied to improve the monitoring of SARS-CoV-2 sublineages but can be extended to track any key molecular signature, including large insertions and deletions (indels) commonly observed in pathogenic agent subtypes. By offering a complement to traditional sequencing methods and utilizing easily trainable machine learning algorithms, our methodology contributes to enhanced molecular surveillance strategies and supports global efforts in pandemic control.</p>","PeriodicalId":27,"journal":{"name":"Analytical Chemistry","volume":" ","pages":"18537-18544"},"PeriodicalIF":6.7000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing SARS-CoV-2 Lineage Surveillance through the Integration of a Simple and Direct qPCR-Based Protocol Adaptation with Established Machine Learning Algorithms.\",\"authors\":\"Cleber Furtado Aksenen, Debora Maria Almeida Ferreira, Pedro Miguel Carneiro Jeronimo, Thais de Oliveira Costa, Ticiane Cavalcante de Souza, Bruna Maria Nepomuceno Sousa Lino, Allysson Allan de Farias, Fabio Miyajima\",\"doi\":\"10.1021/acs.analchem.4c04492\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Emerging and evolving Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) lineages, adapted to changing epidemiological conditions, present unprecedented challenges to global public health systems. Here, we introduce an adapted analytical approach that complements genomic sequencing, applying a cost-effective quantitative polymerase chain reaction (qPCR)-based assay. Viral RNA samples from SARS-CoV-2 positive cases detected by diagnostic laboratories or public health network units in Ceará, Brazil, were tracked for genomic surveillance and analyzed by using paired-end sequencing combined with integrative genomic analysis. Validation of a key structural variation was conducted with gel electrophoresis for the presence of a specific <i>open reading frame 7a</i>(<i>ORF7a</i>) gene deletion within the \\\"BE.9\\\" lineages tracked. The analytical innovation of our method is the optimization of a simple intercalating dye-based qPCR assay through repositioning primers from the ARTIC v4.1 amplicon panel to detect large molecular patterns. This assay distinguishes between \\\"BE.9\\\" and \\\"non-BE.9\\\" lineages, particularly BQ.1, without the need for expensive probes or sequencing. The protocol was validated against lineage predictions from next-generation sequencing (NGS) using 525 paired samples, achieving 93.3% sensitivity, 95.1% specificity, and 92.4% agreement, as measured by Cohen's Kappa coefficient. Machine learning (ML) models were trained using the melting curves from intercalating dye-based qPCR of 1724 samples, enabling highly accurate lineage assignment. Among them, the support vector machine (SVM) model had the best performance and after fine-tuning showed ∼96.52% (333/345) accuracy in comparison to the test data set. Our integrated approach provides an adapted analytical method that is both cost-effective and scalable, suitable for rapid assessment of emerging variants, especially in resource-limited settings. In this work, the protocol is applied to improve the monitoring of SARS-CoV-2 sublineages but can be extended to track any key molecular signature, including large insertions and deletions (indels) commonly observed in pathogenic agent subtypes. By offering a complement to traditional sequencing methods and utilizing easily trainable machine learning algorithms, our methodology contributes to enhanced molecular surveillance strategies and supports global efforts in pandemic control.</p>\",\"PeriodicalId\":27,\"journal\":{\"name\":\"Analytical Chemistry\",\"volume\":\" \",\"pages\":\"18537-18544\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2024-11-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Analytical Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1021/acs.analchem.4c04492\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/11/4 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.analchem.4c04492","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

摘要

新出现和不断演变的严重急性呼吸系统综合征冠状病毒 2(SARS-CoV-2)病毒系适应了不断变化的流行病学条件,给全球公共卫生系统带来了前所未有的挑战。在这里,我们介绍了一种经过调整的分析方法,它是对基因组测序的补充,应用了一种基于定量聚合酶链反应(qPCR)的经济有效的检测方法。我们对巴西塞阿拉州诊断实验室或公共卫生网络单位检测到的 SARS-CoV-2 阳性病例的病毒 RNA 样本进行了基因组监测跟踪,并使用成对端测序结合综合基因组分析进行了分析。通过凝胶电泳验证了在所追踪的 "BE.9 "系中是否存在特定开放阅读框 7a(ORF7a)基因缺失的关键结构变异。我们的方法在分析上的创新之处在于,通过重新定位 ARTIC v4.1 扩增片段面板上的引物,优化了一种基于插层染料的简单 qPCR 检测方法,以检测大分子模式。这种检测方法可区分 "BE.9 "和 "非 BE.9 "血系,尤其是 BQ.1,而无需昂贵的探针或测序。根据科恩卡帕系数(Cohen's Kappa coefficient)的测量结果,该方法的灵敏度为 93.3%,特异性为 95.1%,一致性为 92.4%。利用基于插层染料的 qPCR 对 1724 个样本的熔解曲线训练了机器学习(ML)模型,从而实现了高精度的血缘分配。其中,支持向量机(SVM)模型性能最佳,经过微调后,与测试数据集相比,准确率达到 96.52%(333/345)。我们的综合方法提供了一种既经济又可扩展的分析方法,适用于快速评估新出现的变异体,尤其是在资源有限的环境中。在这项工作中,该方案被用于改进对 SARS-CoV-2 亚系的监测,但也可扩展用于追踪任何关键的分子特征,包括在致病剂亚型中常见的大插入和缺失(indels)。通过对传统测序方法的补充和利用易于训练的机器学习算法,我们的方法有助于加强分子监测战略,支持全球大流行病控制工作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Enhancing SARS-CoV-2 Lineage Surveillance through the Integration of a Simple and Direct qPCR-Based Protocol Adaptation with Established Machine Learning Algorithms.

Enhancing SARS-CoV-2 Lineage Surveillance through the Integration of a Simple and Direct qPCR-Based Protocol Adaptation with Established Machine Learning Algorithms.

Emerging and evolving Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) lineages, adapted to changing epidemiological conditions, present unprecedented challenges to global public health systems. Here, we introduce an adapted analytical approach that complements genomic sequencing, applying a cost-effective quantitative polymerase chain reaction (qPCR)-based assay. Viral RNA samples from SARS-CoV-2 positive cases detected by diagnostic laboratories or public health network units in Ceará, Brazil, were tracked for genomic surveillance and analyzed by using paired-end sequencing combined with integrative genomic analysis. Validation of a key structural variation was conducted with gel electrophoresis for the presence of a specific open reading frame 7a(ORF7a) gene deletion within the "BE.9" lineages tracked. The analytical innovation of our method is the optimization of a simple intercalating dye-based qPCR assay through repositioning primers from the ARTIC v4.1 amplicon panel to detect large molecular patterns. This assay distinguishes between "BE.9" and "non-BE.9" lineages, particularly BQ.1, without the need for expensive probes or sequencing. The protocol was validated against lineage predictions from next-generation sequencing (NGS) using 525 paired samples, achieving 93.3% sensitivity, 95.1% specificity, and 92.4% agreement, as measured by Cohen's Kappa coefficient. Machine learning (ML) models were trained using the melting curves from intercalating dye-based qPCR of 1724 samples, enabling highly accurate lineage assignment. Among them, the support vector machine (SVM) model had the best performance and after fine-tuning showed ∼96.52% (333/345) accuracy in comparison to the test data set. Our integrated approach provides an adapted analytical method that is both cost-effective and scalable, suitable for rapid assessment of emerging variants, especially in resource-limited settings. In this work, the protocol is applied to improve the monitoring of SARS-CoV-2 sublineages but can be extended to track any key molecular signature, including large insertions and deletions (indels) commonly observed in pathogenic agent subtypes. By offering a complement to traditional sequencing methods and utilizing easily trainable machine learning algorithms, our methodology contributes to enhanced molecular surveillance strategies and supports global efforts in pandemic control.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Analytical Chemistry
Analytical Chemistry 化学-分析化学
CiteScore
12.10
自引率
12.20%
发文量
1949
审稿时长
1.4 months
期刊介绍: Analytical Chemistry, a peer-reviewed research journal, focuses on disseminating new and original knowledge across all branches of analytical chemistry. Fundamental articles may explore general principles of chemical measurement science and need not directly address existing or potential analytical methodology. They can be entirely theoretical or report experimental results. Contributions may cover various phases of analytical operations, including sampling, bioanalysis, electrochemistry, mass spectrometry, microscale and nanoscale systems, environmental analysis, separations, spectroscopy, chemical reactions and selectivity, instrumentation, imaging, surface analysis, and data processing. Papers discussing known analytical methods should present a significant, original application of the method, a notable improvement, or results on an important analyte.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信