SARS-CoV-2 Screening on the Multiplex Real-Time RT-PCR Gene Cycle Threshold - A Machine Learning Approach

IF 0.2 Q4 Biochemistry, Genetics and Molecular Biology
Sekaran Karthik, R. Gnanasambandan, Iyyadurai Ramya, G. Karthik, Priya Doss C. George
{"title":"SARS-CoV-2 Screening on the Multiplex Real-Time RT-PCR Gene Cycle Threshold - A Machine Learning Approach","authors":"Sekaran Karthik, R. Gnanasambandan, Iyyadurai Ramya, G. Karthik, Priya Doss C. George","doi":"10.25303/1810rjbt1830190","DOIUrl":null,"url":null,"abstract":"Early diagnosis of the lethal SARS-CoV-2 virus determines a patient’s survival rate. Highly transmissible novel coronavirus prevention is possible with effective, rapid diagnostic strategies. The reverse transcription polymerase chain reaction (RT-PCR), a globally adopted SARS-CoV-2 detection technique, provides better diagnosis results. The output of the RT-PCR test produces the amplified gene scores of ORF1a/b, S, N, E and RdRp. This study intends to evaluate the performance of the RT-PCR-based COVID-19 diagnosis using machine learning models. The confirmatory genes ORF1b, E and RdRp and their cycle threshold (Ct) values are the main parameters used to build the machine learning model for SARS-CoV-2 screening. The real-time dataset collected from the Indian Council of Medical Research (ICMR) database containing missing, redundant information is processed and eliminated. Statistical interpretations are performed with demographic information to understand the dynamics of the disease prevalence in India. Binary classification models delivered promising results in discriminating the samples of two classes. The models were examined further to scrutinize their performance via evaluation metrics such as balanced accuracy, f1-score, ROC curve, precision and recall. This algorithmic assessment exhibits a better outcome on the RT-PCR-based SARS-CoV-2 disease diagnosis.","PeriodicalId":21091,"journal":{"name":"Research Journal of Biotechnology","volume":null,"pages":null},"PeriodicalIF":0.2000,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research Journal of Biotechnology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.25303/1810rjbt1830190","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 0

Abstract

Early diagnosis of the lethal SARS-CoV-2 virus determines a patient’s survival rate. Highly transmissible novel coronavirus prevention is possible with effective, rapid diagnostic strategies. The reverse transcription polymerase chain reaction (RT-PCR), a globally adopted SARS-CoV-2 detection technique, provides better diagnosis results. The output of the RT-PCR test produces the amplified gene scores of ORF1a/b, S, N, E and RdRp. This study intends to evaluate the performance of the RT-PCR-based COVID-19 diagnosis using machine learning models. The confirmatory genes ORF1b, E and RdRp and their cycle threshold (Ct) values are the main parameters used to build the machine learning model for SARS-CoV-2 screening. The real-time dataset collected from the Indian Council of Medical Research (ICMR) database containing missing, redundant information is processed and eliminated. Statistical interpretations are performed with demographic information to understand the dynamics of the disease prevalence in India. Binary classification models delivered promising results in discriminating the samples of two classes. The models were examined further to scrutinize their performance via evaluation metrics such as balanced accuracy, f1-score, ROC curve, precision and recall. This algorithmic assessment exhibits a better outcome on the RT-PCR-based SARS-CoV-2 disease diagnosis.
基于多重实时RT-PCR基因周期阈值的SARS-CoV-2筛选——一种机器学习方法
对致命的SARS-CoV-2病毒的早期诊断决定了患者的存活率。通过有效、快速的诊断策略,可以预防高度传染性的新型冠状病毒。逆转录聚合酶链反应(RT-PCR)是全球通用的SARS-CoV-2检测技术,其诊断效果更好。RT-PCR检测输出ORF1a/b、S、N、E和RdRp的扩增基因评分。本研究旨在利用机器学习模型评估基于rt - pcr的COVID-19诊断的性能。验证性基因ORF1b、E和RdRp及其周期阈值(Ct)值是构建SARS-CoV-2筛选机器学习模型的主要参数。从印度医学研究委员会(ICMR)数据库收集的包含缺失和冗余信息的实时数据集经过处理并消除。利用人口统计信息进行统计解释,以了解印度疾病流行的动态。二元分类模型在区分两类样本方面取得了令人满意的结果。通过平衡精度、f1-score、ROC曲线、精密度和召回率等评价指标进一步检验模型的性能。该算法评估在基于rt - pcr的SARS-CoV-2疾病诊断中显示出更好的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Research Journal of Biotechnology
Research Journal of Biotechnology 工程技术-生物工程与应用微生物
CiteScore
0.60
自引率
0.00%
发文量
192
审稿时长
1.5 months
期刊介绍: We invite you to contribute Research Papers / Short Communications / Review Papers: -In any field of Biotechnology, Biochemistry, Microbiology and Industrial Microbiology, Soil Technology, Agriculture Biotechnology. -in any field related to Food Biotechnology, Nutrition Biotechnology, Genetic Engineering and Commercial Biotechnology. -in any field of Biotechnology related to Drugs and Pharmaceutical products for human beings, animals and plants. -in any field related to Environmental Biotechnolgy, Waste Treatment of Liquids, Soilds and Gases; Sustainability. -in inter-realted field of Chemical Sciences, Biological Sciences, Environmental Sciences and Life Sciences. -in any field related to Biotechnological Engineering, Industrial Biotechnology and Instrumentation. -in any field related to Nano-technology. -in any field related to Plant Biotechnology.
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信