An interpretable machine learning model for covid-19 screening

Q2 Medicine
Gustavo Carreiro Pinasco, Eduardo Moreno Júdice de Mattos Farina, Fabiano Novaes Barcellos Filho, Willer França Fiorotti, Matheus Coradini Mariano Ferreira, Sheila Cristina de Souza Cruz, Andre Louzada Colodette, Luciene Rossati Loureiro, Tatiane Comerio, Dilzilene Cunha Sivirino Farias, Katia Valéria Manhambusque, E. de Fátima Almeida Lima
{"title":"An interpretable machine learning model for covid-19 screening","authors":"Gustavo Carreiro Pinasco, Eduardo Moreno Júdice de Mattos Farina, Fabiano Novaes Barcellos Filho, Willer França Fiorotti, Matheus Coradini Mariano Ferreira, Sheila Cristina de Souza Cruz, Andre Louzada Colodette, Luciene Rossati Loureiro, Tatiane Comerio, Dilzilene Cunha Sivirino Farias, Katia Valéria Manhambusque, E. de Fátima Almeida Lima","doi":"10.36311/jhgd.v32.13324","DOIUrl":null,"url":null,"abstract":"Introduction: the Coronavirus Disease 2019 (COVID-19) is a viral disease which has been declared a pandemic by the WHO. Diagnostic tests are expensive and are not always available. Researches using machine learning (ML) approach for diagnosing SARS-CoV-2 infection have been proposed in the literature to reduce cost and allow better control of the pandemic.\nObjective: we aim to develop a machine learning model to predict if a patient has COVID-19 with epidemiological data and clinical features.\nMethods: we used six ML algorithms for COVID-19 screening through diagnostic prediction and did an interpretative analysis using SHAP models and feature importances.\nResults: our best model was XGBoost (XGB) which obtained an area under the ROC curve of 0.752, a sensitivity of 90%, a specificity of 40%, a positive predictive value (PPV) of 42.16%, and a negative predictive value (NPV) of 91.0%. The best predictors were fever, cough, history of international travel less than 14 days ago, male gender, and nasal congestion, respectively.\nConclusion: We conclude that ML is an important tool for screening with high sensitivity, compared to rapid tests, and can be used to empower clinical precision in COVID-19, a disease in which symptoms are very unspecific.\n ","PeriodicalId":35218,"journal":{"name":"Journal of Human Growth and Development","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Human Growth and Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.36311/jhgd.v32.13324","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: the Coronavirus Disease 2019 (COVID-19) is a viral disease which has been declared a pandemic by the WHO. Diagnostic tests are expensive and are not always available. Researches using machine learning (ML) approach for diagnosing SARS-CoV-2 infection have been proposed in the literature to reduce cost and allow better control of the pandemic. Objective: we aim to develop a machine learning model to predict if a patient has COVID-19 with epidemiological data and clinical features. Methods: we used six ML algorithms for COVID-19 screening through diagnostic prediction and did an interpretative analysis using SHAP models and feature importances. Results: our best model was XGBoost (XGB) which obtained an area under the ROC curve of 0.752, a sensitivity of 90%, a specificity of 40%, a positive predictive value (PPV) of 42.16%, and a negative predictive value (NPV) of 91.0%. The best predictors were fever, cough, history of international travel less than 14 days ago, male gender, and nasal congestion, respectively. Conclusion: We conclude that ML is an important tool for screening with high sensitivity, compared to rapid tests, and can be used to empower clinical precision in COVID-19, a disease in which symptoms are very unspecific.  
新冠肺炎筛查的可解释机器学习模型
简介:2019冠状病毒病(新冠肺炎)是一种已被世界卫生组织宣布为大流行的病毒性疾病。诊断测试费用高昂,而且并不总是可用的。文献中提出了使用机器学习(ML)方法诊断严重急性呼吸系统综合征冠状病毒2型感染的研究,以降低成本并更好地控制疫情。目的:我们旨在开发一个机器学习模型,根据流行病学数据和临床特征预测患者是否患有新冠肺炎。方法:我们使用六种ML算法通过诊断预测进行新冠肺炎筛查,并使用SHAP模型和特征重要性进行解释性分析。结果:我们的最佳模型是XGBoost(XGB),其ROC曲线下面积为0.752,敏感性为90%,特异性为40%,阳性预测值(PPV)为42.16%,阴性预测值(NPV)为91.0%。最佳预测值分别为发烧、咳嗽、14天以下国际旅行史、男性和鼻塞。结论:我们得出结论,与快速检测相比,ML是一种高灵敏度筛查的重要工具,可用于增强新冠肺炎的临床准确性,这是一种症状非常不特异性的疾病。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Human Growth and Development
Journal of Human Growth and Development Social Sciences-Life-span and Life-course Studies
CiteScore
2.70
自引率
0.00%
发文量
37
审稿时长
22 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信