Clinical presentation of COVID-19 - a model derived by a machine learning algorithm.

IF 1.8 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Journal of Integrative Bioinformatics Pub Date : 2021-03-04 DOI:10.1515/jib-2020-0050

Malik Yousef, Louise C Showe, Izhar Ben Shlomo

{"title":"Clinical presentation of COVID-19 - a model derived by a machine learning algorithm.","authors":"Malik Yousef, Louise C Showe, Izhar Ben Shlomo","doi":"10.1515/jib-2020-0050","DOIUrl":null,"url":null,"abstract":"<p><p>COVID-19 pandemic has flooded all triage stations, making it difficult to carefully select those most likely infected. Data on total patients tested, infected, and hospitalized is fragmentary making it difficult to easily select those most likely to be infected. The Israeli Ministry of Health made public its registry of immediate clinical data and the respective status of infected/not infected for all viral DNA tests performed up to Apr. 18th, 2020 including almost 120,000 tests. We used a machine-learning algorithm to find out which immediate clinical elements mattered the most in identifying the true status of the tested persons including age or gender matter, to enable future better allocation of surveillance policy for those belonging to high-risk groups. In addition to the analyses applied on the first batch of the available data (Apr. 11th), we further tested the algorithm on the independent second batch (Apr. 12th to 18th). Fever, cough and headache were the most diagnostic, differing in degree of importance in different subgroups. Higher percentage of men were found positive (9.3 vs. 7.3%), but gender did not matter for the clinical presentation. The prediction power of the model was high, with accuracy of 0.84 and area under the curve 0.92. We provide a hand-held short checklist with verbal description of importance for the leading symptoms, which should expedite the triage and enable proper selection of people for further follow-up.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":"18 1","pages":"3-8"},"PeriodicalIF":1.8000,"publicationDate":"2021-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8035960/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Integrative Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1515/jib-2020-0050","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

COVID-19 pandemic has flooded all triage stations, making it difficult to carefully select those most likely infected. Data on total patients tested, infected, and hospitalized is fragmentary making it difficult to easily select those most likely to be infected. The Israeli Ministry of Health made public its registry of immediate clinical data and the respective status of infected/not infected for all viral DNA tests performed up to Apr. 18th, 2020 including almost 120,000 tests. We used a machine-learning algorithm to find out which immediate clinical elements mattered the most in identifying the true status of the tested persons including age or gender matter, to enable future better allocation of surveillance policy for those belonging to high-risk groups. In addition to the analyses applied on the first batch of the available data (Apr. 11th), we further tested the algorithm on the independent second batch (Apr. 12th to 18th). Fever, cough and headache were the most diagnostic, differing in degree of importance in different subgroups. Higher percentage of men were found positive (9.3 vs. 7.3%), but gender did not matter for the clinical presentation. The prediction power of the model was high, with accuracy of 0.84 and area under the curve 0.92. We provide a hand-held short checklist with verbal description of importance for the leading symptoms, which should expedite the triage and enable proper selection of people for further follow-up.

Abstract Image

查看原文本刊更多论文

COVID-19 的临床表现--通过机器学习算法得出的模型。

COVID-19 大流行充斥着所有分流站，很难仔细挑选出最有可能被感染的患者。关于接受检测、受感染和住院患者总数的数据非常零散，因此很难轻松选出最有可能受感染的患者。以色列卫生部公布了截至 2020 年 4 月 18 日进行的所有病毒 DNA 检测的即时临床数据和感染/未感染状态登记，其中包括近 12 万次检测。我们使用了一种机器学习算法，以找出哪些即时临床要素（包括年龄或性别要素）对确定受检者的真实状况最为重要，从而在未来更好地分配针对高危人群的监测政策。除了对第一批可用数据（4 月 11 日）进行分析外，我们还对独立的第二批数据（4 月 12 日至 18 日）进行了进一步测试。发热、咳嗽和头痛是最具诊断意义的症状，在不同的亚群中其重要程度不同。男性阳性比例较高（9.3% 对 7.3%），但性别与临床表现无关。该模型的预测能力很强，准确率为 0.84，曲线下面积为 0.92。我们提供了一份手持式简短核对表，并对主要症状的重要性进行了口头描述，该核对表应能加快分诊速度，并能正确选择需要进一步随访的人群。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊