结合血液检测和定量计算机断层扫描参数的COVID-19重症病例早期检测机器学习辅助模型的开发

Xiaoqi Huang, Ke Shi, Jie Zhou, Yuxuan Liang, Yaliang Liu, Jinpin Zhang, Youmin Guo, C. Jin
{"title":"结合血液检测和定量计算机断层扫描参数的COVID-19重症病例早期检测机器学习辅助模型的开发","authors":"Xiaoqi Huang, Ke Shi, Jie Zhou, Yuxuan Liang, Yaliang Liu, Jinpin Zhang, Youmin Guo, C. Jin","doi":"10.1166/jmihi.2021.3866","DOIUrl":null,"url":null,"abstract":"Purpose: This study aimed to identify severe Coronavirus Disease 2019 (COVID-19) cases combining blood test results and imaging parameters based on a machine learning classifier at the initial admission. Materials\n and methods: Ninety-five non-severe and 22 severe laboratory-confirmed COVID-19 cases treated between January 23, 2020 and March 25, 2020 were examined in this retrospective trial. Blood test results and chest computed tomography (CT) images were obtained at the initial\n admission. The lesions on CT images were segmented using an artificial intelligent (AI) tool. Then, quantitative CT (QCT) parameters, including the volume, percentage, ground glass opacity (GGO) percentage and heterogeneity of the lesions were calculated. Correlations of blood test results\n and QCT parameters were analyzed by the Pearson test first. Then, discriminative features for detecting severe cases were selected by both the independent samples t test and least absolute shrinkage and selection operator (LASSO) regression. Next, support vector machine (SVM),\n Gaussian naïve Bayes (GNB), Knearest neighbor (KNN), decision tree (DT), random forest (RF) and multi-layer perceptron-neural net (MLP-NN) algorithms were used as classifiers, and their accuracies were assessed by 10-fold-cross-validation. Results:\n Blood test indexes and CT parameters were moderately to medially correlated. Of all selected features, lesion percentage contributed mostly to the classification of the two groups, followed by lesion volume, patient age, lymphocyte count, neutrophil count, GGO percentage and tumor heterogeneity.\n RF-assisted identification had the highest accuracy of 91.38%, followed by GNB (87.83%), KNN (87.93%), SVM (86.21%), MLP-NN (85.34%) and DT (84.48%). Conclusions: The RF-assisted model combining blood test and QCT parameters is\n helpful in the identification of severe COVID-19 cases.","PeriodicalId":393031,"journal":{"name":"J. Medical Imaging Health Informatics","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Development of a Machine Learning-Assisted Model for the Early Detection of Severe COVID-19 Cases Combining Blood Test and Quantitative Computed Tomography Parameters\",\"authors\":\"Xiaoqi Huang, Ke Shi, Jie Zhou, Yuxuan Liang, Yaliang Liu, Jinpin Zhang, Youmin Guo, C. Jin\",\"doi\":\"10.1166/jmihi.2021.3866\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Purpose: This study aimed to identify severe Coronavirus Disease 2019 (COVID-19) cases combining blood test results and imaging parameters based on a machine learning classifier at the initial admission. Materials\\n and methods: Ninety-five non-severe and 22 severe laboratory-confirmed COVID-19 cases treated between January 23, 2020 and March 25, 2020 were examined in this retrospective trial. Blood test results and chest computed tomography (CT) images were obtained at the initial\\n admission. The lesions on CT images were segmented using an artificial intelligent (AI) tool. Then, quantitative CT (QCT) parameters, including the volume, percentage, ground glass opacity (GGO) percentage and heterogeneity of the lesions were calculated. Correlations of blood test results\\n and QCT parameters were analyzed by the Pearson test first. Then, discriminative features for detecting severe cases were selected by both the independent samples t test and least absolute shrinkage and selection operator (LASSO) regression. Next, support vector machine (SVM),\\n Gaussian naïve Bayes (GNB), Knearest neighbor (KNN), decision tree (DT), random forest (RF) and multi-layer perceptron-neural net (MLP-NN) algorithms were used as classifiers, and their accuracies were assessed by 10-fold-cross-validation. Results:\\n Blood test indexes and CT parameters were moderately to medially correlated. Of all selected features, lesion percentage contributed mostly to the classification of the two groups, followed by lesion volume, patient age, lymphocyte count, neutrophil count, GGO percentage and tumor heterogeneity.\\n RF-assisted identification had the highest accuracy of 91.38%, followed by GNB (87.83%), KNN (87.93%), SVM (86.21%), MLP-NN (85.34%) and DT (84.48%). Conclusions: The RF-assisted model combining blood test and QCT parameters is\\n helpful in the identification of severe COVID-19 cases.\",\"PeriodicalId\":393031,\"journal\":{\"name\":\"J. Medical Imaging Health Informatics\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"J. Medical Imaging Health Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1166/jmihi.2021.3866\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Medical Imaging Health Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1166/jmihi.2021.3866","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

目的:基于机器学习分类器,结合入院时的血液检查结果和影像学参数,识别2019年严重冠状病毒病(COVID-19)病例。材料与方法:回顾性分析2020年1月23日至2020年3月25日收治的95例非重症病例和22例重症实验室确诊病例。入院时进行血液检查和胸部计算机断层扫描(CT)。使用人工智能(AI)工具对CT图像上的病变进行分割。然后计算定量CT (QCT)参数,包括病灶的体积、百分比、磨玻璃不透明度(GGO)百分比和异质性。首先用Pearson检验分析血液检测结果与QCT参数的相关性。然后,通过独立样本t检验和最小绝对收缩和选择算子(LASSO)回归选择检测重症病例的判别特征。接下来,采用支持向量机(SVM)、高斯naïve贝叶斯(GNB)、最近邻(KNN)、决策树(DT)、随机森林(RF)和多层感知器-神经网络(MLP-NN)算法作为分类器,并通过10倍交叉验证对其准确率进行评估。结果:血检指标与CT参数呈中、中等相关性。在所有选择的特征中,病变百分率对两组的分类贡献最大,其次是病变体积、患者年龄、淋巴细胞计数、中性粒细胞计数、GGO百分比和肿瘤异质性。rf辅助识别准确率最高,为91.38%,其次是GNB(87.83%)、KNN(87.93%)、SVM(86.21%)、MLP-NN(85.34%)和DT(84.48%)。结论:结合血液检测和QCT参数的rf辅助模型有助于COVID-19重症病例的识别。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Development of a Machine Learning-Assisted Model for the Early Detection of Severe COVID-19 Cases Combining Blood Test and Quantitative Computed Tomography Parameters
Purpose: This study aimed to identify severe Coronavirus Disease 2019 (COVID-19) cases combining blood test results and imaging parameters based on a machine learning classifier at the initial admission. Materials and methods: Ninety-five non-severe and 22 severe laboratory-confirmed COVID-19 cases treated between January 23, 2020 and March 25, 2020 were examined in this retrospective trial. Blood test results and chest computed tomography (CT) images were obtained at the initial admission. The lesions on CT images were segmented using an artificial intelligent (AI) tool. Then, quantitative CT (QCT) parameters, including the volume, percentage, ground glass opacity (GGO) percentage and heterogeneity of the lesions were calculated. Correlations of blood test results and QCT parameters were analyzed by the Pearson test first. Then, discriminative features for detecting severe cases were selected by both the independent samples t test and least absolute shrinkage and selection operator (LASSO) regression. Next, support vector machine (SVM), Gaussian naïve Bayes (GNB), Knearest neighbor (KNN), decision tree (DT), random forest (RF) and multi-layer perceptron-neural net (MLP-NN) algorithms were used as classifiers, and their accuracies were assessed by 10-fold-cross-validation. Results: Blood test indexes and CT parameters were moderately to medially correlated. Of all selected features, lesion percentage contributed mostly to the classification of the two groups, followed by lesion volume, patient age, lymphocyte count, neutrophil count, GGO percentage and tumor heterogeneity. RF-assisted identification had the highest accuracy of 91.38%, followed by GNB (87.83%), KNN (87.93%), SVM (86.21%), MLP-NN (85.34%) and DT (84.48%). Conclusions: The RF-assisted model combining blood test and QCT parameters is helpful in the identification of severe COVID-19 cases.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信