COVID-19CT+：用于COVID-19回顾性分析的CT图像公共数据集。

IF 1.4 3区医学 Q3 INSTRUMENTS & INSTRUMENTATION

Journal of X-Ray Science and Technology Pub Date : 2025-09-01 Epub Date: 2025-05-23 DOI:10.1177/08953996251332793

Yihao Sun, Tianming Du, Bin Wang, Md Mamunur Rahaman, Xinghao Wang, Xinyu Huang, Tao Jiang, Marcin Grzegorzek, Hongzan Sun, Jian Xu, Chen Li

{"title":"COVID-19CT+：用于COVID-19回顾性分析的CT图像公共数据集。","authors":"Yihao Sun, Tianming Du, Bin Wang, Md Mamunur Rahaman, Xinghao Wang, Xinyu Huang, Tao Jiang, Marcin Grzegorzek, Hongzan Sun, Jian Xu, Chen Li","doi":"10.1177/08953996251332793","DOIUrl":null,"url":null,"abstract":"Background and objectiveCOVID-19 is considered as the biggest global health disaster in the 21st century, and it has a huge impact on the world.MethodsThis paper publishes a publicly available dataset of CT images of multiple types of pneumonia (COVID-19CT+). Specifically, the dataset contains 409,619 CT images of 1333 patients, with subset-A containing 312 community-acquired pneumonia cases and subset-B containing 1021 COVID-19 cases. In order to demonstrate that there are differences in the methods used to classify COVID-19CT+ images across time, we selected 13 classical machine learning classifiers and 5 deep learning classifiers to test the image classification task.ResultsIn this study, two sets of experiments are conducted using traditional machine learning and deep learning methods, the first set of experiments is the classification of COVID-19 in Subset-B versus COVID-19 white lung disease, and the second set of experiments is the classification of community-acquired pneumonia in Subset-A versus COVID-19 in Subset-B, demonstrating that the different periods of the methods differed on COVID-19CT+. On the first set of experiments, the accuracy of traditional machine learning reaches a maximum of 97.3% and a minimum of only 62.6%. Deep learning algorithms reaches a maximum of 97.9% and a minimum of 85.7%. On the second set of experiments, traditional machine learning reaches a high of 94.6% accuracy and a low of 56.8%. The deep learning algorithm reaches a high of 91.9% and a low of 86.3%.ConclusionsThe COVID-19CT+ in this study covers a large number of CT images of patients with COVID-19 and community-acquired pneumonia and is one of the largest datasets available. We expect that this dataset will attract more researchers to participate in exploring new automated diagnostic algorithms to contribute to the improvement of the diagnostic accuracy and efficiency of COVID-19.","PeriodicalId":49948,"journal":{"name":"Journal of X-Ray Science and Technology","volume":" ","pages":"901-915"},"PeriodicalIF":1.4000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"COVID-19CT+: A public dataset of CT images for COVID-19 retrospective analysis.\",\"authors\":\"Yihao Sun, Tianming Du, Bin Wang, Md Mamunur Rahaman, Xinghao Wang, Xinyu Huang, Tao Jiang, Marcin Grzegorzek, Hongzan Sun, Jian Xu, Chen Li\",\"doi\":\"10.1177/08953996251332793\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background and objectiveCOVID-19 is considered as the biggest global health disaster in the 21st century, and it has a huge impact on the world.MethodsThis paper publishes a publicly available dataset of CT images of multiple types of pneumonia (COVID-19CT+). Specifically, the dataset contains 409,619 CT images of 1333 patients, with subset-A containing 312 community-acquired pneumonia cases and subset-B containing 1021 COVID-19 cases. In order to demonstrate that there are differences in the methods used to classify COVID-19CT+ images across time, we selected 13 classical machine learning classifiers and 5 deep learning classifiers to test the image classification task.ResultsIn this study, two sets of experiments are conducted using traditional machine learning and deep learning methods, the first set of experiments is the classification of COVID-19 in Subset-B versus COVID-19 white lung disease, and the second set of experiments is the classification of community-acquired pneumonia in Subset-A versus COVID-19 in Subset-B, demonstrating that the different periods of the methods differed on COVID-19CT+. On the first set of experiments, the accuracy of traditional machine learning reaches a maximum of 97.3% and a minimum of only 62.6%. Deep learning algorithms reaches a maximum of 97.9% and a minimum of 85.7%. On the second set of experiments, traditional machine learning reaches a high of 94.6% accuracy and a low of 56.8%. The deep learning algorithm reaches a high of 91.9% and a low of 86.3%.ConclusionsThe COVID-19CT+ in this study covers a large number of CT images of patients with COVID-19 and community-acquired pneumonia and is one of the largest datasets available. We expect that this dataset will attract more researchers to participate in exploring new automated diagnostic algorithms to contribute to the improvement of the diagnostic accuracy and efficiency of COVID-19.\",\"PeriodicalId\":49948,\"journal\":{\"name\":\"Journal of X-Ray Science and Technology\",\"volume\":\" \",\"pages\":\"901-915\"},\"PeriodicalIF\":1.4000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of X-Ray Science and Technology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1177/08953996251332793\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/5/23 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"INSTRUMENTS & INSTRUMENTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of X-Ray Science and Technology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/08953996251332793","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/23 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}

引用次数: 0

摘要

背景与目的2019冠状病毒病（covid -19）被认为是21世纪最大的全球性卫生灾难，对世界产生了巨大影响。方法公开多类型肺炎（COVID-19CT+） CT图像数据集。具体而言，该数据集包含1333例患者的409619张CT图像，其中子集a包含312例社区获得性肺炎病例，子集b包含1021例COVID-19病例。为了证明不同时间对COVID-19CT+图像进行分类的方法存在差异，我们选择了13个经典机器学习分类器和5个深度学习分类器来测试图像分类任务。结果本研究采用传统机器学习和深度学习两种方法进行了两组实验，第一组实验是subset b中COVID-19与COVID-19白肺病的分类，第二组实验是subset a中社区获得性肺炎与subset b中COVID-19的分类，表明不同时期的方法在COVID-19CT+上存在差异。在第一组实验中，传统机器学习的准确率最高达到97.3%，最低只有62.6%。深度学习算法最高可达97.9%，最低可达85.7%。在第二组实验中，传统机器学习的准确率最高达到94.6%，最低达到56.8%。深度学习算法最高达到91.9%，最低达到86.3%。结论本研究的COVID-19CT+涵盖了大量COVID-19和社区获得性肺炎患者的CT图像，是目前最大的数据集之一。我们希望该数据集能够吸引更多的研究人员参与探索新的自动诊断算法，为提高COVID-19的诊断准确性和效率做出贡献。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

COVID-19CT+: A public dataset of CT images for COVID-19 retrospective analysis.

Background and objectiveCOVID-19 is considered as the biggest global health disaster in the 21st century, and it has a huge impact on the world.MethodsThis paper publishes a publicly available dataset of CT images of multiple types of pneumonia (COVID-19CT+). Specifically, the dataset contains 409,619 CT images of 1333 patients, with subset-A containing 312 community-acquired pneumonia cases and subset-B containing 1021 COVID-19 cases. In order to demonstrate that there are differences in the methods used to classify COVID-19CT+ images across time, we selected 13 classical machine learning classifiers and 5 deep learning classifiers to test the image classification task.ResultsIn this study, two sets of experiments are conducted using traditional machine learning and deep learning methods, the first set of experiments is the classification of COVID-19 in Subset-B versus COVID-19 white lung disease, and the second set of experiments is the classification of community-acquired pneumonia in Subset-A versus COVID-19 in Subset-B, demonstrating that the different periods of the methods differed on COVID-19CT+. On the first set of experiments, the accuracy of traditional machine learning reaches a maximum of 97.3% and a minimum of only 62.6%. Deep learning algorithms reaches a maximum of 97.9% and a minimum of 85.7%. On the second set of experiments, traditional machine learning reaches a high of 94.6% accuracy and a low of 56.8%. The deep learning algorithm reaches a high of 91.9% and a low of 86.3%.ConclusionsThe COVID-19CT+ in this study covers a large number of CT images of patients with COVID-19 and community-acquired pneumonia and is one of the largest datasets available. We expect that this dataset will attract more researchers to participate in exploring new automated diagnostic algorithms to contribute to the improvement of the diagnostic accuracy and efficiency of COVID-19.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of X-Ray Science and Technology 工程技术-光学

CiteScore

4.90

自引率

23.30%

发文量

150

审稿时长

3 months

期刊介绍： Research areas within the scope of the journal include: Interaction of x-rays with matter: x-ray phenomena, biological effects of radiation, radiation safety and optical constants X-ray sources: x-rays from synchrotrons, x-ray lasers, plasmas, and other sources, conventional or unconventional Optical elements: grazing incidence optics, multilayer mirrors, zone plates, gratings, other diffraction optics Optical instruments: interferometers, spectrometers, microscopes, telescopes, microprobes