{"title":"预训练胸部疾病检测系统对大规模胸部x射线域数据集的影响","authors":"Shafinul Haque, Jonathan H. Chan","doi":"10.1145/3486713.3486735","DOIUrl":null,"url":null,"abstract":"The COVID-19 pandemic has impacted many countries around the world resulting in the need to develop quick and effective screening methods to ease the burden and overcome the limitations of varying healthcare capacities. Given the nature of the disease, the use of Chest X-ray (CXR) medical imaging has proven to be very useful which has prompted the exploration of computer-aided diagnosis tools to augment and assist radiologists. However, recent reports have deemed many of the proposed methods to be impractical for use in real-life applications due to models with poor generalization capabilities, an issue closely related to the quality of current datasets in the CXR domain. Typically, deep convolutional neural network (CNN) based classification systems utilize transfer learning techniques when data is limited. We suggest first training models on publicly available large-scale and CXR specific datasets, such as CheXpert, and using these pretrained weights when initializing the final model. Compared with a CNN pretrained on the more general ImageNet dataset, pretraining on large-scale domain specific data increased the model's ability to generalize to unseen data.","PeriodicalId":268366,"journal":{"name":"The 12th International Conference on Computational Systems-Biology and Bioinformatics","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Effect of PreTraining Thoracic Disease Detection Systems on Large-Scale Chest X-Ray Domain Datasets\",\"authors\":\"Shafinul Haque, Jonathan H. Chan\",\"doi\":\"10.1145/3486713.3486735\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The COVID-19 pandemic has impacted many countries around the world resulting in the need to develop quick and effective screening methods to ease the burden and overcome the limitations of varying healthcare capacities. Given the nature of the disease, the use of Chest X-ray (CXR) medical imaging has proven to be very useful which has prompted the exploration of computer-aided diagnosis tools to augment and assist radiologists. However, recent reports have deemed many of the proposed methods to be impractical for use in real-life applications due to models with poor generalization capabilities, an issue closely related to the quality of current datasets in the CXR domain. Typically, deep convolutional neural network (CNN) based classification systems utilize transfer learning techniques when data is limited. We suggest first training models on publicly available large-scale and CXR specific datasets, such as CheXpert, and using these pretrained weights when initializing the final model. Compared with a CNN pretrained on the more general ImageNet dataset, pretraining on large-scale domain specific data increased the model's ability to generalize to unseen data.\",\"PeriodicalId\":268366,\"journal\":{\"name\":\"The 12th International Conference on Computational Systems-Biology and Bioinformatics\",\"volume\":\"31 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The 12th International Conference on Computational Systems-Biology and Bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3486713.3486735\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 12th International Conference on Computational Systems-Biology and Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3486713.3486735","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The Effect of PreTraining Thoracic Disease Detection Systems on Large-Scale Chest X-Ray Domain Datasets
The COVID-19 pandemic has impacted many countries around the world resulting in the need to develop quick and effective screening methods to ease the burden and overcome the limitations of varying healthcare capacities. Given the nature of the disease, the use of Chest X-ray (CXR) medical imaging has proven to be very useful which has prompted the exploration of computer-aided diagnosis tools to augment and assist radiologists. However, recent reports have deemed many of the proposed methods to be impractical for use in real-life applications due to models with poor generalization capabilities, an issue closely related to the quality of current datasets in the CXR domain. Typically, deep convolutional neural network (CNN) based classification systems utilize transfer learning techniques when data is limited. We suggest first training models on publicly available large-scale and CXR specific datasets, such as CheXpert, and using these pretrained weights when initializing the final model. Compared with a CNN pretrained on the more general ImageNet dataset, pretraining on large-scale domain specific data increased the model's ability to generalize to unseen data.