Khaled M. Elshabrawy, Mayar M. Alfares, Mohammed Abdel-Megeed Salem
{"title":"基于集成联邦学习的非ii型COVID-19检测","authors":"Khaled M. Elshabrawy, Mayar M. Alfares, Mohammed Abdel-Megeed Salem","doi":"10.1109/icci54321.2022.9756090","DOIUrl":null,"url":null,"abstract":"In light of the COVID-19 pandemic, the need for a chest X-ray scans classifier is crucial in order to diagnose patients and classify scans into normal, COVID-infected, and pneumonia. Federated learning was chosen for the classification as it uses a decentralized approach to train the model at the local servers belonging to each entity in various geographic locations. Therefore, information leakage that could happen from the traditional centralized approach of training is prevented, besides saving the huge cost of central storage. However, between the vast difference in the number of X-ray scans per data-silo (i.e. hospital), the dissimilar image-acquisition techniques, and the diverse morphological structures of the human chest, non-IID (non-Independent and Identically Distributed) skews are introduced in the data. In this paper, real-world datasets of COVID and pneumonia scans are used to satisfy all the non-IID data skews. An experiment was then conducted to test the effect of these skews using five federated learning algorithms, FedAvg, FedProx, FedNova, SCAFFOLD, and FedBN, under the same metrics. The obtained accuracy values are 79.5%, 76.92%, 5.57%, 79.18%, and 84.4%, respectively. In this paper, we present the different effects of non-IID skews on the training process and discuss the different federated learning variations to mitigate the data heterogeneity.","PeriodicalId":122550,"journal":{"name":"2022 5th International Conference on Computing and Informatics (ICCI)","volume":"116 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Ensemble Federated Learning for Non-II D COVID-19 Detection\",\"authors\":\"Khaled M. Elshabrawy, Mayar M. Alfares, Mohammed Abdel-Megeed Salem\",\"doi\":\"10.1109/icci54321.2022.9756090\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In light of the COVID-19 pandemic, the need for a chest X-ray scans classifier is crucial in order to diagnose patients and classify scans into normal, COVID-infected, and pneumonia. Federated learning was chosen for the classification as it uses a decentralized approach to train the model at the local servers belonging to each entity in various geographic locations. Therefore, information leakage that could happen from the traditional centralized approach of training is prevented, besides saving the huge cost of central storage. However, between the vast difference in the number of X-ray scans per data-silo (i.e. hospital), the dissimilar image-acquisition techniques, and the diverse morphological structures of the human chest, non-IID (non-Independent and Identically Distributed) skews are introduced in the data. In this paper, real-world datasets of COVID and pneumonia scans are used to satisfy all the non-IID data skews. An experiment was then conducted to test the effect of these skews using five federated learning algorithms, FedAvg, FedProx, FedNova, SCAFFOLD, and FedBN, under the same metrics. The obtained accuracy values are 79.5%, 76.92%, 5.57%, 79.18%, and 84.4%, respectively. In this paper, we present the different effects of non-IID skews on the training process and discuss the different federated learning variations to mitigate the data heterogeneity.\",\"PeriodicalId\":122550,\"journal\":{\"name\":\"2022 5th International Conference on Computing and Informatics (ICCI)\",\"volume\":\"116 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-03-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 5th International Conference on Computing and Informatics (ICCI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/icci54321.2022.9756090\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 5th International Conference on Computing and Informatics (ICCI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icci54321.2022.9756090","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Ensemble Federated Learning for Non-II D COVID-19 Detection
In light of the COVID-19 pandemic, the need for a chest X-ray scans classifier is crucial in order to diagnose patients and classify scans into normal, COVID-infected, and pneumonia. Federated learning was chosen for the classification as it uses a decentralized approach to train the model at the local servers belonging to each entity in various geographic locations. Therefore, information leakage that could happen from the traditional centralized approach of training is prevented, besides saving the huge cost of central storage. However, between the vast difference in the number of X-ray scans per data-silo (i.e. hospital), the dissimilar image-acquisition techniques, and the diverse morphological structures of the human chest, non-IID (non-Independent and Identically Distributed) skews are introduced in the data. In this paper, real-world datasets of COVID and pneumonia scans are used to satisfy all the non-IID data skews. An experiment was then conducted to test the effect of these skews using five federated learning algorithms, FedAvg, FedProx, FedNova, SCAFFOLD, and FedBN, under the same metrics. The obtained accuracy values are 79.5%, 76.92%, 5.57%, 79.18%, and 84.4%, respectively. In this paper, we present the different effects of non-IID skews on the training process and discuss the different federated learning variations to mitigate the data heterogeneity.