{"title":"Performance Measurement of Federated Learning on Imbalanced Data","authors":"Pramote Sittijuk, Kriengsuk Tamee","doi":"10.1109/JCSSE53117.2021.9493819","DOIUrl":null,"url":null,"abstract":"AI often suffers from getting imbalanced data distribution as unequal samples in classes which will increase the bias of machine learning algorithms. This research aimed to study effects of skew data distribution towards development of data rebalancing on Federated learning (FL) in the future. This research sets left skewed distribution, right skewed distribution and symmetric distribution on Modified National Institute of Standards and Technology database (MNIST) to operate on Convolutional neural network (CNN) in FL mechanism. Then, FL’s performance for working on these imbalanced distributions was tested. Results showed that in overview, the symmetric, left skewed, and right skewed distribution were not different in accuracy but theses imbalanced distributions were different in accuracy from the balanced distribution which has equal samples in all classes at significant level of.05. Standard deviation (SD) of data distribution was directly correlated with FL’s accuracy in high level.","PeriodicalId":437534,"journal":{"name":"2021 18th International Joint Conference on Computer Science and Software Engineering (JCSSE)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 18th International Joint Conference on Computer Science and Software Engineering (JCSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JCSSE53117.2021.9493819","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
AI often suffers from getting imbalanced data distribution as unequal samples in classes which will increase the bias of machine learning algorithms. This research aimed to study effects of skew data distribution towards development of data rebalancing on Federated learning (FL) in the future. This research sets left skewed distribution, right skewed distribution and symmetric distribution on Modified National Institute of Standards and Technology database (MNIST) to operate on Convolutional neural network (CNN) in FL mechanism. Then, FL’s performance for working on these imbalanced distributions was tested. Results showed that in overview, the symmetric, left skewed, and right skewed distribution were not different in accuracy but theses imbalanced distributions were different in accuracy from the balanced distribution which has equal samples in all classes at significant level of.05. Standard deviation (SD) of data distribution was directly correlated with FL’s accuracy in high level.