{"title":"Multi-View Federated Learning with Data Collaboration","authors":"Yitao Yang, Xiucai Ye, Tetsuya Sakurai","doi":"10.1145/3529836.3529904","DOIUrl":null,"url":null,"abstract":"Under the privacy protection policy, federated learning has received more and more attention. Vertical federated learning (VFL) uses the same samples local in different parties to build prediction model. However, the same samples (overlapping samples) may be limited, while a large number of non-overlapping samples in each party are not utilized. If the non-overlapping samples can be utilized for training, it can benefit the prediction model. In this paper, we propose a novel VFL method, called Multi-View Federated Learning with Data collaboration (FedMC), to solve the problem of insufficient overlapping samples by exploiting suitable non-overlapping samples for data training. The proposed FedMC method first constructs a common feature space based on the overlapping samples, then projects the non-overlapping samples into the common feature space. We measure the similarity for each pair of the non-overlapping samples by calculating their distance in this space. When the distance is less than a threshold, we match them and add this pair to the overlapping samples. The expanded overlapping samples are finally used for training to build the prediction model. We evaluate the proposed method on real-world datasets. The experimental results show that the proposed method can improve the classification result by exploiting the non-overlapping samples for training.","PeriodicalId":285191,"journal":{"name":"2022 14th International Conference on Machine Learning and Computing (ICMLC)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Machine Learning and Computing (ICMLC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3529836.3529904","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
Under privacy protection policies, federated learning has received increasing attention. Vertical federated learning (VFL) builds a prediction model from samples that are common to different parties, each of which holds its own features locally. However, these common (overlapping) samples may be limited, while the large number of non-overlapping samples held by each party goes unused. If the non-overlapping samples can be utilized for training, they can benefit the prediction model. In this paper, we propose a novel VFL method, called Multi-View Federated Learning with Data Collaboration (FedMC), to address the problem of insufficient overlapping samples by exploiting suitable non-overlapping samples for training. The proposed FedMC method first constructs a common feature space based on the overlapping samples and then projects the non-overlapping samples into this space. We measure the similarity of each pair of non-overlapping samples by computing their distance in the common space; when the distance is below a threshold, we match the pair and add it to the overlapping samples. The expanded set of overlapping samples is finally used to train the prediction model. We evaluate the proposed method on real-world datasets. The experimental results show that the proposed method improves classification performance by exploiting the non-overlapping samples for training.
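The matching step outlined in the abstract can be illustrated with a short sketch. The Python snippet below is a minimal illustration only: it assumes a least-squares projection into a shared space learned from each party's overlapping samples, a synthetic anchor representation, and an arbitrary distance threshold. None of these choices come from the paper itself; they are stand-ins to show the overall flow (project non-overlapping samples, compute pairwise distances, match pairs below a threshold).

```python
import numpy as np

# Minimal sketch of the matching idea described in the abstract.
# The construction of the common feature space in FedMC is not specified
# here; as an assumption we learn a simple least-squares projection from
# each party's overlapping samples to a (synthetic) anchor representation.

rng = np.random.default_rng(0)
d_common = 5  # assumed dimensionality of the common feature space

# Overlapping samples: the same instances, different feature views per party.
X_overlap_a = rng.normal(size=(50, 8))   # party A features
X_overlap_b = rng.normal(size=(50, 12))  # party B features

# Assumed anchor representation of the overlapping samples in the common
# space; random here, purely for illustration.
Z_anchor = rng.normal(size=(50, d_common))

# Each party learns a projection from its own feature view into the common space.
W_a, *_ = np.linalg.lstsq(X_overlap_a, Z_anchor, rcond=None)
W_b, *_ = np.linalg.lstsq(X_overlap_b, Z_anchor, rcond=None)

# Project the non-overlapping samples of both parties into the common space.
X_non_a = rng.normal(size=(30, 8))
X_non_b = rng.normal(size=(40, 12))
Z_non_a = X_non_a @ W_a
Z_non_b = X_non_b @ W_b

# Match pairs whose distance in the common space falls below a threshold,
# then treat matched pairs as additional (pseudo-)overlapping samples.
threshold = 1.0  # assumed value; would be tuned in practice
dists = np.linalg.norm(Z_non_a[:, None, :] - Z_non_b[None, :, :], axis=-1)
pairs = [(i, int(dists[i].argmin()))
         for i in range(dists.shape[0])
         if dists[i].min() < threshold]
print(f"matched {len(pairs)} pseudo-overlapping pairs")
```

In this sketch the matched pairs would simply be appended to the overlapping set before model training; how FedMC actually builds the common space and trains the final model is described in the full paper, not here.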