{"title":"A Privacy Preserving Scheme with Dimensionality Reduction for Distributed Machine Learning","authors":"Zhao Chen, Kazumasa Omote","doi":"10.1109/AsiaJCIS53848.2021.00017","DOIUrl":null,"url":null,"abstract":"To obtain useful results in machine learning, it is required to collect data from multiple institutions and learn with larger-scale data. However, data collected from multiple institutions may contain a lot of personal information and should not be explicitly shared. The existing research has proposed various methods to protect privacy by using encryption or anonymization, but encryption causes large computational costs, and anonymization may greatly reduce the usefulness of data. In this research, we propose a privacy protection method using dimensionality reduction that is difficult to reverse while maintaining the high usefulness of data. The main idea of our method is that combining dimensionality reduction algorithms with noise addition is useful for privacy-preserving data analysis with high accuracy. Furthermore, we evaluate the effectiveness and security of this method and show the utility of the proposed method.","PeriodicalId":134911,"journal":{"name":"2021 16th Asia Joint Conference on Information Security (AsiaJCIS)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 16th Asia Joint Conference on Information Security (AsiaJCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AsiaJCIS53848.2021.00017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
To obtain useful results in machine learning, it is required to collect data from multiple institutions and learn with larger-scale data. However, data collected from multiple institutions may contain a lot of personal information and should not be explicitly shared. The existing research has proposed various methods to protect privacy by using encryption or anonymization, but encryption causes large computational costs, and anonymization may greatly reduce the usefulness of data. In this research, we propose a privacy protection method using dimensionality reduction that is difficult to reverse while maintaining the high usefulness of data. The main idea of our method is that combining dimensionality reduction algorithms with noise addition is useful for privacy-preserving data analysis with high accuracy. Furthermore, we evaluate the effectiveness and security of this method and show the utility of the proposed method.