A Privacy Preserving Scheme with Dimensionality Reduction for Distributed Machine Learning

2021 16th Asia Joint Conference on Information Security (AsiaJCIS) Pub Date : 2021-08-01 DOI:10.1109/AsiaJCIS53848.2021.00017

Zhao Chen, Kazumasa Omote

引用次数: 5

Abstract

To obtain useful results in machine learning, it is required to collect data from multiple institutions and learn with larger-scale data. However, data collected from multiple institutions may contain a lot of personal information and should not be explicitly shared. The existing research has proposed various methods to protect privacy by using encryption or anonymization, but encryption causes large computational costs, and anonymization may greatly reduce the usefulness of data. In this research, we propose a privacy protection method using dimensionality reduction that is difficult to reverse while maintaining the high usefulness of data. The main idea of our method is that combining dimensionality reduction algorithms with noise addition is useful for privacy-preserving data analysis with high accuracy. Furthermore, we evaluate the effectiveness and security of this method and show the utility of the proposed method.

查看原文本刊更多论文

面向分布式机器学习的降维隐私保护方案

为了在机器学习中获得有用的结果，需要从多个机构收集数据，并使用更大规模的数据进行学习。然而，从多个机构收集的数据可能包含大量个人信息，不应明确共享。现有的研究提出了多种保护隐私的方法，包括加密或匿名化，但加密会导致大量的计算成本，而匿名化可能会大大降低数据的有用性。在本研究中，我们提出了一种在保持数据高有用性的同时难以逆转的降维隐私保护方法。该方法的主要思想是将降维算法与噪声相加相结合，有助于实现高精度的隐私保护数据分析。此外，我们评估了该方法的有效性和安全性，并展示了该方法的实用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 16th Asia Joint Conference on Information Security (AsiaJCIS)

自引率

0.00%

发文量