Predicting the Performance of Privacy-Preserving Data Analytics Using Architecture Modelling and Simulation

2018 IEEE International Conference on Software Architecture (ICSA) Pub Date : 2018-04-01 DOI:10.1109/ICSA.2018.00026

Rajitha Yasaweerasinghelage, M. Staples, I. Weber, Hye-young Paik

{"title":"Predicting the Performance of Privacy-Preserving Data Analytics Using Architecture Modelling and Simulation","authors":"Rajitha Yasaweerasinghelage, M. Staples, I. Weber, Hye-young Paik","doi":"10.1109/ICSA.2018.00026","DOIUrl":null,"url":null,"abstract":"Privacy-preserving data analytics is an emerging technology which allows multiple parties to perform joint data analytics without disclosing source data to each other or a trusted third-party. A variety of platforms and protocols have been proposed in this domain. However, these systems are not yet widely used, and little is known about them from a software architecture and performance perspective. Here we investigate the feasibility of using architectural performance modelling and simulation tools for predicting the performance of privacy-preserving data analytics systems. We report on a lab-based experimental study of a privacy-preserving credit scoring application that uses an implementation of a partial homomorphic encryption scheme. The main experiments are on the impact of analytic problem size (number of data items and number of features), and cryptographic key length for the overall system performance. Our modelling approach performed with a relative error consistently under 5\\% when predicting the median learning time for the scoring application. We find that the use of this approach is feasible in this technology domain, and discuss how it can support architectural decision making on trade-offs between properties such as performance, cost, and security. We expect this to enable the evaluation and optimisation of privacy-preserving data analytics technologies.","PeriodicalId":142325,"journal":{"name":"2018 IEEE International Conference on Software Architecture (ICSA)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE International Conference on Software Architecture (ICSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSA.2018.00026","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Privacy-preserving data analytics is an emerging technology which allows multiple parties to perform joint data analytics without disclosing source data to each other or a trusted third-party. A variety of platforms and protocols have been proposed in this domain. However, these systems are not yet widely used, and little is known about them from a software architecture and performance perspective. Here we investigate the feasibility of using architectural performance modelling and simulation tools for predicting the performance of privacy-preserving data analytics systems. We report on a lab-based experimental study of a privacy-preserving credit scoring application that uses an implementation of a partial homomorphic encryption scheme. The main experiments are on the impact of analytic problem size (number of data items and number of features), and cryptographic key length for the overall system performance. Our modelling approach performed with a relative error consistently under 5\% when predicting the median learning time for the scoring application. We find that the use of this approach is feasible in this technology domain, and discuss how it can support architectural decision making on trade-offs between properties such as performance, cost, and security. We expect this to enable the evaluation and optimisation of privacy-preserving data analytics technologies.

查看原文本刊更多论文

使用架构建模和仿真预测隐私保护数据分析的性能

隐私保护数据分析是一种新兴技术，它允许多方执行联合数据分析，而无需向彼此或可信的第三方泄露源数据。在这个领域已经提出了各种各样的平台和协议。然而，这些系统还没有被广泛使用，从软件架构和性能的角度来看，人们对它们知之甚少。在这里，我们研究了使用架构性能建模和仿真工具来预测隐私保护数据分析系统性能的可行性。我们报告了一个基于实验室的实验研究的隐私保护信用评分应用程序，该应用程序使用部分同态加密方案的实现。主要的实验是分析问题的大小(数据项的数量和特征的数量)和加密密钥长度对整个系统性能的影响。在预测评分应用程序的中位数学习时间时，我们的建模方法的相对误差始终低于5%。我们发现在这个技术领域中使用这种方法是可行的，并讨论了它如何支持在性能、成本和安全性等属性之间进行权衡的体系结构决策。我们希望这能够对保护隐私的数据分析技术进行评估和优化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE International Conference on Software Architecture (ICSA)

自引率

0.00%

发文量