HPC - privacy model for collaborative skyline processing

B. Y. L. Chan, V. Ng
{"title":"HPC - privacy model for collaborative skyline processing","authors":"B. Y. L. Chan, V. Ng","doi":"10.1109/ISI.2010.5484743","DOIUrl":null,"url":null,"abstract":"In general, skyline query is defined as finding a set of interesting database objects, which are not dominated to one another objects. A typical example is to find the hotel that is cheap and close to the beach. Since the introduction of skyline operator by Borzsonyi et al into database community, there has been a number of research works evolving and related publications related in last decade. However, there is only a few of them working on distributed skyline processing in collaborative computing environments. None of them considered the issue of privacy enforcement. The problem is that server has to disclose the sub-skylines (the actual skyline points) without privacy protection. In this paper, we propose the Hierarchical Piecewise Curve (HPC) model to enforce privacy during collaborative skyline processing and the private information can be released in a hierarchically controllable manner. Firstly we develop the polynomial expressions of Piecewise Curve (PC) by Spline interpolation to approximate the actual sub-skyline points. Figure 1 graphically showed the approximation. With Spline function, PC in R knocks are defined as: equations where there is no intersection among all knocks and the corresponding Mean Square Error (MSE) is defined as: equations. Secondly, we define the operators for the PC. If we have two servers working on the skyline query, we may have two Curve, c1 and c2 with respective intervals as a ≤ x ≤ b and n ≤ x ≤ m. We observed that there are 3 categories of relationships. First, c1 totally dominates c2; Second, c1 and c2 are totally independent; Third, c1 partially dominates c2. In the experiments, we observe that increasing the order of the polynomial and/or the number of PC resulted in reduction of MSE. Moreover, we observed the performance dropped when number of object in database increased. Meanwhile, the performance of skyline processing by the HPC model with 10 servers and 20 servers were relatively static when the database size increased. The poor performance of traditional approach was bottlenecked at constructing the global database for computing the global skyline. In the contrary, HPC model enabled distributed sub-skyline processing. Although there was computation overhead for merging curves (by equation 13), it could take advantage of distributing skyline computation among servers. Technically, we demonstrated Piecewise Curves (PC) as an answer approximation to response to the skyline query instead of actual skyline points. From the preliminary experimental results, we observed that the performance of HPC model for skyline processing out performance the traditional approach in distributed and cooperative computing environments.","PeriodicalId":434501,"journal":{"name":"2010 IEEE International Conference on Intelligence and Security Informatics","volume":"40 9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Intelligence and Security Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISI.2010.5484743","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In general, skyline query is defined as finding a set of interesting database objects, which are not dominated to one another objects. A typical example is to find the hotel that is cheap and close to the beach. Since the introduction of skyline operator by Borzsonyi et al into database community, there has been a number of research works evolving and related publications related in last decade. However, there is only a few of them working on distributed skyline processing in collaborative computing environments. None of them considered the issue of privacy enforcement. The problem is that server has to disclose the sub-skylines (the actual skyline points) without privacy protection. In this paper, we propose the Hierarchical Piecewise Curve (HPC) model to enforce privacy during collaborative skyline processing and the private information can be released in a hierarchically controllable manner. Firstly we develop the polynomial expressions of Piecewise Curve (PC) by Spline interpolation to approximate the actual sub-skyline points. Figure 1 graphically showed the approximation. With Spline function, PC in R knocks are defined as: equations where there is no intersection among all knocks and the corresponding Mean Square Error (MSE) is defined as: equations. Secondly, we define the operators for the PC. If we have two servers working on the skyline query, we may have two Curve, c1 and c2 with respective intervals as a ≤ x ≤ b and n ≤ x ≤ m. We observed that there are 3 categories of relationships. First, c1 totally dominates c2; Second, c1 and c2 are totally independent; Third, c1 partially dominates c2. In the experiments, we observe that increasing the order of the polynomial and/or the number of PC resulted in reduction of MSE. Moreover, we observed the performance dropped when number of object in database increased. Meanwhile, the performance of skyline processing by the HPC model with 10 servers and 20 servers were relatively static when the database size increased. The poor performance of traditional approach was bottlenecked at constructing the global database for computing the global skyline. In the contrary, HPC model enabled distributed sub-skyline processing. Although there was computation overhead for merging curves (by equation 13), it could take advantage of distributing skyline computation among servers. Technically, we demonstrated Piecewise Curves (PC) as an answer approximation to response to the skyline query instead of actual skyline points. From the preliminary experimental results, we observed that the performance of HPC model for skyline processing out performance the traditional approach in distributed and cooperative computing environments.
协同天际线处理的HPC -隐私模型
一般来说,skyline查询被定义为查找一组感兴趣的数据库对象,这些对象之间不受其他对象的支配。一个典型的例子是找一家便宜又靠近海滩的酒店。自从Borzsonyi等人将skyline算子引入数据库界以来,近十年来出现了大量的研究工作和相关出版物。然而,在协作计算环境下的分布式天际线处理方面的研究还很少。他们都没有考虑到隐私执法的问题。问题是服务器必须在没有隐私保护的情况下公开子天际线(实际的天际线点)。本文提出了层次分段曲线(HPC)模型来增强协同天际线处理过程中的隐私性,使隐私信息以层次可控的方式发布。首先,我们利用样条插值建立分段曲线(PC)的多项式表达式来逼近实际的次天际线点。图1以图形方式显示了近似结果。用样条函数定义R次敲打中的PC为:,所有敲打之间不相交的方程,相应的均方误差(MSE)定义为:方程。其次,我们定义了PC机的操作符。如果我们有两个服务器处理skyline查询,我们可能有两个曲线c1和c2,它们各自的间隔为a≤x≤b和n≤x≤m。我们观察到有3类关系。首先,c1完全优于c2;第二,c1和c2是完全独立的;第三,c1部分优于c2。在实验中,我们观察到增加多项式的阶数和/或PC的数量会导致MSE的降低。此外,我们观察到,随着数据库中对象数量的增加,性能下降。同时,当数据库规模增加时,10台服务器和20台服务器的HPC模型的skyline处理性能相对静态。传统方法在构建用于计算全球天际线的全球数据库方面存在性能不佳的瓶颈。相反,HPC模型使分布式子天际线处理成为可能。虽然合并曲线有计算开销(通过公式13),但它可以利用在服务器之间分配天际线计算的优势。从技术上讲,我们演示了分段曲线(PC)作为响应天际线查询的近似答案,而不是实际的天际线点。从初步的实验结果来看,HPC模型在分布式和协同计算环境下处理天际线的性能优于传统方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信