基于优先级的不完整数据Skyline查询处理

Proceedings of the 25th International Database Engineering & Applications Symposium Pub Date : 2021-07-14 DOI:10.1145/3472163.3472272

ChuangMing Liu, Denis Pak, Ari Ernesto Ortiz Castellanos

{"title":"基于优先级的不完整数据Skyline查询处理","authors":"ChuangMing Liu, Denis Pak, Ari Ernesto Ortiz Castellanos","doi":"10.1145/3472163.3472272","DOIUrl":null,"url":null,"abstract":"Over the years, several skyline query techniques have been introduced to handle incompleteness of data, the most recent of which has proposed to sort the points of a dataset into several distinct lists based on each dimension. The points would be accessed based on these lists in round robin fashion, and the points that haven’t been dominated by the end would compose the final skyline. The work is based on the assumption that relatively dominant points, if sorted, would be processed first, and even if the point wouldn’t be a skyline point, it would prune huge amount of data. However, that approach doesn’t take into consideration that the dominance of a point depends not only on the highest value of a given dimension, but also on the number of complete dimensions a point has. Hence, we propose a Priority-First Sort-Based Incomplete Data Skyline (PFSIDS) that utilizes a different indexing technique that allows optimization of access based on both number of complete dimensions a point has as well as sorting of the data.","PeriodicalId":242683,"journal":{"name":"Proceedings of the 25th International Database Engineering & Applications Symposium","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Priority-Based Skyline Query Processing for Incomplete Data\",\"authors\":\"ChuangMing Liu, Denis Pak, Ari Ernesto Ortiz Castellanos\",\"doi\":\"10.1145/3472163.3472272\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Over the years, several skyline query techniques have been introduced to handle incompleteness of data, the most recent of which has proposed to sort the points of a dataset into several distinct lists based on each dimension. The points would be accessed based on these lists in round robin fashion, and the points that haven’t been dominated by the end would compose the final skyline. The work is based on the assumption that relatively dominant points, if sorted, would be processed first, and even if the point wouldn’t be a skyline point, it would prune huge amount of data. However, that approach doesn’t take into consideration that the dominance of a point depends not only on the highest value of a given dimension, but also on the number of complete dimensions a point has. Hence, we propose a Priority-First Sort-Based Incomplete Data Skyline (PFSIDS) that utilizes a different indexing technique that allows optimization of access based on both number of complete dimensions a point has as well as sorting of the data.\",\"PeriodicalId\":242683,\"journal\":{\"name\":\"Proceedings of the 25th International Database Engineering & Applications Symposium\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 25th International Database Engineering & Applications Symposium\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3472163.3472272\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 25th International Database Engineering & Applications Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3472163.3472272","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

多年来，已经引入了几种天际线查询技术来处理数据的不完整性，最近的一种技术提出将数据集的点根据每个维度排序为几个不同的列表。这些点将以循环方式基于这些列表进行访问，而未被最终控制的点将组成最终的天际线。这项工作是基于这样的假设:相对重要的点，如果排序，将首先被处理，即使这个点不是天际线点，它也会减少大量的数据。然而，这种方法没有考虑到一个点的主导地位不仅取决于给定维度的最大值，还取决于一个点所拥有的完整维度的数量。因此，我们提出了一种基于优先排序的不完整数据天际线(PFSIDS)，它利用一种不同的索引技术，允许基于一个点的完整维度数和数据排序来优化访问。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Priority-Based Skyline Query Processing for Incomplete Data

Over the years, several skyline query techniques have been introduced to handle incompleteness of data, the most recent of which has proposed to sort the points of a dataset into several distinct lists based on each dimension. The points would be accessed based on these lists in round robin fashion, and the points that haven’t been dominated by the end would compose the final skyline. The work is based on the assumption that relatively dominant points, if sorted, would be processed first, and even if the point wouldn’t be a skyline point, it would prune huge amount of data. However, that approach doesn’t take into consideration that the dominance of a point depends not only on the highest value of a given dimension, but also on the number of complete dimensions a point has. Hence, we propose a Priority-First Sort-Based Incomplete Data Skyline (PFSIDS) that utilizes a different indexing technique that allows optimization of access based on both number of complete dimensions a point has as well as sorting of the data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 25th International Database Engineering & Applications Symposium

自引率

0.00%

发文量