ChuangMing Liu, Denis Pak, Ari Ernesto Ortiz Castellanos
{"title":"基于优先级的不完整数据Skyline查询处理","authors":"ChuangMing Liu, Denis Pak, Ari Ernesto Ortiz Castellanos","doi":"10.1145/3472163.3472272","DOIUrl":null,"url":null,"abstract":"Over the years, several skyline query techniques have been introduced to handle incompleteness of data, the most recent of which has proposed to sort the points of a dataset into several distinct lists based on each dimension. The points would be accessed based on these lists in round robin fashion, and the points that haven’t been dominated by the end would compose the final skyline. The work is based on the assumption that relatively dominant points, if sorted, would be processed first, and even if the point wouldn’t be a skyline point, it would prune huge amount of data. However, that approach doesn’t take into consideration that the dominance of a point depends not only on the highest value of a given dimension, but also on the number of complete dimensions a point has. Hence, we propose a Priority-First Sort-Based Incomplete Data Skyline (PFSIDS) that utilizes a different indexing technique that allows optimization of access based on both number of complete dimensions a point has as well as sorting of the data.","PeriodicalId":242683,"journal":{"name":"Proceedings of the 25th International Database Engineering & Applications Symposium","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Priority-Based Skyline Query Processing for Incomplete Data\",\"authors\":\"ChuangMing Liu, Denis Pak, Ari Ernesto Ortiz Castellanos\",\"doi\":\"10.1145/3472163.3472272\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Over the years, several skyline query techniques have been introduced to handle incompleteness of data, the most recent of which has proposed to sort the points of a dataset into several distinct lists based on each dimension. The points would be accessed based on these lists in round robin fashion, and the points that haven’t been dominated by the end would compose the final skyline. The work is based on the assumption that relatively dominant points, if sorted, would be processed first, and even if the point wouldn’t be a skyline point, it would prune huge amount of data. However, that approach doesn’t take into consideration that the dominance of a point depends not only on the highest value of a given dimension, but also on the number of complete dimensions a point has. Hence, we propose a Priority-First Sort-Based Incomplete Data Skyline (PFSIDS) that utilizes a different indexing technique that allows optimization of access based on both number of complete dimensions a point has as well as sorting of the data.\",\"PeriodicalId\":242683,\"journal\":{\"name\":\"Proceedings of the 25th International Database Engineering & Applications Symposium\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 25th International Database Engineering & Applications Symposium\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3472163.3472272\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 25th International Database Engineering & Applications Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3472163.3472272","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Priority-Based Skyline Query Processing for Incomplete Data
Over the years, several skyline query techniques have been introduced to handle incompleteness of data, the most recent of which has proposed to sort the points of a dataset into several distinct lists based on each dimension. The points would be accessed based on these lists in round robin fashion, and the points that haven’t been dominated by the end would compose the final skyline. The work is based on the assumption that relatively dominant points, if sorted, would be processed first, and even if the point wouldn’t be a skyline point, it would prune huge amount of data. However, that approach doesn’t take into consideration that the dominance of a point depends not only on the highest value of a given dimension, but also on the number of complete dimensions a point has. Hence, we propose a Priority-First Sort-Based Incomplete Data Skyline (PFSIDS) that utilizes a different indexing technique that allows optimization of access based on both number of complete dimensions a point has as well as sorting of the data.