{"title":"用于不确定边缘物联网数据 K 主导天际线分析的分布式索引方案","authors":"Chuan-Chi Lai;Hsuan-Yu Lin;Chuan-Ming Liu","doi":"10.1109/TETC.2023.3326295","DOIUrl":null,"url":null,"abstract":"Skyline queries typically search a Pareto-optimal set from a given data set to solve the corresponding multiobjective optimization problem. As the number of criteria increases, the skyline presumes excessive data items, which yield a meaningless result. To address this curse of dimensionality, we proposed a \n<inline-formula><tex-math>$k$</tex-math></inline-formula>\n-dominant skyline in which the number of skyline members was reduced by relaxing the restriction on the number of dimensions, considering the uncertainty of data. Specifically, each data item was associated with a probability of appearance, which represented the probability of becoming a member of the \n<inline-formula><tex-math>$k$</tex-math></inline-formula>\n-dominant skyline. As data items appear continuously in data streams, the corresponding \n<inline-formula><tex-math>$k$</tex-math></inline-formula>\n-dominant skyline may vary with time. Therefore, an effective and rapid mechanism of updating the \n<inline-formula><tex-math>$k$</tex-math></inline-formula>\n-dominant skyline becomes crucial. Herein, we proposed two time-efficient schemes, Middle Indexing (MI) and All Indexing (AI), for \n<inline-formula><tex-math>$k$</tex-math></inline-formula>\n-dominant skyline in distributed edge-computing environments, where irrelevant data items can be effectively excluded from the compute to reduce the processing duration. Furthermore, the proposed schemes were validated with extensive experimental simulations. The experimental results demonstrated that the proposed MI and AI schemes reduced the computation time by approximately 13% and 56%, respectively, compared with the existing method.","PeriodicalId":13156,"journal":{"name":"IEEE Transactions on Emerging Topics in Computing","volume":"12 3","pages":"878-890"},"PeriodicalIF":5.1000,"publicationDate":"2023-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Distributed Indexing Schemes for K-Dominant Skyline Analytics on Uncertain Edge-IoT Data\",\"authors\":\"Chuan-Chi Lai;Hsuan-Yu Lin;Chuan-Ming Liu\",\"doi\":\"10.1109/TETC.2023.3326295\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Skyline queries typically search a Pareto-optimal set from a given data set to solve the corresponding multiobjective optimization problem. As the number of criteria increases, the skyline presumes excessive data items, which yield a meaningless result. To address this curse of dimensionality, we proposed a \\n<inline-formula><tex-math>$k$</tex-math></inline-formula>\\n-dominant skyline in which the number of skyline members was reduced by relaxing the restriction on the number of dimensions, considering the uncertainty of data. Specifically, each data item was associated with a probability of appearance, which represented the probability of becoming a member of the \\n<inline-formula><tex-math>$k$</tex-math></inline-formula>\\n-dominant skyline. As data items appear continuously in data streams, the corresponding \\n<inline-formula><tex-math>$k$</tex-math></inline-formula>\\n-dominant skyline may vary with time. Therefore, an effective and rapid mechanism of updating the \\n<inline-formula><tex-math>$k$</tex-math></inline-formula>\\n-dominant skyline becomes crucial. Herein, we proposed two time-efficient schemes, Middle Indexing (MI) and All Indexing (AI), for \\n<inline-formula><tex-math>$k$</tex-math></inline-formula>\\n-dominant skyline in distributed edge-computing environments, where irrelevant data items can be effectively excluded from the compute to reduce the processing duration. Furthermore, the proposed schemes were validated with extensive experimental simulations. The experimental results demonstrated that the proposed MI and AI schemes reduced the computation time by approximately 13% and 56%, respectively, compared with the existing method.\",\"PeriodicalId\":13156,\"journal\":{\"name\":\"IEEE Transactions on Emerging Topics in Computing\",\"volume\":\"12 3\",\"pages\":\"878-890\"},\"PeriodicalIF\":5.1000,\"publicationDate\":\"2023-10-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Emerging Topics in Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10298037/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Emerging Topics in Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10298037/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
天际线查询通常是从给定数据集中搜索帕累托最优集,以解决相应的多目标优化问题。随着标准数量的增加,天际线会假定过多的数据项,从而产生毫无意义的结果。考虑到数据的不确定性,我们提出了一种 "k$主导天际线",通过放宽维数限制来减少天际线成员的数量。具体来说,每个数据项都与出现概率相关联,而出现概率代表了成为 $k$ 主导天际线成员的概率。由于数据项在数据流中不断出现,相应的 $k$ 主导天际线可能会随时间而变化。因此,一种有效而快速的 $k$ 主导天际线更新机制变得至关重要。在此,我们针对分布式边缘计算环境中的 $k$ 主导天际线提出了两种省时方案:中间索引(MI)和全部索引(AI),其中不相关的数据项可以有效地排除在计算之外,从而缩短处理时间。此外,还通过大量的实验模拟验证了所提出的方案。实验结果表明,与现有方法相比,所提出的 MI 和 AI 方案分别减少了约 13% 和 56% 的计算时间。
Distributed Indexing Schemes for K-Dominant Skyline Analytics on Uncertain Edge-IoT Data
Skyline queries typically search a Pareto-optimal set from a given data set to solve the corresponding multiobjective optimization problem. As the number of criteria increases, the skyline presumes excessive data items, which yield a meaningless result. To address this curse of dimensionality, we proposed a
$k$
-dominant skyline in which the number of skyline members was reduced by relaxing the restriction on the number of dimensions, considering the uncertainty of data. Specifically, each data item was associated with a probability of appearance, which represented the probability of becoming a member of the
$k$
-dominant skyline. As data items appear continuously in data streams, the corresponding
$k$
-dominant skyline may vary with time. Therefore, an effective and rapid mechanism of updating the
$k$
-dominant skyline becomes crucial. Herein, we proposed two time-efficient schemes, Middle Indexing (MI) and All Indexing (AI), for
$k$
-dominant skyline in distributed edge-computing environments, where irrelevant data items can be effectively excluded from the compute to reduce the processing duration. Furthermore, the proposed schemes were validated with extensive experimental simulations. The experimental results demonstrated that the proposed MI and AI schemes reduced the computation time by approximately 13% and 56%, respectively, compared with the existing method.
期刊介绍:
IEEE Transactions on Emerging Topics in Computing publishes papers on emerging aspects of computer science, computing technology, and computing applications not currently covered by other IEEE Computer Society Transactions. Some examples of emerging topics in computing include: IT for Green, Synthetic and organic computing structures and systems, Advanced analytics, Social/occupational computing, Location-based/client computer systems, Morphic computer design, Electronic game systems, & Health-care IT.