An Efficient Algorithm for top-k Queries on Uncertain Data Streams

2012 11th International Conference on Machine Learning and Applications Pub Date : 2012-12-12 DOI:10.1109/ICMLA.2012.57

Caiyan Dai, Ling Chen, Yixin Chen, Keming Tang

引用次数: 1

Abstract

We tackle the problem of answering maximum probabilistic top-k tuple set queries. We use a sliding-window model on uncertain data streams and present an efficient algorithm for processing sliding-window queries on uncertain streams. In each sliding window, the algorithm selects the k tuples with the highest probabilities from sets of different numbers of the tuples with the highest scores. Then, the algorithm computes existential probability of the top-k tuples, and chooses the set with the highest probability as the top-k query result. We theoretically prove the correctness of the algorithm. Our experimental results show that our algorithm requires lower time and space complexity than other existing algorithms.

查看原文本刊更多论文

不确定数据流上top-k查询的一种高效算法

我们解决了回答最大概率top-k元组集查询的问题。在不确定数据流上使用滑动窗口模型，提出了一种处理不确定数据流上滑动窗口查询的有效算法。在每个滑动窗口中，算法从得分最高的不同数量的元组中选择概率最高的k个元组。然后，算法计算top-k元组的存在概率，选择概率最高的集合作为top-k查询结果。从理论上证明了算法的正确性。实验结果表明，该算法所需的时间和空间复杂度较现有算法低。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 11th International Conference on Machine Learning and Applications

自引率

0.00%

发文量