Dynamic top-k range reporting in external memory
Cheng Sheng, Yufei Tao
Proceedings of the ... ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, pp. 121-130. Published 2012-05-21.
DOI: 10.1145/2213556.2213576 (https://doi.org/10.1145/2213556.2213576)
Citations: 17
Abstract
In the top-K range reporting problem, the dataset contains $N$ points in the real domain $\mathbb{R}$, each of which is associated with a real-valued score. Given an interval $[x_1, x_2]$ in $\mathbb{R}$ and an integer $K \le N$, a query returns the $K$ points in $[x_1, x_2]$ having the smallest scores. We want to store the dataset in a structure so that queries can be answered efficiently. In the external memory model, the state of the art is a static structure that consumes $O(N/B)$ space, answers a query in $O(\log_B N + K/B)$ time, and can be constructed in $O(N + (N \log N / B) \log_{M/B} (N/B))$ time, where $B$ is the size of a disk block and $M$ is the size of memory. We present a fully-dynamic structure that retains the same space and query bounds, and can be updated in $O(\log_B^2 N)$ amortized time per insertion and deletion. Our structure can be constructed in $O((N/B) \log_{M/B} (N/B))$ time.
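To make the query semantics concrete, here is a minimal in-memory sketch of what a top-K range query returns. It is a brute-force reference for the problem definition only, not the external-memory structure described in the abstract, and it does not achieve any of the stated bounds. The names `top_k_range` and `Point` are hypothetical, and returning fewer than K points when the interval holds fewer than K is an assumption of this sketch.

```python
import heapq
from typing import List, Tuple

# A point is (coordinate, score); both are real values.
Point = Tuple[float, float]


def top_k_range(points: List[Point], x1: float, x2: float, k: int) -> List[Point]:
    """Return the k points with coordinate in [x1, x2] that have the smallest scores.

    Brute force: scans every point, so it costs time linear in N per query.
    If fewer than k points fall in the interval, all of them are returned
    (an assumption of this sketch).
    """
    in_range = (p for p in points if x1 <= p[0] <= x2)
    # nsmallest returns its results ordered by ascending score.
    return heapq.nsmallest(k, in_range, key=lambda p: p[1])


points = [(1.0, 5.0), (2.0, 1.0), (3.5, 4.0), (4.0, 2.0), (7.0, 0.5)]
result = top_k_range(points, 2.0, 5.0, 2)
print(result)
```

In this example the interval [2.0, 5.0] contains three points, and the query keeps the two with the smallest scores. The point (7.0, 0.5) has the lowest score overall but is excluded because its coordinate lies outside the interval.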