Dynamic top-k range reporting in external memory
Authors: Cheng Sheng, Yufei Tao
Venue: Proceedings of the ... ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, volume 20 1, pages 121-130
Publication date: 2012-05-21
DOI: https://doi.org/10.1145/2213556.2213576
Source: Semanticscholar (citation count: 17)
In the top-K range reporting problem, the dataset contains $N$ points in the real domain ℜ, each of which is associated with a real-valued score. Given an interval $[x_1, x_2]$ in ℜ and an integer $K \le N$, a query returns the $K$ points in $[x_1, x_2]$ having the smallest scores. We want to store the dataset in a structure so that queries can be answered efficiently. In the external memory model, the state of the art is a static structure that consumes $O(N/B)$ space, answers a query in $O(\log_B N + K/B)$ time, and can be constructed in $O(N + (N \log N / B) \log_{M/B}(N/B))$ time, where $B$ is the size of a disk block and $M$ is the size of memory. We present a fully dynamic structure that retains the same space and query bounds and can be updated in $O(\log_B^2 N)$ amortized time per insertion and deletion. Our structure can be constructed in $O((N/B) \log_{M/B}(N/B))$ time.
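To make the query semantics concrete, the following is a minimal in-memory sketch of top-K range reporting with insertions and deletions. It is not the structure from the paper and does not achieve any of the external-memory bounds stated above; it only pins down what a correct answer looks like. The class name `NaiveTopKRange` and its methods are hypothetical names chosen for illustration, and the sketch assumes the query interval is closed.

```python
import bisect
import heapq


class NaiveTopKRange:
    """Reference semantics only: a sorted in-memory list, not the paper's structure."""

    def __init__(self, points=()):
        # Store (x, score) pairs sorted by coordinate x (ties broken by score).
        self._pts = sorted(points)

    def insert(self, x, score):
        # Insert while keeping the list sorted; linear time in the worst case.
        bisect.insort(self._pts, (x, score))

    def delete(self, x, score):
        # Remove one occurrence of (x, score); return False if it is absent.
        i = bisect.bisect_left(self._pts, (x, score))
        if i < len(self._pts) and self._pts[i] == (x, score):
            self._pts.pop(i)
            return True
        return False

    def query(self, x1, x2, k):
        # Locate the contiguous run of points with x1 <= x <= x2.
        lo = bisect.bisect_left(self._pts, (x1, float("-inf")))
        hi = bisect.bisect_right(self._pts, (x2, float("inf")))
        # Return up to k points with the smallest scores, in ascending score order.
        return heapq.nsmallest(k, self._pts[lo:hi], key=lambda p: p[1])
```

For example, with points `(1.0, 5.0), (2.0, 1.0), (3.0, 4.0), (4.0, 2.0), (6.0, 0.5)`, calling `query(2.0, 5.0, 2)` considers only the three points whose coordinates lie in the interval and returns the two with the smallest scores. A query here scans every point in the interval, whereas the bounds in the abstract depend on the interval only through the output size $K$.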