{"title":"构建Top-k结构的通用技术","authors":"Saladi Rahul, Yufei Tao","doi":"https://dl.acm.org/doi/10.1145/3546074","DOIUrl":null,"url":null,"abstract":"<p>A <i>reporting query</i> returns the objects satisfying a predicate <i>q</i> from an input set. In <i>prioritized reporting</i>, each object carries a real-valued weight (which can be query dependent), and a query returns the objects that satisfy <i>q</i> and have weights at least a threshold τ. A <i>top-<i>k</i> query</i> finds, among all the objects satisfying <i>q</i>, the <i>k</i> ones of the largest weights; a <i>max query</i> is a special instance with <i>k</i> = 1. We want to design data structures of small space to support queries (and possibly updates) efficiently.</p><p>Previous work has shown that a top-<i>k</i> structure can also support max and prioritized queries with no performance deterioration. This article explores the opposite direction: do prioritized queries, possibly combined with max queries, imply top-<i>k</i> search? Subject to mild conditions, we provide affirmative answers with two reduction techniques. The first converts a prioritized structure into a static top-<i>k</i> structure with the same space complexity and only a logarithmic blowup in query time. If a max structure is available in addition, our second reduction yields a top-<i>k</i> structure with no degradation in expected performance (this holds for the space, query, and update complexities). Our techniques significantly simplify the design of top-<i>k</i> structures because structures for max and prioritized queries are often easier to obtain. We demonstrate this by developing top-<i>k</i> structures for interval stabbing, 3D dominance, halfspace reporting, linear ranking, and <i>L</i><sub>∞</sub> nearest neighbor search in the RAM and the external memory computation models.</p>","PeriodicalId":50922,"journal":{"name":"ACM Transactions on Algorithms","volume":"27 1","pages":""},"PeriodicalIF":0.9000,"publicationDate":"2022-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Generic Techniques for Building Top-k Structures\",\"authors\":\"Saladi Rahul, Yufei Tao\",\"doi\":\"https://dl.acm.org/doi/10.1145/3546074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>A <i>reporting query</i> returns the objects satisfying a predicate <i>q</i> from an input set. In <i>prioritized reporting</i>, each object carries a real-valued weight (which can be query dependent), and a query returns the objects that satisfy <i>q</i> and have weights at least a threshold τ. A <i>top-<i>k</i> query</i> finds, among all the objects satisfying <i>q</i>, the <i>k</i> ones of the largest weights; a <i>max query</i> is a special instance with <i>k</i> = 1. We want to design data structures of small space to support queries (and possibly updates) efficiently.</p><p>Previous work has shown that a top-<i>k</i> structure can also support max and prioritized queries with no performance deterioration. This article explores the opposite direction: do prioritized queries, possibly combined with max queries, imply top-<i>k</i> search? Subject to mild conditions, we provide affirmative answers with two reduction techniques. The first converts a prioritized structure into a static top-<i>k</i> structure with the same space complexity and only a logarithmic blowup in query time. If a max structure is available in addition, our second reduction yields a top-<i>k</i> structure with no degradation in expected performance (this holds for the space, query, and update complexities). Our techniques significantly simplify the design of top-<i>k</i> structures because structures for max and prioritized queries are often easier to obtain. We demonstrate this by developing top-<i>k</i> structures for interval stabbing, 3D dominance, halfspace reporting, linear ranking, and <i>L</i><sub>∞</sub> nearest neighbor search in the RAM and the external memory computation models.</p>\",\"PeriodicalId\":50922,\"journal\":{\"name\":\"ACM Transactions on Algorithms\",\"volume\":\"27 1\",\"pages\":\"\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2022-10-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Transactions on Algorithms\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/https://dl.acm.org/doi/10.1145/3546074\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, THEORY & METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Algorithms","FirstCategoryId":"94","ListUrlMain":"https://doi.org/https://dl.acm.org/doi/10.1145/3546074","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
A reporting query returns the objects satisfying a predicate q from an input set. In prioritized reporting, each object carries a real-valued weight (which can be query dependent), and a query returns the objects that satisfy q and have weights at least a threshold τ. A top-k query finds, among all the objects satisfying q, the k ones of the largest weights; a max query is a special instance with k = 1. We want to design data structures of small space to support queries (and possibly updates) efficiently.
Previous work has shown that a top-k structure can also support max and prioritized queries with no performance deterioration. This article explores the opposite direction: do prioritized queries, possibly combined with max queries, imply top-k search? Subject to mild conditions, we provide affirmative answers with two reduction techniques. The first converts a prioritized structure into a static top-k structure with the same space complexity and only a logarithmic blowup in query time. If a max structure is available in addition, our second reduction yields a top-k structure with no degradation in expected performance (this holds for the space, query, and update complexities). Our techniques significantly simplify the design of top-k structures because structures for max and prioritized queries are often easier to obtain. We demonstrate this by developing top-k structures for interval stabbing, 3D dominance, halfspace reporting, linear ranking, and L∞ nearest neighbor search in the RAM and the external memory computation models.
期刊介绍:
ACM Transactions on Algorithms welcomes submissions of original research of the highest quality dealing with algorithms that are inherently discrete and finite, and having mathematical content in a natural way, either in the objective or in the analysis. Most welcome are new algorithms and data structures, new and improved analyses, and complexity results. Specific areas of computation covered by the journal include
combinatorial searches and objects;
counting;
discrete optimization and approximation;
randomization and quantum computation;
parallel and distributed computation;
algorithms for
graphs,
geometry,
arithmetic,
number theory,
strings;
on-line analysis;
cryptography;
coding;
data compression;
learning algorithms;
methods of algorithmic analysis;
discrete algorithms for application areas such as
biology,
economics,
game theory,
communication,
computer systems and architecture,
hardware design,
scientific computing