ACM Transactions on Algorithms (TALG)最新文献_第9页

An Optimal Algorithm for ℓ1-Heavy Hitters in Insertion Streams and Related Problems 插入流中1-重子的最优算法及相关问题

ACM Transactions on Algorithms (TALG) Pub Date : 2018-10-22 DOI: 10.1145/3264427

Arnab Bhattacharyya, P. Dey, David P. Woodruff

{"title":"An Optimal Algorithm for ℓ1-Heavy Hitters in Insertion Streams and Related Problems","authors":"Arnab Bhattacharyya, P. Dey, David P. Woodruff","doi":"10.1145/3264427","DOIUrl":"https://doi.org/10.1145/3264427","url":null,"abstract":"We give the first optimal bounds for returning the ℓ1-heavy hitters in a data stream of insertions, together with their approximate frequencies, closing a long line of work on this problem. For a stream of m items in { 1, 2, … , n} and parameters 0 < ε < φ ⩽ 1, let fi denote the frequency of item i, i.e., the number of times item i occurs in the stream. With arbitrarily large constant probability, our algorithm returns all items i for which fi ⩾ φ m, returns no items j for which fj ⩽ (φ −ε)m, and returns approximations f˜i with |f˜i − fi| ⩽ ε m for each item i that it returns. Our algorithm uses O(ε−1 log φ −1 + φ −1 log n + log log m) bits of space, processes each stream update in O(1) worst-case time, and can report its output in time linear in the output size. We also prove a lower bound, which implies that our algorithm is optimal up to a constant factor in its space complexity. A modification of our algorithm can be used to estimate the maximum frequency up to an additive ε m error in the above amount of space, resolving Question 3 in the IITK 2006 Workshop on Algorithms for Data Streams for the case of ℓ1-heavy hitters. We also introduce several variants of the heavy hitters and maximum frequency problems, inspired by rank aggregation and voting schemes, and show how our techniques can be applied in such settings. Unlike the traditional heavy hitters problem, some of these variants look at comparisons between items rather than numerical values to determine the frequency of an item.","PeriodicalId":154047,"journal":{"name":"ACM Transactions on Algorithms (TALG)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115294819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 12

Randomized Contractions Meet Lean Decompositions 随机收缩与精益分解

ACM Transactions on Algorithms (TALG) Pub Date : 2018-10-16 DOI: 10.1145/3426738

Marek Cygan, Pawel Komosa, D. Lokshtanov, Michal Pilipczuk, Marcin Pilipczuk, Saket Saurabh

引用次数: 27

Entropy and Optimal Compression of Some General Plane Trees 一些通用平面树的熵和最优压缩

ACM Transactions on Algorithms (TALG) Pub Date : 2018-10-01 DOI: 10.1145/3275444

Z. Golebiewski, A. Magner, W. Szpankowski

引用次数: 1

Enumerating Minimal Dominating Sets in Kt-free Graphs and Variants 无kt图及其变体中的极小支配集枚举

ACM Transactions on Algorithms (TALG) Pub Date : 2018-10-01 DOI: 10.1145/3386686

Marthe Bonamy, Oscar Defrain, Marc Heinrich, Michal Pilipczuk, Jean-Florent Raymond

引用次数: 10

Stream Sampling Framework and Application for Frequency Cap Statistics 频率帽统计的流采样框架及应用

ACM Transactions on Algorithms (TALG) Pub Date : 2018-09-24 DOI: 10.1145/3234338

E. Cohen

{"title":"Stream Sampling Framework and Application for Frequency Cap Statistics","authors":"E. Cohen","doi":"10.1145/3234338","DOIUrl":"https://doi.org/10.1145/3234338","url":null,"abstract":"Unaggregated data, in a streamed or distributed form, are prevalent and come from diverse sources such as interactions of users with web services and IP traffic. Data elements have keys (cookies, users, queries), and elements with different keys interleave. Analytics on such data typically utilizes statistics expressed as a sum over keys in a specified segment of a function f applied to the frequency (the total number of occurrences) of the key. In particular, Distinct is the number of active keys in the segment, Sum is the sum of their frequencies, and both are special cases of frequency cap statistics, which cap the frequency by a parameter T. Random samples can be very effective for quick and efficient estimation of statistics at query time. Ideally, to estimate statistics for a given function f, our sample would include a key with frequency w with probability roughly proportional to f(w). The challenge is that while such “gold-standard” samples can be easily computed after aggregating the data (computing the set of key-frequency pairs), this aggregation is costly: It requires structure of size that is proportional to the number of active keys, which can be very large. We present a sampling framework for unaggregated data that uses a single pass (for streams) or two passes (for distributed data) and structure size proportional to the desired sample size. Our design unifies classic solutions for Distinct and Sum. Specifically, our ℓ-capped samples provide nonnegative unbiased estimates of any monotone non-decreasing frequency statistics and statistical guarantees on quality that are close to gold standard for cap statistics with T=Θ (ℓ). Furthermore, our multi-objective samples provide these statistical guarantees on quality for all concave sub-linear statistics (the nonnegative span of cap functions) while incurring only a logarithmic overhead on sample size.","PeriodicalId":154047,"journal":{"name":"ACM Transactions on Algorithms (TALG)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123116316","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 9

Online Submodular Maximization with Free Disposal 在线子模块最大化与自由处置

ACM Transactions on Algorithms (TALG) Pub Date : 2018-09-17 DOI: 10.1145/3242770

T-H. Hubert Chan, Zhiyi Huang, S. Jiang, N. Kang, Zhihao Gavin Tang

{"title":"Online Submodular Maximization with Free Disposal","authors":"T-H. Hubert Chan, Zhiyi Huang, S. Jiang, N. Kang, Zhihao Gavin Tang","doi":"10.1145/3242770","DOIUrl":"https://doi.org/10.1145/3242770","url":null,"abstract":"We study the online submodular maximization problem with free disposal under a matroid constraint. Elements from some ground set arrive one by one in rounds, and the algorithm maintains a feasible set that is independent in the underlying matroid. In each round when a new element arrives, the algorithm may accept the new element into its feasible set and possibly remove elements from it, provided that the resulting set is still independent. The goal is to maximize the value of the final feasible set under some monotone submodular function, to which the algorithm has oracle access. For k-uniform matroids, we give a deterministic algorithm with competitive ratio at least 0.2959, and the ratio approaches 1/α∞≈ 0.3178 as k approaches infinity, improving the previous best ratio of 0.25 by Chakrabarti and Kale (IPCO 2014), Buchbinder et al. (SODA 2015), and Chekuri et al. (ICALP 2015). We also show that our algorithm is optimal among a class of deterministic monotone algorithms that accept a new arriving element only if the objective is strictly increased. Further, we prove that no deterministic monotone algorithm can be strictly better than 0.25-competitive even for partition matroids, the most modest generalization of k-uniform matroids, matching the competitive ratio by Chakrabarti and Kale (IPCO 2014) and Chekuri et al. (ICALP 2015). Interestingly, we show that randomized algorithms are strictly more powerful by giving a (non-monotone) randomized algorithm for partition matroids with ratio 1/α∞≈ 0.3178.","PeriodicalId":154047,"journal":{"name":"ACM Transactions on Algorithms (TALG)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124143030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 8

Fully Dynamic MIS in Uniformly Sparse Graphs 均匀稀疏图中的全动态MIS

ACM Transactions on Algorithms (TALG) Pub Date : 2018-08-30 DOI: 10.1145/3378025

Krzysztof Onak, B. Schieber, Shay Solomon, Nicole Wein

引用次数: 23

Packing Groups of Items into Multiple Knapsacks 将物品打包成多个背包

ACM Transactions on Algorithms (TALG) Pub Date : 2018-08-21 DOI: 10.1145/3233524

Lin Chen, Guochuan Zhang

{"title":"Packing Groups of Items into Multiple Knapsacks","authors":"Lin Chen, Guochuan Zhang","doi":"10.1145/3233524","DOIUrl":"https://doi.org/10.1145/3233524","url":null,"abstract":"We consider a natural generalization of the classical multiple knapsack problem in which instead of packing single items we are packing groups of items. In this problem, we have multiple knapsacks and a set of items partitioned into groups. Each item has an individual weight, while the profit is associated with groups rather than items. The profit of a group can be attained if and only if every item of this group is packed. Such a general model finds applications in various practical problems, e.g., delivering bundles of goods. The tractability of this problem relies heavily on how large a group could be. Deciding if a group of items of total weight 2 could be packed into two knapsacks of unit capacity is already NP-hard and it thus rules out a constant-approximation algorithm for this problem in general. We then focus on the parameterized version where the total weight of items in each group is bounded by a factor δ of the total capacity of all knapsacks. Both approximation and inapproximability results with respect to δ are derived. We also show that, depending on whether the number of knapsacks is a constant or part of the input, the approximation ratio for the problem, as a function on δ, changes substantially, which has a clear difference from the classical multiple knapsack problem.","PeriodicalId":154047,"journal":{"name":"ACM Transactions on Algorithms (TALG)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124195396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 14

Approximation Guarantees for the Minimum Linear Arrangement Problem by Higher Eigenvalues 高特征值下最小线性排列问题的逼近保证

ACM Transactions on Algorithms (TALG) Pub Date : 2018-08-21 DOI: 10.1145/3228342

Suguru Tamaki, Yuichi Yoshida

引用次数: 2

Graph Reconstruction and Verification 图的重构与验证

ACM Transactions on Algorithms (TALG) Pub Date : 2018-08-09 DOI: 10.1145/3199606

Sampath Kannan, Claire Mathieu, Hang Zhou

引用次数: 18