Journal of the ACM: Latest Articles, Page 3

Learning to branch: Generalization guarantees and limits of data-independent discretization

IF 2.5, Zone 2 (Computer Science)

Journal of the ACM, Pub Date: 2023-12-25, DOI: 10.1145/3637840

Maria-Florina Balcan, Travis Dick, Tuomas Sandholm, Ellen Vitercik

Abstract: Tree search algorithms, such as branch-and-bound, are the most widely used tools for solving combinatorial and non-convex problems. For example, they are the foremost method for solving (mixed) integer programs and constraint satisfaction problems. Tree search algorithms come with a variety of tunable parameters that are notoriously challenging to tune by hand. A growing body of research has demonstrated the power of using a data-driven approach to automatically optimize the parameters of tree search algorithms. These techniques use a training set of integer programs sampled from an application-specific instance distribution to find a parameter setting that has strong average performance over the training set. However, with too few samples, a parameter setting may have strong average performance on the training set but poor expected performance on future integer programs from the same application. Our main contribution is to provide the first sample complexity guarantees for tree search parameter tuning. These guarantees bound the number of samples sufficient to ensure that the average performance of tree search over the samples nearly matches its future expected performance on the unknown instance distribution. In particular, the parameters we analyze weight scoring rules used for variable selection. Proving these guarantees is challenging because tree size is a volatile function of these parameters: we prove that for any discretization (uniform or not) of the parameter space, there exists a distribution over integer programs such that every parameter setting in the discretization results in a tree with exponential expected size, yet there exist parameter settings between the discretized points that result in trees of constant size. In addition, we provide data-dependent guarantees that depend on the volatility of these tree-size functions: our guarantees improve if the tree-size functions can be well-approximated by simpler functions. Finally, via experiments, we illustrate that learning an optimal weighting of scoring rules reduces tree size.
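To make the tunable parameter in the abstract above concrete, one way to weight two scoring rules is a convex combination mu * score1 + (1 - mu) * score2, with the solver branching on the highest-scoring variable. The sketch below is only an illustration under that assumption; the function name, the two-rule setup, and the score values are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def select_branching_variable(score1: Sequence[float],
                              score2: Sequence[float],
                              mu: float) -> int:
    """Return the index of the variable maximizing mu*score1 + (1-mu)*score2."""
    if not 0.0 <= mu <= 1.0:
        raise ValueError("mu must lie in [0, 1]")
    if len(score1) != len(score2) or not score1:
        raise ValueError("score lists must be non-empty and equally long")
    combined = [mu * a + (1.0 - mu) * b for a, b in zip(score1, score2)]
    # Ties are broken in favour of the lowest index.
    return max(range(len(combined)), key=combined.__getitem__)


# Made-up scores for three candidate variables under two scoring rules.
rule_a = [0.9, 0.1, 0.5]
rule_b = [0.1, 0.9, 0.6]

choice_low = select_branching_variable(rule_a, rule_b, mu=0.2)
choice_mid = select_branching_variable(rule_a, rule_b, mu=0.5)
choice_high = select_branching_variable(rule_a, rule_b, mu=0.8)
print(choice_low, choice_mid, choice_high)
```

With these made-up scores the selected variable differs for each of the three values of mu. Since every branching decision can flip in this way, it is plausible that the resulting tree size changes abruptly as the parameter moves, which is the volatility the abstract refers to.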

Citations: 0

Faster Modular Composition

IF 2.5, Zone 2 (Computer Science)

Journal of the ACM, Pub Date: 2023-12-25, DOI: 10.1145/3638349

Vincent Neiger, Bruno Salvy, Éric Schost, Gilles Villard

Citations: 0

Dominantly Truthful Peer Prediction Mechanisms with a Finite Number of Tasks

IF 2.5, Zone 2 (Computer Science)

Journal of the ACM, Pub Date: 2023-12-23, DOI: 10.1145/3638239

Yuqing Kong

Abstract: In the setting where participants are asked multiple similar, possibly subjective, multi-choice questions (e.g., Do you like Panda Express? Y/N; Do you like Chick-fil-A? Y/N), a series of peer prediction mechanisms have been designed to incentivize honest reports, and some of them achieve dominant truthfulness: truth-telling is a dominant strategy and strictly dominates every other “non-permutation strategy” under some mild conditions. However, those mechanisms require the participants to perform an infinite number of tasks. When the participants perform a finite number of tasks, these mechanisms only achieve approximate dominant truthfulness. The existence of a dominantly truthful multi-task peer prediction mechanism that only requires a finite number of tasks remains an open question that may have a negative result, even with full prior knowledge. This paper answers this open question by proposing a family of mechanisms, VMI-Mechanisms, that are dominantly truthful with a finite number of tasks. A special case of this family, DMI-Mechanism, only requires ≥ 2C tasks, where C is the number of choices for each question (C = 2 for binary-choice questions). The implementation of these mechanisms does not require any prior knowledge (detail-free) and only requires ≥ 2 participants. To the best of our knowledge, any mechanism of the family is the first dominantly truthful peer prediction mechanism that works for a finite number of tasks. The core of these new mechanisms is a new family of information-monotone information measures: volume mutual information (VMI). VMI is based on a simple geometric information measure design method, the volume method. The volume method measures the informativeness of an object by “counting” the number of objects that are less informative than it. In other words, the more objects that the object of interest dominates, the more informative it is considered to be. Finally, in the setting where agents need to invest effort to obtain their private signals, we show how to select the mechanism to optimally incentivize effort among a proper set of VMI-Mechanisms.

Citations: 0

Parallel Acyclic Joins: Optimal Algorithms and Cyclicity Separation

IF 2.5, Zone 2 (Computer Science)

Journal of the ACM, Pub Date: 2023-12-01, DOI: 10.1145/3633512

Xiao Hu, Yufei Tao

Abstract: We study equi-join computation in the massively parallel computation (MPC) model. Currently, a main open question under this topic is whether it is possible to design an algorithm that can process any join with load \(O(N \operatorname{polylog} N / p^{1/\rho^*})\), measured in the number of words communicated per machine, where \(N\) is the total number of tuples in the input relations, \(\rho^*\) is the join’s fractional edge covering number, and \(p\) is the number of machines. We settle the question in the negative for the class of tuple-based algorithms (all the known MPC join algorithms fall in this class) by proving the existence of a join query with \(\rho^* = 2\) that requires a load of \(\Omega(N / p^{1/3})\) to evaluate. Our lower bound provides solid evidence that the “AGM bound” alone is not sufficient for characterizing the hardness of join evaluation in MPC (a phenomenon that does not exist in RAM). The hard join instance identified in our argument is cyclic, which leaves the question of whether \(O(N \operatorname{polylog} N / p^{1/\rho^*})\) is still possible for acyclic joins. We answer this question in the affirmative by showing that any acyclic join can be evaluated with load \(O(N / p^{1/\rho^*})\), which is asymptotically optimal (there are no polylogarithmic factors in our bound). The separation between cyclic and acyclic joins is yet another phenomenon that is absent in RAM. Our algorithm owes to the discovery of a new mathematical structure of acyclic hypergraphs (we call it the “canonical edge cover”), which has numerous non-trivial properties and makes an elegant addition to database theory.
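As a rough numeric illustration of the two load expressions in the abstract above, the sketch below evaluates N / p^(1/rho*) (the shape of the acyclic upper bound) and N / p^(1/3) (the shape of the lower bound for the hard cyclic join with rho* = 2). Constant and polylogarithmic factors are dropped, and the function names and the values of N and p are assumptions chosen only for the example.

```python
def acyclic_join_load(n_tuples: float, machines: int, rho_star: float) -> float:
    """N / p^(1/rho*): shape of the load bound for acyclic joins, constants dropped."""
    if machines < 1 or rho_star <= 0:
        raise ValueError("need machines >= 1 and rho_star > 0")
    return n_tuples / machines ** (1.0 / rho_star)


def cyclic_lower_bound_load(n_tuples: float, machines: int) -> float:
    """N / p^(1/3): shape of the lower bound for the hard cyclic join with rho* = 2."""
    if machines < 1:
        raise ValueError("need machines >= 1")
    return n_tuples / machines ** (1.0 / 3.0)


N, p = 1_000_000, 64                           # hypothetical input size and machine count
acyclic = acyclic_join_load(N, p, rho_star=2)  # 1e6 / 64^(1/2) = 1e6 / 8
cyclic = cyclic_lower_bound_load(N, p)         # 1e6 / 64^(1/3), about 1e6 / 4
print(acyclic, cyclic)
```

For these values the cyclic lower bound is about twice the acyclic load at the same rho* = 2, which gives a feel for the separation between cyclic and acyclic joins stated in the abstract.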

Citations: 0

Optimal Auctions through Deep Learning: Advances in Differentiable Economics

Zone 2 (Computer Science)

Journal of the ACM, Pub Date: 2023-11-11, DOI: 10.1145/3630749

Paul Dütting, Zhe Feng, Harikrishna Narasimhan, David C. Parkes, Sai Srivatsa Ravindranath

Citations: 2

Probabilistic Programming with Exact Conditions

Zone 2 (Computer Science)

Journal of the ACM, Pub Date: 2023-11-11, DOI: 10.1145/3632170

Dario Stein, Sam Staton

Citations: 0

The Space Complexity of Consensus from Swap

Zone 2 (Computer Science)

Journal of the ACM, Pub Date: 2023-11-02, DOI: 10.1145/3631390

Sean Ovens

Abstract: Nearly thirty years ago, it was shown that \(\Omega(\sqrt{n})\) read/write registers are needed to solve randomized wait-free consensus among n processes. This lower bound was improved to n registers in 2018, which exactly matches known algorithms. The \(\Omega(\sqrt{n})\) space complexity lower bound actually applies to a class of objects called historyless objects, which includes registers, test-and-set objects, and readable swap objects. However, every known n-process obstruction-free consensus algorithm from historyless objects uses \(\Omega(n)\) objects. In this paper, we give the first \(\Omega(n)\) space complexity lower bounds on consensus algorithms for two kinds of historyless objects. First, we show that any obstruction-free consensus algorithm from swap objects uses at least n − 1 objects. More generally, we prove that any obstruction-free k-set agreement algorithm from swap objects uses at least \(\lceil \frac{n}{k} \rceil - 1\) objects. The k-set agreement problem is a generalization of consensus in which processes agree on no more than k different output values. This is the first non-constant lower bound on the space complexity of solving k-set agreement with swap objects when k > 1. We also present an obstruction-free k-set agreement algorithm from n − k swap objects, which exactly matches our lower bound when k = 1. Second, we show that any obstruction-free binary consensus algorithm from readable swap objects with domain size b uses at least \(\frac{n-2}{3b+1}\) objects. When b is a constant, this asymptotically matches the best known obstruction-free consensus algorithms from readable swap objects with unbounded domains. Since any historyless object can be simulated by a readable swap object with the same domain, our results imply that any obstruction-free consensus algorithm from historyless objects with domain size b uses at least \(\frac{n-2}{3b+1}\) objects. For b = 2, we show a slightly better lower bound of n − 2. There is an obstruction-free binary consensus algorithm using 2n − 1 readable swap objects with domain size 2, asymptotically matching our lower bound.
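The bounds quoted in the abstract above are simple enough to evaluate directly. The following sketch computes the swap-object lower bound ceil(n/k) − 1, the n − k object count of the k-set agreement algorithm, and the readable-swap lower bound (n − 2)/(3b + 1). The function names and the sample values of n, k, and b are assumptions made for illustration only.

```python
from fractions import Fraction


def swap_kset_lower_bound(n: int, k: int) -> int:
    """ceil(n/k) - 1: lower bound on swap objects for k-set agreement."""
    if n < 1 or k < 1:
        raise ValueError("need n >= 1 and k >= 1")
    return -(-n // k) - 1  # integer ceiling division, then subtract one


def swap_kset_algorithm_objects(n: int, k: int) -> int:
    """n - k: number of swap objects used by the algorithm in the abstract."""
    return n - k


def readable_swap_lower_bound(n: int, b: int) -> Fraction:
    """(n - 2) / (3b + 1): lower bound for readable swap objects of domain size b."""
    return Fraction(n - 2, 3 * b + 1)


n = 10  # hypothetical number of processes
consensus_gap = swap_kset_algorithm_objects(n, 1) - swap_kset_lower_bound(n, 1)
three_set_lower = swap_kset_lower_bound(n, 3)
three_set_upper = swap_kset_algorithm_objects(n, 3)
general_b2 = readable_swap_lower_bound(n, 2)
print(consensus_gap, three_set_lower, three_set_upper, general_b2)
```

For k = 1 the gap between the algorithm and the lower bound is zero, as the abstract states. For k = 3 and n = 10 the two quantities are 3 and 7, so the bounds are no longer tight. For b = 2 the general formula gives 8/7, well below the dedicated n − 2 = 8 bound mentioned for that case.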

Citations: 0

A New Minimax Theorem for Randomized Algorithms

Zone 2 (Computer Science)

Journal of the ACM, Pub Date: 2023-10-18, DOI: 10.1145/3626514

Shalev Ben-David, Eric Blais

Abstract: The celebrated minimax principle of Yao (1977) says that for any Boolean-valued function f with finite domain, there is a distribution μ over the domain of f such that computing f to error ϵ against inputs from μ is just as hard as computing f to error ϵ on worst-case inputs. Notably, however, the distribution μ depends on the target error level ϵ: the hard distribution which is tight for bounded error might be trivial to solve to small bias, and the hard distribution which is tight for a small bias level might be far from tight for bounded error levels. In this work, we introduce a new type of minimax theorem which can provide a hard distribution μ that works for all bias levels at once. We show that this works for randomized query complexity, randomized communication complexity, some randomized circuit models, quantum query and communication complexities, approximate polynomial degree, and approximate logrank. We also prove an improved version of Impagliazzo’s hardcore lemma. Our proofs rely on two innovations over the classical approach of using Von Neumann’s minimax theorem or linear programming duality. First, we use Sion’s minimax theorem to prove a minimax theorem for ratios of bilinear functions representing the cost and score of algorithms. Second, we introduce a new way to analyze low-bias randomized algorithms by viewing them as “forecasting algorithms” evaluated by a certain proper scoring rule. The expected score of the forecasting version of a randomized algorithm appears to be a more fine-grained way of analyzing the bias of the algorithm. We show that such expected scores have many elegant mathematical properties: for example, they can be amplified linearly instead of quadratically. We anticipate forecasting algorithms will find use in future work in which a fine-grained analysis of small-bias algorithms is required.

Citations: 1

Relative Error Streaming Quantiles

Zone 2 (Computer Science)

Journal of the ACM, Pub Date: 2023-10-16, DOI: 10.1145/3617891

Graham Cormode, Zohar Karnin, Edo Liberty, Justin Thaler, Pavel Veselý

Abstract: Estimating ranks, quantiles, and distributions over streaming data is a central task in data analysis and monitoring. Given a stream of n items from a data universe equipped with a total order, the task is to compute a sketch (data structure) of size polylogarithmic in n. Given the sketch and a query item y, one should be able to approximate its rank in the stream, i.e., the number of stream elements smaller than or equal to y. Most works to date focused on additive \(\varepsilon n\) error approximation, culminating in the KLL sketch that achieved optimal asymptotic behavior. This article investigates multiplicative \((1 \pm \varepsilon)\)-error approximations to the rank. Practical motivation for multiplicative error stems from demands to understand the tails of distributions, and hence for sketches to be more accurate near extreme values. The most space-efficient algorithms due to prior work store either \(O(\log(\varepsilon^2 n)/\varepsilon^2)\) or \(O(\log^3(\varepsilon n)/\varepsilon)\) universe items. We present a randomized sketch storing \(O(\log^{1.5}(\varepsilon n)/\varepsilon)\) items that can \((1 \pm \varepsilon)\)-approximate the rank of each universe item with high constant probability; this space bound is within an \(O(\sqrt{\log(\varepsilon n)})\) factor of optimal. Our algorithm does not require prior knowledge of the stream length and is fully mergeable, rendering it suitable for parallel and distributed computing environments.
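The difference between the additive and multiplicative guarantees in the abstract above can be seen on a toy stream. The sketch below is not the paper's sketch data structure; it only computes the exact rank as defined in the abstract and checks a hypothetical estimate against both error notions. The helper names, the stream, and the estimate are all assumptions for illustration.

```python
from typing import Sequence


def exact_rank(stream: Sequence[int], y: int) -> int:
    """Number of stream elements smaller than or equal to y."""
    return sum(1 for x in stream if x <= y)


def within_additive(estimate: float, true_rank: int, n: int, eps: float) -> bool:
    """Additive guarantee: |estimate - rank| <= eps * n."""
    return abs(estimate - true_rank) <= eps * n


def within_multiplicative(estimate: float, true_rank: int, eps: float) -> bool:
    """Multiplicative guarantee: |estimate - rank| <= eps * rank."""
    return abs(estimate - true_rank) <= eps * true_rank


stream = list(range(1, 1001))   # toy stream: the items 1..1000
eps = 0.01
true_rank = exact_rank(stream, 5)
estimate = 12                   # hypothetical sketch answer for the query y = 5

additive_ok = within_additive(estimate, true_rank, len(stream), eps)
multiplicative_ok = within_multiplicative(estimate, true_rank, eps)
print(true_rank, additive_ok, multiplicative_ok)
```

Here the estimate is off by more than a factor of two, yet it still satisfies the additive guarantee because the allowed error eps * n does not shrink with the rank. The multiplicative guarantee rejects it, which is the extra accuracy near extreme values that motivates the paper.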

Citations: 0

First Price Auction is 1 − 1/e² Efficient

Zone 2 (Computer Science)

Journal of the ACM, Pub Date: 2023-10-14, DOI: 10.1145/3617902

Yaonan Jin, Pinyan Lu

Citations: 0