Toward an Understanding of Long-tailed Runtimes of SLS Algorithms

Q2 Mathematics

Journal of Experimental Algorithmics Pub Date : 2022-10-24 DOI:10.1145/3569170

Jan-Hendrik Lorenz, Florian Wörz

{"title":"Toward an Understanding of Long-tailed Runtimes of SLS Algorithms","authors":"Jan-Hendrik Lorenz, Florian Wörz","doi":"10.1145/3569170","DOIUrl":null,"url":null,"abstract":"The satisfiability problem (SAT) is one of the most famous problems in computer science. Traditionally, its NP-completeness has been used to argue that SAT is intractable. However, there have been tremendous practical advances in recent years that allow modern SAT solvers to solve instances with millions of variables and clauses. A particularly successful paradigm in this context is stochastic local search (SLS). In most cases, there are different ways of formulating the underlying SAT problem. While it is known that the precise formulation of the problem has a significant impact on the runtime of solvers, finding a helpful formulation is generally non-trivial. The recently introduced GapSAT solver [Lorenz and Wörz 2020] demonstrated a successful way to improve the performance of an SLS solver on average by learning additional information, which logically entails from the original problem. Still, there were also cases in which the performance slightly deteriorated. This justifies in-depth investigations into how learning logical implications affects runtimes for SLS algorithms. In this work, we propose a method for generating logically equivalent problem formulations, generalizing the ideas of GapSAT. This method allows a rigorous mathematical study of the effect on the runtime of SLS SAT solvers. Initially, we conduct empirical investigations. If the modification process is treated as random, then Johnson SB distributions provide a perfect characterization of the hardness. Since the observed Johnson SB distributions approach lognormal distributions, our analysis also suggests that the hardness is long-tailed. As a second contribution, we theoretically prove that restarts are useful for long-tailed distributions. This implies that incorporating additional restarts can further refine all algorithms employing above mentioned modification technique. Since the empirical studies compellingly suggest that the runtime distributions follow Johnson SB distributions, we also investigate this property on a theoretical basis. We succeed in proving that the runtimes for the special case of Schöning’s random walk algorithm [Schöning 2002] are approximately Johnson SB distributed.","PeriodicalId":53707,"journal":{"name":"Journal of Experimental Algorithmics","volume":"27 1","pages":"1 - 38"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Experimental Algorithmics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3569170","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Mathematics","Score":null,"Total":0}

引用次数: 0

Abstract

The satisfiability problem (SAT) is one of the most famous problems in computer science. Traditionally, its NP-completeness has been used to argue that SAT is intractable. However, there have been tremendous practical advances in recent years that allow modern SAT solvers to solve instances with millions of variables and clauses. A particularly successful paradigm in this context is stochastic local search (SLS). In most cases, there are different ways of formulating the underlying SAT problem. While it is known that the precise formulation of the problem has a significant impact on the runtime of solvers, finding a helpful formulation is generally non-trivial. The recently introduced GapSAT solver [Lorenz and Wörz 2020] demonstrated a successful way to improve the performance of an SLS solver on average by learning additional information, which logically entails from the original problem. Still, there were also cases in which the performance slightly deteriorated. This justifies in-depth investigations into how learning logical implications affects runtimes for SLS algorithms. In this work, we propose a method for generating logically equivalent problem formulations, generalizing the ideas of GapSAT. This method allows a rigorous mathematical study of the effect on the runtime of SLS SAT solvers. Initially, we conduct empirical investigations. If the modification process is treated as random, then Johnson SB distributions provide a perfect characterization of the hardness. Since the observed Johnson SB distributions approach lognormal distributions, our analysis also suggests that the hardness is long-tailed. As a second contribution, we theoretically prove that restarts are useful for long-tailed distributions. This implies that incorporating additional restarts can further refine all algorithms employing above mentioned modification technique. Since the empirical studies compellingly suggest that the runtime distributions follow Johnson SB distributions, we also investigate this property on a theoretical basis. We succeed in proving that the runtimes for the special case of Schöning’s random walk algorithm [Schöning 2002] are approximately Johnson SB distributed.

查看原文本刊更多论文

对SLS算法长尾运行时的理解

可满足性问题是计算机科学中最著名的问题之一。传统上，它的NP完全性被用来证明SAT是棘手的。然而，近年来已经有了巨大的实际进展，使现代SAT求解器能够解决具有数百万变量和子句的实例。在这种情况下，一个特别成功的范例是随机局部搜索（SLS）。在大多数情况下，有不同的方式来表述潜在的SAT问题。虽然众所周知，问题的精确公式对求解器的运行时间有很大影响，但找到一个有用的公式通常是不容易的。最近推出的GapSAT解算器[Lorenz和Wörz 2020]展示了一种成功的方法，可以通过学习额外的信息来平均提高SLS解算器的性能，这些信息在逻辑上来自原始问题。尽管如此，也有表现略有恶化的情况。这证明了深入研究学习逻辑含义如何影响SLS算法的运行时是合理的。在这项工作中，我们提出了一种生成逻辑等价问题公式的方法，推广了GapSAT的思想。该方法允许对SLS SAT求解器的运行时间影响进行严格的数学研究。最初，我们进行实证调查。如果改性过程被视为随机的，那么Johnson SB分布提供了硬度的完美表征。由于观测到的Johnson SB分布接近对数正态分布，我们的分析也表明硬度是长尾的。作为第二个贡献，我们从理论上证明了重启对于长尾分布是有用的。这意味着结合额外的重新启动可以进一步完善采用上述修改技术的所有算法。由于经验研究令人信服地表明运行时分布遵循Johnson SB分布，我们也在理论基础上研究了这一性质。我们成功地证明了Schöning的随机游动算法[Schöning2002]的特殊情况的运行时间是近似Johnson SB分布的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Experimental Algorithmics Mathematics-Theoretical Computer Science

CiteScore

3.10

自引率

0.00%

发文量

期刊介绍： The ACM JEA is a high-quality, refereed, archival journal devoted to the study of discrete algorithms and data structures through a combination of experimentation and classical analysis and design techniques. It focuses on the following areas in algorithms and data structures: ■combinatorial optimization ■computational biology ■computational geometry ■graph manipulation ■graphics ■heuristics ■network design ■parallel processing ■routing and scheduling ■searching and sorting ■VLSI design