Computing partially path-sensitive MFP solutions in data flow analyses

Proceedings of the 27th International Conference on Compiler Construction Pub Date : 2018-02-24 DOI:10.1145/3178372.3179497

K. Pathade, Uday P. Khedker

{"title":"Computing partially path-sensitive MFP solutions in data flow analyses","authors":"K. Pathade, Uday P. Khedker","doi":"10.1145/3178372.3179497","DOIUrl":null,"url":null,"abstract":"Data flow analysis traverses paths in a control flow graph (CFG) representation of programs to compute useful information. Many of these paths are infeasible, i.e. they cannot arise in any possible execution. The information computed along these paths adds imprecision to the conventional Maximal Fixed Point (MFP) solution of a data flow analysis. Existing approaches for removing this imprecision are either specific to a data flow problem or involve control flow graph restructuring which has exponential complexity. We introduce partial path-sensitivity to the MFP solution by identifying clusters of minimal infeasible path segments to distinguish between the data flowing along feasible and infeasible control flow paths. This allows us to lift any data flow analysis to an analysis over k+1 tuples where k is the number of clusters. Our flow function for a k+1 tuple shifts the values of the underlying analysis from an element in the tuple to other element(s) at the start and end of a cluster as appropriate. This allows us to maintain the distinctions where they are beneficial. Since k is linear in the number of conditional edges in the CFG, the effort is multiplied by a factor that is linear in the number of conditional edges (and is not exponential, unlike conventional approaches of achieving path sensitivity.) We have implemented our method of computing partially path sensitive MFP for reaching definitions analysis and value range analysis of variables. Our measurements on benchmark programs show up to 9% reduction in the number of reaching definitions and up to 14% cases where the value range of a variable is smaller.","PeriodicalId":117615,"journal":{"name":"Proceedings of the 27th International Conference on Compiler Construction","volume":"90 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 27th International Conference on Compiler Construction","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3178372.3179497","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

Data flow analysis traverses paths in a control flow graph (CFG) representation of programs to compute useful information. Many of these paths are infeasible, i.e. they cannot arise in any possible execution. The information computed along these paths adds imprecision to the conventional Maximal Fixed Point (MFP) solution of a data flow analysis. Existing approaches for removing this imprecision are either specific to a data flow problem or involve control flow graph restructuring which has exponential complexity. We introduce partial path-sensitivity to the MFP solution by identifying clusters of minimal infeasible path segments to distinguish between the data flowing along feasible and infeasible control flow paths. This allows us to lift any data flow analysis to an analysis over k+1 tuples where k is the number of clusters. Our flow function for a k+1 tuple shifts the values of the underlying analysis from an element in the tuple to other element(s) at the start and end of a cluster as appropriate. This allows us to maintain the distinctions where they are beneficial. Since k is linear in the number of conditional edges in the CFG, the effort is multiplied by a factor that is linear in the number of conditional edges (and is not exponential, unlike conventional approaches of achieving path sensitivity.) We have implemented our method of computing partially path sensitive MFP for reaching definitions analysis and value range analysis of variables. Our measurements on benchmark programs show up to 9% reduction in the number of reaching definitions and up to 14% cases where the value range of a variable is smaller.

查看原文本刊更多论文

数据流分析中部分路径敏感MFP解的计算

数据流分析在程序的控制流图(CFG)表示中遍历路径以计算有用的信息。这些路径中有许多是不可行的，也就是说，它们无法在任何可能的执行中出现。沿着这些路径计算的信息增加了数据流分析的传统最大不动点(MFP)解的不精确性。现有的消除这种不精确的方法要么是特定于数据流问题的，要么涉及具有指数复杂性的控制流图重构。我们通过识别最小不可行路径段的簇来区分沿可行和不可行控制流路径流动的数据，从而将部分路径敏感性引入到MFP解决方案中。这允许我们将任何数据流分析提升到对k+1元组的分析，其中k是簇的数量。我们的k+1元组的流函数将基础分析的值从元组中的一个元素转移到集群开始和结束的其他元素。这使我们能够保持有益的区别。由于k在CFG中条件边的数量上是线性的，因此工作量乘以条件边数量上的线性因子(与实现路径灵敏度的传统方法不同，它不是指数因子)。我们实现了计算部分路径敏感MFP的方法，用于变量的定义分析和值范围分析。我们对基准程序的测量显示，达到定义的次数减少了9%，变量的值范围较小的情况减少了14%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 27th International Conference on Compiler Construction

自引率

0.00%

发文量