Dominance-based duplication simulation (DBDS): code duplication to enable compiler optimizations

Proceedings of the 2018 International Symposium on Code Generation and Optimization. Pub Date: 2018-02-24. DOI: 10.1145/3168811

David Leopoldseder, Lukas Stadler, Thomas Würthinger, J. Eisl, Doug Simon, H. Mössenböck


Citations: 36

Abstract

Compilers perform a variety of advanced optimizations to improve the quality of the generated machine code. However, optimizations that depend on the data flow of a program are often limited by control-flow merges. Code duplication can solve this problem by hoisting, i.e. duplicating, instructions from merge blocks to their predecessors. However, finding optimization opportunities enabled by duplication is a non-trivial task that requires compile-time-intensive analysis. This poses a challenge for modern (just-in-time) compilers: duplicating instructions tentatively at every control-flow merge is not feasible because excessive duplication leads to uncontrolled code growth and compile-time increases. Therefore, compilers need to determine whether a duplication is beneficial enough to be performed. This paper proposes a novel approach to determine which duplication operations should be performed to increase performance. The approach is based on a duplication simulation that enables a compiler to evaluate different success metrics per potential duplication. Using this information, the compiler can then select the most promising candidates for optimization. We show how to map duplication candidates into an optimization cost model that allows us to trade off different success metrics, including peak performance, code size and compile time. We implemented the approach on top of GraalVM and evaluated it with the benchmarks Java DaCapo, Scala DaCapo, JavaScript Octane and a micro-benchmark suite, in terms of performance, compilation time and code size increase. We show that our optimization can reach peak performance improvements of up to 40% with a mean peak performance increase of 5.89%, while it generates a mean code size increase of 9.93% and a mean compile time increase of 18.44%.
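
To make the central idea concrete, the following is a minimal source-level sketch of how a control-flow merge can block an optimization and how duplicating the merge block into its predecessors can enable it. It is only an illustration: the class name DuplicationSketch and the methods original and duplicated are hypothetical, and a compiler such as Graal performs this kind of transformation on its intermediate representation, not on Java source. The sketch does not model the simulation or the cost model described in the abstract.

```java
public class DuplicationSketch {

    // Before duplication: after the if/else, control flow merges and `phi`
    // may hold either x or 0, so `2 + phi` cannot be simplified in the
    // merge block.
    public static int original(int x) {
        int phi;
        if (x > 0) {
            phi = x;
        } else {
            phi = 0;
        }
        return 2 + phi; // merge block
    }

    // After duplication (hand-written equivalent): the merge block is copied
    // into both predecessors. Each copy now sees one concrete value for `phi`,
    // so the else-copy `2 + 0` folds to the constant 2.
    public static int duplicated(int x) {
        if (x > 0) {
            return 2 + x;
        } else {
            return 2;
        }
    }

    public static void main(String[] args) {
        // Both versions must compute the same result for every input.
        for (int x = -3; x <= 3; x++) {
            if (original(x) != duplicated(x)) {
                throw new AssertionError("mismatch for x = " + x);
            }
        }
        System.out.println("both versions agree");
    }
}
```

The duplicated version contains two return paths instead of one, which is the code-size cost that has to be weighed against the benefit of the folded constant when deciding whether a duplication is worthwhile.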

