Synthesis of Benchmarks for the C Programming Language by Mining Software Repositories

Proceedings of the XXIII Brazilian Symposium on Programming Languages Pub Date : 2019-09-23 DOI:10.1145/3355378.3355380

B. Guimarães, José Wesley de S. Magalhães, A. F. Silva, F. Pereira

引用次数: 2

Abstract

Compilers are usually distributed with a test framework. This framework supports the task of tuning optimizations and static analyses. As an example, clang has a test suite that, in March 2019, counted 259 benchmarks. Although in principle a large collection, this number is small once we consider the needs of the automatic tuning techniques that became fashionable recently. To mitigate the problems caused by such lack of benchmarks, this paper introduces a technique that allows the automatic construction of compilable programs out of open-source repositories. Our approach has made it possible to build, in less than 24 hours, a collection with over 500 thousand functions that clang can compile. In this paper, we show that such abundance of data gives us precise information about the behavior of compiler optimizations, and lets us create accurate prediction models. This collection of benchmarks is today freely available to the open-source community.

查看原文本刊更多论文

基于软件资源库的C编程语言基准综合

编译器通常与测试框架一起分发。该框架支持调优优化和静态分析任务。例如，clang有一个测试套件，在2019年3月，有259个基准测试。虽然原则上这是一个很大的集合，但是考虑到最近流行起来的自动调优技术的需要，这个数字就很小了。为了减轻由于缺乏基准而引起的问题，本文介绍了一种允许从开源存储库中自动构建可编译程序的技术。我们的方法使得在不到24小时内构建一个clang可以编译的包含超过50万个函数的集合成为可能。在本文中，我们展示了如此丰富的数据为我们提供了有关编译器优化行为的精确信息，并使我们能够创建准确的预测模型。这个基准集现在可以免费提供给开源社区。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the XXIII Brazilian Symposium on Programming Languages

自引率

0.00%

发文量