A Comparative Study of Automatic Program Repair Techniques for Security Vulnerabilities

2021 IEEE 32nd International Symposium on Software Reliability Engineering (ISSRE) Pub Date : 2021-10-01 DOI:10.1109/ISSRE52982.2021.00031

Eduard Pinconschi, Rui Abreu, P. Adão

{"title":"A Comparative Study of Automatic Program Repair Techniques for Security Vulnerabilities","authors":"Eduard Pinconschi, Rui Abreu, P. Adão","doi":"10.1109/ISSRE52982.2021.00031","DOIUrl":null,"url":null,"abstract":"In the past years, research on automatic program repair (APR), in particular on test-suite-based approaches, has significantly attracted the attention of researchers. Despite the advances in the field, it remains unclear how these techniques fare in the context of security—most approaches are evaluated using benchmarks of bugs that do not (only) contain security vulnerabilities. In this paper, we present our observations using 10 state-of-the-art test-suite-based automatic program repair tools on the DARPA Cyber Grand Challenge benchmark of vulnerabilities in C/C++. Our intention is to have a better understanding of the current state of automatic program repair tools when addressing security issues. In particular, our study is guided by the hypothesis that the efficiency of repair tools may not generalize to security vulnerabilities. We found that the 10 analyzed tools can only fix 30 out of 55 vulnerable programs—54.6 % of the considered issues. In particular, we found that APR tools with atomic change operators and brute-force search strategy (AE and GenProg) and brute-force functionality deletion (Kali) overall perform better at repairing security vulnerabilities (considering both efficiency and effectiveness). AE is the tool that individually repairs most programs with 20 out of 55 programs (36.4%). The causes for failing to repair are discussed in the paper, which can help repair tool designers to improve their techniques and tools.","PeriodicalId":162410,"journal":{"name":"2021 IEEE 32nd International Symposium on Software Reliability Engineering (ISSRE)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 32nd International Symposium on Software Reliability Engineering (ISSRE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSRE52982.2021.00031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

In the past years, research on automatic program repair (APR), in particular on test-suite-based approaches, has significantly attracted the attention of researchers. Despite the advances in the field, it remains unclear how these techniques fare in the context of security—most approaches are evaluated using benchmarks of bugs that do not (only) contain security vulnerabilities. In this paper, we present our observations using 10 state-of-the-art test-suite-based automatic program repair tools on the DARPA Cyber Grand Challenge benchmark of vulnerabilities in C/C++. Our intention is to have a better understanding of the current state of automatic program repair tools when addressing security issues. In particular, our study is guided by the hypothesis that the efficiency of repair tools may not generalize to security vulnerabilities. We found that the 10 analyzed tools can only fix 30 out of 55 vulnerable programs—54.6 % of the considered issues. In particular, we found that APR tools with atomic change operators and brute-force search strategy (AE and GenProg) and brute-force functionality deletion (Kali) overall perform better at repairing security vulnerabilities (considering both efficiency and effectiveness). AE is the tool that individually repairs most programs with 20 out of 55 programs (36.4%). The causes for failing to repair are discussed in the paper, which can help repair tool designers to improve their techniques and tools.

查看原文本刊更多论文

安全漏洞自动程序修复技术的比较研究

在过去的几年里，自动程序修复(APR)的研究，特别是基于测试套件的方法，引起了研究人员的极大关注。尽管该领域取得了进展，但这些技术在安全性方面的表现仍不清楚——大多数方法都是使用不(仅)包含安全漏洞的错误基准来评估的。在本文中，我们展示了我们在DARPA网络大挑战C/ c++漏洞基准测试中使用10种基于测试套件的自动程序修复工具的观察结果。我们的目的是在处理安全问题时更好地了解自动程序修复工具的当前状态。特别是，我们的研究是基于一个假设，即修复工具的效率可能不会推广到安全漏洞。我们发现，这10个分析工具只能修复55个易受攻击程序中的30个，占考虑问题的54.6%。特别是，我们发现具有原子更改操作符和暴力搜索策略(AE和GenProg)以及暴力功能删除(Kali)的APR工具在修复安全漏洞(考虑效率和有效性)方面总体上表现更好。在55个程序中，有20个程序(36.4%)单独修复了大部分程序。本文讨论了造成维修失败的原因，有助于维修工具设计者改进维修技术和工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE 32nd International Symposium on Software Reliability Engineering (ISSRE)

自引率

0.00%

发文量