ReFixar: Multi-version Reasoning for Automated Repair of Regression Errors

2021 IEEE 32nd International Symposium on Software Reliability Engineering (ISSRE) Pub Date : 2021-10-01 DOI:10.1109/ISSRE52982.2021.00028

X. Le, Quang Loc Le

{"title":"ReFixar: Multi-version Reasoning for Automated Repair of Regression Errors","authors":"X. Le, Quang Loc Le","doi":"10.1109/ISSRE52982.2021.00028","DOIUrl":null,"url":null,"abstract":"Software programs evolve naturally as part of the ever-changing customer needs and fast-paced market. Software evolution, however, often introduces regression bugs, which un-duly break previously working functionalities of the software. To repair regression bugs, one needs to know when and where a bug emerged from, e.g., the bug-inducing code changes, to narrow down the search space. Unfortunately, existing state-of-the-art automated program repair (APR) techniques have not yet fully exploited this information, rendering them less efficient and effective to navigate through a potentially large search space containing many plausible but incorrect solutions. In this work, we revisit APR on repairing regression errors in Java programs. We empirically show that existing state-of-the-art APR techniques do not perform well on regression bugs due to their algorithm design and lack of knowledge on bug inducing changes. We subsequently present ReFixar, a novel repair technique that leverages software evolution history to generate high quality patches for Java regression bugs. The key novelty that empowers ReFixar to more efficiently and effectively traverse the search space is two-fold: (1) A systematic way for multi-version reasoning to capture how a software evolves through its history, and (2) A novel search algorithm over a set of generic repair templates, derived from the principle of incorrectness logic and informed by both past bug fixes and their bug-inducing code changes; this enables ReFixar to achieve a balance of both genericity and specificity, i.e., generic common fix patterns of bugs and their specific contexts. We compare ReFixar against the state-of-the-art APR techniques on a data set of 51 real regression bugs from 28 large real-world programs. Experiments show that ReFixar significantly outperforms the best baseline by a large margin, i.e., ReFixar can fix correctly 24 bugs while the best baseline can only correctly fix 9 bugs.","PeriodicalId":162410,"journal":{"name":"2021 IEEE 32nd International Symposium on Software Reliability Engineering (ISSRE)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 32nd International Symposium on Software Reliability Engineering (ISSRE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSRE52982.2021.00028","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Software programs evolve naturally as part of the ever-changing customer needs and fast-paced market. Software evolution, however, often introduces regression bugs, which un-duly break previously working functionalities of the software. To repair regression bugs, one needs to know when and where a bug emerged from, e.g., the bug-inducing code changes, to narrow down the search space. Unfortunately, existing state-of-the-art automated program repair (APR) techniques have not yet fully exploited this information, rendering them less efficient and effective to navigate through a potentially large search space containing many plausible but incorrect solutions. In this work, we revisit APR on repairing regression errors in Java programs. We empirically show that existing state-of-the-art APR techniques do not perform well on regression bugs due to their algorithm design and lack of knowledge on bug inducing changes. We subsequently present ReFixar, a novel repair technique that leverages software evolution history to generate high quality patches for Java regression bugs. The key novelty that empowers ReFixar to more efficiently and effectively traverse the search space is two-fold: (1) A systematic way for multi-version reasoning to capture how a software evolves through its history, and (2) A novel search algorithm over a set of generic repair templates, derived from the principle of incorrectness logic and informed by both past bug fixes and their bug-inducing code changes; this enables ReFixar to achieve a balance of both genericity and specificity, i.e., generic common fix patterns of bugs and their specific contexts. We compare ReFixar against the state-of-the-art APR techniques on a data set of 51 real regression bugs from 28 large real-world programs. Experiments show that ReFixar significantly outperforms the best baseline by a large margin, i.e., ReFixar can fix correctly 24 bugs while the best baseline can only correctly fix 9 bugs.

查看原文本刊更多论文

修正:自动修复回归错误的多版本推理

软件程序随着不断变化的客户需求和快节奏的市场而自然发展。然而，软件进化经常会引入回归错误，这些错误会不合时宜地破坏软件先前的工作功能。要修复回归错误，需要知道错误出现的时间和地点，例如，导致错误的代码更改，以缩小搜索空间。不幸的是，现有的最先进的自动程序修复(APR)技术尚未充分利用这些信息，使得它们在包含许多看似合理但不正确的解决方案的潜在巨大搜索空间中导航的效率和效果较低。在这项工作中，我们将重新讨论APR在修复Java程序中的回归错误方面的作用。我们的经验表明，现有的最先进的APR技术由于其算法设计和缺乏对bug诱导变化的了解，在回归错误上表现不佳。我们随后介绍了ReFixar，这是一种新颖的修复技术，它利用软件进化历史为Java回归错误生成高质量的补丁。使ReFixar能够更高效和有效地遍历搜索空间的关键新颖之处有两个:(1)多版本推理的系统方法，以捕获软件在其历史中如何演变;(2)基于一组通用修复模板的新颖搜索算法，该算法源自不正确逻辑原理，并由过去的错误修复和导致错误的代码更改提供信息;这使得ReFixar能够实现通用性和专用性的平衡，也就是说，bug的通用通用修复模式及其特定上下文。我们将ReFixar与最先进的APR技术在28个大型现实世界程序的51个真实回归错误的数据集上进行了比较。实验表明，ReFixar明显优于最佳基线，即ReFixar可以正确修复24个错误，而最佳基线只能正确修复9个错误。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE 32nd International Symposium on Software Reliability Engineering (ISSRE)

自引率

0.00%

发文量