A Systematic Review of Ensemble Techniques for Software Defect and Change Prediction

e Informatica Softw. Eng. J. Pub Date : 2022-01-01 DOI:10.37190/e-inf220105

Megha Khanna

{"title":"A Systematic Review of Ensemble Techniques for Software Defect and Change Prediction","authors":"Megha Khanna","doi":"10.37190/e-inf220105","DOIUrl":null,"url":null,"abstract":"Background: The use of ensemble techniques have steadily gained popularity in several software quality assurance activities. These aggregated classifiers have proven to be superior than their constituent base models. Though ensemble techniques have been widely used in key areas such as Software Defect Prediction (SDP) and Software Change Prediction (SCP), the current state-of-the-art concerning the use of these techniques needs scrutinization. Aim: The study aims to assess, evaluate and uncover possible research gaps with respect to the use of ensemble techniques in SDP and SCP. Method: This study conducts an extensive literature review of 77 primary studies on the basis of the category, application, rules of formulation, performance, and possible threats of the proposed/utilized ensemble techniques. Results: Ensemble techniques were primarily categorized on the basis of similarity, aggregation, relationship, diversity, and dependency of their base models. They were also found effective in several applications such as their use as a learning algorithm for developing SDP/SCP models and for addressing the class imbalance issue. Conclusion: The results of the review ascertain the need of more studies to propose, assess, validate, and compare various categories of ensemble techniques for diverse applications in SDP/SCP such as transfer learning and online learning. evaluating prediction in realistic online scenarios or unavailability of appropriate training data. We investigated the primary studies to ascertain the various applications of ET i.e., what was the underlying use of ET in SDP/SCP. The various applications are listed as under along with the percentage of primary studies that utilized the ET for the particular application.","PeriodicalId":11452,"journal":{"name":"e Informatica Softw. Eng. J.","volume":"98 1","pages":"220105"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"e Informatica Softw. Eng. J.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.37190/e-inf220105","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Background: The use of ensemble techniques have steadily gained popularity in several software quality assurance activities. These aggregated classifiers have proven to be superior than their constituent base models. Though ensemble techniques have been widely used in key areas such as Software Defect Prediction (SDP) and Software Change Prediction (SCP), the current state-of-the-art concerning the use of these techniques needs scrutinization. Aim: The study aims to assess, evaluate and uncover possible research gaps with respect to the use of ensemble techniques in SDP and SCP. Method: This study conducts an extensive literature review of 77 primary studies on the basis of the category, application, rules of formulation, performance, and possible threats of the proposed/utilized ensemble techniques. Results: Ensemble techniques were primarily categorized on the basis of similarity, aggregation, relationship, diversity, and dependency of their base models. They were also found effective in several applications such as their use as a learning algorithm for developing SDP/SCP models and for addressing the class imbalance issue. Conclusion: The results of the review ascertain the need of more studies to propose, assess, validate, and compare various categories of ensemble techniques for diverse applications in SDP/SCP such as transfer learning and online learning. evaluating prediction in realistic online scenarios or unavailability of appropriate training data. We investigated the primary studies to ascertain the various applications of ET i.e., what was the underlying use of ET in SDP/SCP. The various applications are listed as under along with the percentage of primary studies that utilized the ET for the particular application.

查看原文本刊更多论文

软件缺陷和变更预测集成技术的系统回顾

背景:集成技术的使用在几个软件质量保证活动中逐渐流行起来。这些聚合分类器已被证明优于它们的组成基本模型。尽管集成技术已经广泛应用于关键领域，例如软件缺陷预测(SDP)和软件变更预测(SCP)，但是目前关于这些技术使用的最新技术需要仔细检查。目的:本研究旨在评估、评价和揭示在SDP和SCP中使用集成技术方面可能存在的研究差距。方法:本研究根据所提出/使用的集成技术的类别、应用、制定规则、性能和可能的威胁，对77项主要研究进行了广泛的文献综述。结果:集成技术主要根据其基础模型的相似性、聚合性、关系性、多样性和依赖性进行分类。它们在一些应用中也被发现是有效的，比如它们被用作开发SDP/SCP模型的学习算法和解决类不平衡问题。结论:本综述的结果表明，需要更多的研究来提出、评估、验证和比较不同类型的集成技术在SDP/SCP中的不同应用，如迁移学习和在线学习。评估在现实的在线场景中的预测或缺乏适当的训练数据。我们调查了初步研究，以确定ET的各种应用，即，ET在SDP/SCP中的潜在用途是什么。下面列出了各种应用，以及为特定应用使用ET的初步研究的百分比。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

e Informatica Softw. Eng. J.

自引率

0.00%

发文量