Comparison of back propagation training algorithms for software defect prediction

Ishani Arora, A. Saha
{"title":"Comparison of back propagation training algorithms for software defect prediction","authors":"Ishani Arora, A. Saha","doi":"10.1109/IC3I.2016.7917934","DOIUrl":null,"url":null,"abstract":"The cost of deleting a software bug increases ten times as it is floated onto the next phase of software development lifecycle (SDLC). This makes the task of the project managers difficult and also degrades the quality of the output software product. Software defect prediction (SDP) was proposed as a solution to the problem which could anticipate the defective modules and hence, deal with them in an efficient and effective manner in advance. The adequacy of artificial neural networks (ANNs) to handle the complex nonlinear relationships between the software metrics and the defect data demonstrates their suitability to build the defect prediction models. In this paper, multilayer feed forward back propagation based neural networks were constructed using seven defect datasets from the PROMISE repository. An empirical comparison of Levenberg-Marquardt (LM), Resilient back propagation (RP) and Bayesian Regularization (BR) back propagation training algorithms was performed using statistical measures such as MSE and R2 values and the parameters computed from the confusion matrix. Bayesian based back propagation training method performed better than the LM and RP techniques in terms of minimizing mean square error and type II error and maximizing accuracy, sensitivity and R2 value. An accuracy of more than 90 percent was achieved by BR on all the seven datasets and the best data fit during the regression analysis was shown with a R2 value of 0.96. Overall, it is the context and the criticality of the software project which will aid the project managers to prioritize the performance measures and hence, decide upon the training algorithm to be applied, according to the goals and resources available.","PeriodicalId":305971,"journal":{"name":"2016 2nd International Conference on Contemporary Computing and Informatics (IC3I)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 2nd International Conference on Contemporary Computing and Informatics (IC3I)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3I.2016.7917934","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

Abstract

The cost of deleting a software bug increases ten times as it is floated onto the next phase of software development lifecycle (SDLC). This makes the task of the project managers difficult and also degrades the quality of the output software product. Software defect prediction (SDP) was proposed as a solution to the problem which could anticipate the defective modules and hence, deal with them in an efficient and effective manner in advance. The adequacy of artificial neural networks (ANNs) to handle the complex nonlinear relationships between the software metrics and the defect data demonstrates their suitability to build the defect prediction models. In this paper, multilayer feed forward back propagation based neural networks were constructed using seven defect datasets from the PROMISE repository. An empirical comparison of Levenberg-Marquardt (LM), Resilient back propagation (RP) and Bayesian Regularization (BR) back propagation training algorithms was performed using statistical measures such as MSE and R2 values and the parameters computed from the confusion matrix. Bayesian based back propagation training method performed better than the LM and RP techniques in terms of minimizing mean square error and type II error and maximizing accuracy, sensitivity and R2 value. An accuracy of more than 90 percent was achieved by BR on all the seven datasets and the best data fit during the regression analysis was shown with a R2 value of 0.96. Overall, it is the context and the criticality of the software project which will aid the project managers to prioritize the performance measures and hence, decide upon the training algorithm to be applied, according to the goals and resources available.
软件缺陷预测的反向传播训练算法比较
在软件开发生命周期(SDLC)的下一个阶段,删除软件缺陷的成本会增加十倍。这使得项目经理的任务变得困难,也降低了输出软件产品的质量。软件缺陷预测(Software defect prediction, SDP)是一种能够提前预测缺陷模块并对其进行有效处理的方法。人工神经网络(ann)处理软件度量与缺陷数据之间复杂非线性关系的能力证明了其建立缺陷预测模型的适用性。本文利用PROMISE存储库中的7个缺陷数据集,构建了基于多层前馈-反向传播的神经网络。利用MSE和R2值等统计度量以及从混淆矩阵中计算的参数,对Levenberg-Marquardt (LM)、弹性反向传播(RP)和贝叶斯正则化(BR)反向传播训练算法进行了实证比较。基于贝叶斯的反向传播训练方法在均方误差和II型误差最小、精度、灵敏度和R2值最大等方面优于LM和RP技术。BR在所有7个数据集上的准确率均超过90%,回归分析期间的最佳数据拟合显示R2值为0.96。总的来说,软件项目的环境和重要性将帮助项目经理根据目标和可用资源确定性能度量的优先级,从而决定要应用的训练算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信