IMDAC: A robust intelligent software defect prediction model via multi‐objective optimization and end‐to‐end hybrid deep learning networks

Kun Zhu, Nana Zhang, Changjun Jiang, Dandan Zhu
{"title":"IMDAC: A robust intelligent software defect prediction model via multi‐objective optimization and end‐to‐end hybrid deep learning networks","authors":"Kun Zhu, Nana Zhang, Changjun Jiang, Dandan Zhu","doi":"10.1002/spe.3274","DOIUrl":null,"url":null,"abstract":"Abstract Software defect prediction (SDP) aims to build an effective prediction model for historical defect data from software repositories by some specialized techniques or algorithms, and predict the defect proneness of new software modules. Nevertheless, the complex internal intrinsic structure hidden behind the defect data makes it challenging for the built prediction model to capture the most expressive defect feature representations, and largely limits the SDP performance. Fortunately, artificial intelligence is interacting closely with humans and provides powerful intelligent technical support for addressing these SDP issues. In this article, we propose a robust intelligent SDP model called IMDAC based on deep learning and soft computing techniques. This model has three main advantages: (1) an effective deep generative network—InfoGAN (information maximizing GANs) is employed to conduct data augmentation, namely generating sufficient defect instances and achieving defect class balance simultaneously. (2) Select the fewest representative feature subset for the minimum error via an advanced multi‐objective optimization approach—MSEA (multi‐stage evolutionary algorithm). (3) Build a powerful end‐to‐end deep defect predictor by hybrid deep learning techniques—DAE (Denoising AutoEncoder) and CNN (convolutional neural network), which can not only reconstruct a clean “repaired” input with strong robustness and generalization capabilities via DAE, but also learn the abstract deep semantic features with strong discriminating capability via CNN. Experimental results verify the superiority and robustness of the IMDAC model across 15 software projects.","PeriodicalId":21899,"journal":{"name":"Software: Practice and Experience","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Software: Practice and Experience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/spe.3274","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract Software defect prediction (SDP) aims to build an effective prediction model for historical defect data from software repositories by some specialized techniques or algorithms, and predict the defect proneness of new software modules. Nevertheless, the complex internal intrinsic structure hidden behind the defect data makes it challenging for the built prediction model to capture the most expressive defect feature representations, and largely limits the SDP performance. Fortunately, artificial intelligence is interacting closely with humans and provides powerful intelligent technical support for addressing these SDP issues. In this article, we propose a robust intelligent SDP model called IMDAC based on deep learning and soft computing techniques. This model has three main advantages: (1) an effective deep generative network—InfoGAN (information maximizing GANs) is employed to conduct data augmentation, namely generating sufficient defect instances and achieving defect class balance simultaneously. (2) Select the fewest representative feature subset for the minimum error via an advanced multi‐objective optimization approach—MSEA (multi‐stage evolutionary algorithm). (3) Build a powerful end‐to‐end deep defect predictor by hybrid deep learning techniques—DAE (Denoising AutoEncoder) and CNN (convolutional neural network), which can not only reconstruct a clean “repaired” input with strong robustness and generalization capabilities via DAE, but also learn the abstract deep semantic features with strong discriminating capability via CNN. Experimental results verify the superiority and robustness of the IMDAC model across 15 software projects.
IMDAC:基于多目标优化和端到端混合深度学习网络的鲁棒智能软件缺陷预测模型
摘要软件缺陷预测(Software defect prediction, SDP)旨在通过一些专门的技术或算法,对软件库中的历史缺陷数据建立有效的预测模型,预测新软件模块的缺陷倾向。然而,隐藏在缺陷数据背后的复杂的内部固有结构使得所构建的预测模型很难捕捉到最具表现力的缺陷特征表示,这在很大程度上限制了SDP的性能。幸运的是,人工智能正在与人类密切互动,为解决这些SDP问题提供强大的智能技术支持。在本文中,我们提出了一个基于深度学习和软计算技术的鲁棒智能SDP模型IMDAC。该模型有三个主要优点:(1)利用有效的深度生成网络——信息最大化gan (information maximize GANs)进行数据扩充,即生成足够的缺陷实例,同时实现缺陷类平衡。(2)采用一种先进的多目标优化方法-多阶段进化算法(msea),选择具有最小误差的最小代表性特征子集。(3)采用去噪自动编码器(Denoising AutoEncoder)和卷积神经网络(CNN)的混合深度学习技术构建强大的端到端深度缺陷预测器,该预测器不仅可以通过DAE重建具有较强鲁棒性和泛化能力的干净“修复”输入,还可以通过CNN学习具有较强判别能力的抽象深度语义特征。实验结果验证了IMDAC模型在15个软件项目中的优越性和鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信