A new approach to software defect prediction based on convolutional neural network and bidirectional long short-term memory

Production Systems and Information Engineering Pub Date : 1900-01-01 DOI:10.32968/psaie.2022.3.1

N. A. A. Khleel, K. Nehéz

{"title":"A new approach to software defect prediction based on convolutional neural network and bidirectional long short-term memory","authors":"N. A. A. Khleel, K. Nehéz","doi":"10.32968/psaie.2022.3.1","DOIUrl":null,"url":null,"abstract":"Software defect prediction (SDP) plays an important role in improving software quality and reliability while reducing software maintenance cost. The problem in the field of SDP is how to determine the defective source code with high accuracy. To build more accurate predictor models, a lot of features are presented, e.g., static code features, social network features, and process features, etc. Several machine learning (ML) and deep learning (DL) algorithms have been developed and adopted to identify and remove defects from the source code, where previous studies have proved that DL algorithms are promising techniques for predicting software defects. The aim of this study is to investigate the prediction performance of two DL algorithms namely, Convolutional Neural Network (CNN) and Bidirectional Long short-term memory (BI-LSTM) in the domain of SDP. To establish the effectiveness of the proposed approach, the experiments were conducted on the available benchmark datasets which obtained from open-source java projects GitHub repository and the models were evaluated by applying seven evaluation metrics which are accuracy, precision, recall, f-measure, matthews correlation coefficient (MCC), area under the ROC curve (AUC), mean square error (MSE). We found out that the best accuracy obtained on training dataset is 81% by using CNN model, while the best accuracy obtained on validation dataset is 80% by using BI-LSTM model. The best AUC obtained on training dataset is 88% by using CNN model, while the best AUC obtained on validation dataset is 83% by using the both models. It is nearly impossible to rule which model is better than the other so every model can be analyzed separately and the best model according to the problem at hand can be used, therefore, based on the problem of this study, The evaluation results show the effectiveness of our proposed models based on standard performance evaluation criteria.","PeriodicalId":117509,"journal":{"name":"Production Systems and Information Engineering","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Production Systems and Information Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32968/psaie.2022.3.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Software defect prediction (SDP) plays an important role in improving software quality and reliability while reducing software maintenance cost. The problem in the field of SDP is how to determine the defective source code with high accuracy. To build more accurate predictor models, a lot of features are presented, e.g., static code features, social network features, and process features, etc. Several machine learning (ML) and deep learning (DL) algorithms have been developed and adopted to identify and remove defects from the source code, where previous studies have proved that DL algorithms are promising techniques for predicting software defects. The aim of this study is to investigate the prediction performance of two DL algorithms namely, Convolutional Neural Network (CNN) and Bidirectional Long short-term memory (BI-LSTM) in the domain of SDP. To establish the effectiveness of the proposed approach, the experiments were conducted on the available benchmark datasets which obtained from open-source java projects GitHub repository and the models were evaluated by applying seven evaluation metrics which are accuracy, precision, recall, f-measure, matthews correlation coefficient (MCC), area under the ROC curve (AUC), mean square error (MSE). We found out that the best accuracy obtained on training dataset is 81% by using CNN model, while the best accuracy obtained on validation dataset is 80% by using BI-LSTM model. The best AUC obtained on training dataset is 88% by using CNN model, while the best AUC obtained on validation dataset is 83% by using the both models. It is nearly impossible to rule which model is better than the other so every model can be analyzed separately and the best model according to the problem at hand can be used, therefore, based on the problem of this study, The evaluation results show the effectiveness of our proposed models based on standard performance evaluation criteria.

查看原文本刊更多论文

基于卷积神经网络和双向长短期记忆的软件缺陷预测新方法

软件缺陷预测在提高软件质量和可靠性、降低软件维护成本方面发挥着重要作用。如何高精度地确定有缺陷的源代码是当前软件开发领域的难题。为了构建更准确的预测模型，提出了许多特征，例如，静态代码特征、社会网络特征和过程特征等。已经开发并采用了几种机器学习(ML)和深度学习(DL)算法来识别和消除源代码中的缺陷，其中先前的研究已经证明DL算法是预测软件缺陷的有前途的技术。本研究的目的是研究卷积神经网络(CNN)和双向长短期记忆(BI-LSTM)两种深度学习算法在SDP领域的预测性能。为了验证该方法的有效性，在开源java项目GitHub库中获得了可用的基准数据集，并采用准确率、精密度、召回率、f-measure、马修斯相关系数(MCC)、ROC曲线下面积(AUC)、均方误差(MSE) 7个评价指标对模型进行了评价。我们发现使用CNN模型在训练数据集上获得的最佳准确率为81%，而使用BI-LSTM模型在验证数据集上获得的最佳准确率为80%。使用CNN模型在训练数据集上获得的最佳AUC为88%，两种模型同时使用在验证数据集上获得的最佳AUC为83%。由于几乎不可能判断哪个模型比另一个模型更好，因此每个模型都可以单独分析，并根据手头的问题使用最佳模型，因此，基于本研究的问题，评估结果表明我们提出的基于标准性能评估标准的模型的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Production Systems and Information Engineering

自引率

0.00%

发文量