{"title":"Software Defect Prediction Model Based on Improved BP Neural Network","authors":"Y. Liu, Fengli Sun, Jun Yang, Donghong Zhou","doi":"10.1109/DSA.2019.00095","DOIUrl":null,"url":null,"abstract":"This paper proposes a software defect prediction algorithm based on improved BP neural network, which can effectively improve the prediction accuracy caused by the imbalance of the category distribution of data within the project. In this paper, in order to improve the data imbalance in the project, we use SMOTE algorithm to increase the minority samples (defective software modules), the ENN (Extended Nearest Neighbor Algorithm ) data cleaning algorithm is performed for the post-sampling data noise problem. The SA ( Simulated Annealing ) algorithm is used to optimize the four- layers BP neural network to establish the classification prediction model on the AEEEM database. We use cross validation to evaluate the performance of the proposed algorithm on AEEEM database. The results show that the proposed algorithm can effectively improve the performance of the model in predicting unbalanced data.","PeriodicalId":342719,"journal":{"name":"2019 6th International Conference on Dependable Systems and Their Applications (DSA)","volume":"152 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 6th International Conference on Dependable Systems and Their Applications (DSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSA.2019.00095","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
This paper proposes a software defect prediction algorithm based on improved BP neural network, which can effectively improve the prediction accuracy caused by the imbalance of the category distribution of data within the project. In this paper, in order to improve the data imbalance in the project, we use SMOTE algorithm to increase the minority samples (defective software modules), the ENN (Extended Nearest Neighbor Algorithm ) data cleaning algorithm is performed for the post-sampling data noise problem. The SA ( Simulated Annealing ) algorithm is used to optimize the four- layers BP neural network to establish the classification prediction model on the AEEEM database. We use cross validation to evaluate the performance of the proposed algorithm on AEEEM database. The results show that the proposed algorithm can effectively improve the performance of the model in predicting unbalanced data.