{"title":"基于Bagging集成技术的软件故障预测机器学习算法性能分析","authors":"Roshan Samantaray, Himansu Das","doi":"10.1109/ISCON57294.2023.10111952","DOIUrl":null,"url":null,"abstract":"Software development comes with a lot of challenges. Developers face various issues with performance and bugs. These issues increase with the scale of the project and if fewer individuals work on the development. It has become necessary to fix various bugs during development to ensure better performance and reduce the chances of failure during the deployment of the software. As a result of this, faults in the software must be predicted during the earlier stages of development. This would help in reducing the cost of maintenance of the software post-deployment. Multiple software fault prediction (SFP) approaches have been proposed to tackle this problem. These approaches can be improved by implementing ensemble techniques. In this paper, we study the effect of the bagging technique and how it helps to improve the predictive capability across various datasets. These datasets are provided and made open source by NASA. Decision Tree Classifier (DTC), Logistic Regression (LR), K-Nearest Neighbors (KNN), Gaussian Naive Bayes (GNB), and Support Vector Classifier (SVC) were used as base classifiers for the bagging method. Random forest (RFC) is an ensemble learning algorithm that uses the bagging technique. Based on the outcome of the results, it was concluded that RFC was the best-performing algorithm.","PeriodicalId":280183,"journal":{"name":"2023 6th International Conference on Information Systems and Computer Networks (ISCON)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Performance Analysis of Machine Learning Algorithms Using Bagging Ensemble Technique for Software Fault Prediction\",\"authors\":\"Roshan Samantaray, Himansu Das\",\"doi\":\"10.1109/ISCON57294.2023.10111952\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Software development comes with a lot of challenges. Developers face various issues with performance and bugs. These issues increase with the scale of the project and if fewer individuals work on the development. It has become necessary to fix various bugs during development to ensure better performance and reduce the chances of failure during the deployment of the software. As a result of this, faults in the software must be predicted during the earlier stages of development. This would help in reducing the cost of maintenance of the software post-deployment. Multiple software fault prediction (SFP) approaches have been proposed to tackle this problem. These approaches can be improved by implementing ensemble techniques. In this paper, we study the effect of the bagging technique and how it helps to improve the predictive capability across various datasets. These datasets are provided and made open source by NASA. Decision Tree Classifier (DTC), Logistic Regression (LR), K-Nearest Neighbors (KNN), Gaussian Naive Bayes (GNB), and Support Vector Classifier (SVC) were used as base classifiers for the bagging method. Random forest (RFC) is an ensemble learning algorithm that uses the bagging technique. Based on the outcome of the results, it was concluded that RFC was the best-performing algorithm.\",\"PeriodicalId\":280183,\"journal\":{\"name\":\"2023 6th International Conference on Information Systems and Computer Networks (ISCON)\",\"volume\":\"28 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 6th International Conference on Information Systems and Computer Networks (ISCON)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISCON57294.2023.10111952\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 6th International Conference on Information Systems and Computer Networks (ISCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCON57294.2023.10111952","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Performance Analysis of Machine Learning Algorithms Using Bagging Ensemble Technique for Software Fault Prediction
Software development comes with a lot of challenges. Developers face various issues with performance and bugs. These issues increase with the scale of the project and if fewer individuals work on the development. It has become necessary to fix various bugs during development to ensure better performance and reduce the chances of failure during the deployment of the software. As a result of this, faults in the software must be predicted during the earlier stages of development. This would help in reducing the cost of maintenance of the software post-deployment. Multiple software fault prediction (SFP) approaches have been proposed to tackle this problem. These approaches can be improved by implementing ensemble techniques. In this paper, we study the effect of the bagging technique and how it helps to improve the predictive capability across various datasets. These datasets are provided and made open source by NASA. Decision Tree Classifier (DTC), Logistic Regression (LR), K-Nearest Neighbors (KNN), Gaussian Naive Bayes (GNB), and Support Vector Classifier (SVC) were used as base classifiers for the bagging method. Random forest (RFC) is an ensemble learning algorithm that uses the bagging technique. Based on the outcome of the results, it was concluded that RFC was the best-performing algorithm.