Predicted of Software Fault Based on Random Forest and K-Nearest Neighbor

2022 4th International Conference on Advanced Science and Engineering (ICOASE) Pub Date : 2022-09-21 DOI:10.1109/ICOASE56293.2022.10075596

Mustafa Zaki Mohammed, I. Saleh



Abstract

Software systems have become increasingly complex and adaptable. As a result, it is critical to find and fix software design flaws on a regular basis. Software fault prediction, a technique for predicting problems from historical data, is useful in the early phases of development for improving software quality and for reducing testing time and expense. Several machine learning approaches have been applied to anticipate software flaws from historical databases. This paper focuses on building a predictor for software defects based on previous data. For this purpose, two supervised machine learning techniques, K-Nearest Neighbor (KNN) and Random Forest (RF), were applied to defect datasets from NASA's PROMISE repository to forecast future software failures. A set of performance measures (accuracy, precision, recall, and F1 measure) was used to evaluate the models. The RF model performed well compared with the KNN model, with maximum and minimum accuracies of 99% and 88% on the MC1 and KC1 datasets, respectively. In general, the study's findings suggest that software defect metrics can be used to identify problematic modules, and that the RF model can be used to anticipate software errors.
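The four evaluation measures named in the abstract can all be derived from confusion-matrix counts. The following is a minimal sketch in plain Python, assuming binary labels in which 1 marks a defective module; the function names and the toy labels are illustrative assumptions and are not taken from the paper or its datasets.

```python
from collections import Counter


def confusion_counts(y_true, y_pred, positive=1):
    """Return (tp, fp, fn, tn), treating `positive` as the defective class."""
    counts = Counter()
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            counts["tp" if actual == positive else "fp"] += 1
        else:
            counts["fn" if actual == positive else "tn"] += 1
    return counts["tp"], counts["fp"], counts["fn"], counts["tn"]


def classification_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall, and F1 for a binary classifier."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    # Precision: of the modules flagged as defective, how many really were.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # Recall: of the truly defective modules, how many were flagged.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1: harmonic mean of precision and recall.
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Toy labels (hypothetical): 1 = defective module, 0 = clean module.
y_true = [1, 0, 0, 1, 0, 0, 0, 1, 0, 0]
y_pred = [1, 0, 0, 0, 0, 1, 0, 1, 0, 0]
metrics = classification_metrics(y_true, y_pred)
```

Defect datasets are typically imbalanced, with far fewer defective modules than clean ones, so accuracy alone can look high even when few defects are caught. Reporting precision, recall, and F1 alongside it gives a fuller picture.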


