An Empirical Analysis of Threshold Techniques for identifying Faulty Classes

Int. J. Next Gener. Comput. Pub Date : 2020-12-22 DOI:10.47164/IJNGC.V11I3.642

N. Kaur, Hardeep Singh

{"title":"An Empirical Analysis of Threshold Techniques for identifying Faulty Classes","authors":"N. Kaur, Hardeep Singh","doi":"10.47164/IJNGC.V11I3.642","DOIUrl":null,"url":null,"abstract":"The experimental validation of the proficiency of the proposed techniques is a mandatory task in the research regime. The existing literature has proved the presence of extensive work attained the statistical validation of software metrics by utilizing them in the development of fault prediction models, where, both statistical and machine learning techniques were engaged into the construction of the models being capable of identifying faulty and non-faulty classes. On the contrary, the research area involving the investigation of threshold concept has not gained sufficient maturity. An effective threshold technique can assist in the identification of optimal cut-off value in software metric which can discriminate the faulty from non-faulty classes with minimal misclassification rate. The idea of threshold calculation can make the applicability of the existing metrics in software industries, a much easier task. As the developers only need to know the cut-off values which can help them to concentrate on the specific classes that exceeds the computed thresholds. Also, the presence of peculiarity in the software metric index can alert the testers and in turn helps them to disburse the resources systematically. The current study empirically validated and compared the discriminating strength of two threshold techniques, i.e., ROC curve and Alves Rankings, on the public dataset. This study selected twenty Object Oriented (OO) measures for the process of threshold calculation. Besides, the widely addressed metric suite proposed by Chidamber and Kemerer, this study also considered other fourteen OO measures for the experiment. Furthermore, Wilcoxon signed ranks test was used to enquire the classification difference between the aforementioned threshold techniques. The outcome from the statistical analysis revealed the better predictive capability of ROC curve than the Alves Rankings.","PeriodicalId":351421,"journal":{"name":"Int. J. Next Gener. Comput.","volume":"69 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Next Gener. Comput.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47164/IJNGC.V11I3.642","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The experimental validation of the proficiency of the proposed techniques is a mandatory task in the research regime. The existing literature has proved the presence of extensive work attained the statistical validation of software metrics by utilizing them in the development of fault prediction models, where, both statistical and machine learning techniques were engaged into the construction of the models being capable of identifying faulty and non-faulty classes. On the contrary, the research area involving the investigation of threshold concept has not gained sufficient maturity. An effective threshold technique can assist in the identification of optimal cut-off value in software metric which can discriminate the faulty from non-faulty classes with minimal misclassification rate. The idea of threshold calculation can make the applicability of the existing metrics in software industries, a much easier task. As the developers only need to know the cut-off values which can help them to concentrate on the specific classes that exceeds the computed thresholds. Also, the presence of peculiarity in the software metric index can alert the testers and in turn helps them to disburse the resources systematically. The current study empirically validated and compared the discriminating strength of two threshold techniques, i.e., ROC curve and Alves Rankings, on the public dataset. This study selected twenty Object Oriented (OO) measures for the process of threshold calculation. Besides, the widely addressed metric suite proposed by Chidamber and Kemerer, this study also considered other fourteen OO measures for the experiment. Furthermore, Wilcoxon signed ranks test was used to enquire the classification difference between the aforementioned threshold techniques. The outcome from the statistical analysis revealed the better predictive capability of ROC curve than the Alves Rankings.

查看原文本刊更多论文

故障类识别的阈值技术实证分析

对所提出的技术的熟练程度进行实验验证是研究制度中的强制性任务。现有文献已经证明了大量工作的存在，通过在故障预测模型的开发中利用软件度量来获得软件度量的统计验证，其中，统计和机器学习技术都参与了能够识别故障和非故障类的模型的构建。相反，涉及阈值概念研究的研究领域还不够成熟。一种有效的阈值技术可以帮助识别软件度量中的最佳截止值，以最小的误分类率区分故障类和非故障类。阈值计算的思想可以使现有度量在软件行业中的适用性变得容易得多。因为开发人员只需要知道截止值，这可以帮助他们专注于超出计算阈值的特定类。此外，软件度量索引中特性的存在可以提醒测试人员，并反过来帮助他们系统地分配资源。本研究在公开数据集上对ROC曲线和Alves排名两种阈值技术的判别强度进行了实证验证和比较。本研究选取了20个面向对象(OO)的度量进行阈值计算。除了Chidamber和Kemerer提出的广泛解决的度量套件之外，本研究还考虑了实验的其他14个OO度量。此外，采用Wilcoxon符号秩检验来考察上述阈值技术的分类差异。统计分析结果显示ROC曲线的预测能力优于Alves排名。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Int. J. Next Gener. Comput.

自引率

0.00%

发文量