Towards Confidence with Capture-recapture Estimation: An Exploratory Study of Dependence within Inspections

Proceedings of the 21st International Conference on Evaluation and Assessment in Software Engineering Pub Date : 2017-06-15 DOI:10.1145/3084226.3084250

Guoping Rong, Bohan Liu, He Zhang, Qiuping Zhang, Dong Shao

{"title":"Towards Confidence with Capture-recapture Estimation: An Exploratory Study of Dependence within Inspections","authors":"Guoping Rong, Bohan Liu, He Zhang, Qiuping Zhang, Dong Shao","doi":"10.1145/3084226.3084250","DOIUrl":null,"url":null,"abstract":"Background: Capture-ReCapture (CRC), as a technique for post-inspection defect estimation, has been studied in Software Engineering (SE) community since 1990s. While most studies focused on the performance evaluation of various CRC models and estimators, few have been done on the assessment of the credibility of estimation results, rendering the difficulty of decision-making for quality management when applying CRC for defect estimation. Objective: This research aims to explore and investigate a reliable and practical approach to assess the credibility of CRC based defect estimation. Method: One fundamental assumption of applying CRC method is the statistical independence of samples that can be measured by 'Coefficient of CoVariation' (CCV). We applied CCV as an indicator of the statistical dependence between the observations (i.e., the defects detected by inspectors), and assessed the estimation results of CRC with the published datasets in SE literature by examining the correlation between Relative Error (RE) and CCV. Based on the observed correlation, we further propose CĈV, which replaces the unknown N (the actual number of defects) with the estimated number (N), to assess the credibility of CRC estimates. Results: We found that most datasets are with non-zero CCVs and the R2 (Coefficient of Determination) of non-linear curve-fitting for their CCVs and REs is higher than 0.8. Conclusions: Our study shows the evidence that the statistical dependence among inspectors is ubiquitous in the existing CRC-related studies. Besides, the significant correlation between CCV (by CĈV in practice) and RE may enable the possibility of the assessment of CRC-based estimation in support of quality management.","PeriodicalId":192290,"journal":{"name":"Proceedings of the 21st International Conference on Evaluation and Assessment in Software Engineering","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 21st International Conference on Evaluation and Assessment in Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3084226.3084250","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Background: Capture-ReCapture (CRC), as a technique for post-inspection defect estimation, has been studied in Software Engineering (SE) community since 1990s. While most studies focused on the performance evaluation of various CRC models and estimators, few have been done on the assessment of the credibility of estimation results, rendering the difficulty of decision-making for quality management when applying CRC for defect estimation. Objective: This research aims to explore and investigate a reliable and practical approach to assess the credibility of CRC based defect estimation. Method: One fundamental assumption of applying CRC method is the statistical independence of samples that can be measured by 'Coefficient of CoVariation' (CCV). We applied CCV as an indicator of the statistical dependence between the observations (i.e., the defects detected by inspectors), and assessed the estimation results of CRC with the published datasets in SE literature by examining the correlation between Relative Error (RE) and CCV. Based on the observed correlation, we further propose CĈV, which replaces the unknown N (the actual number of defects) with the estimated number (N), to assess the credibility of CRC estimates. Results: We found that most datasets are with non-zero CCVs and the R2 (Coefficient of Determination) of non-linear curve-fitting for their CCVs and REs is higher than 0.8. Conclusions: Our study shows the evidence that the statistical dependence among inspectors is ubiquitous in the existing CRC-related studies. Besides, the significant correlation between CCV (by CĈV in practice) and RE may enable the possibility of the assessment of CRC-based estimation in support of quality management.

查看原文本刊更多论文

用捕获-再捕获估计实现信心:检验中依赖性的探索性研究

背景:自20世纪90年代以来，捕获-重捕获(CRC)作为一种检查后缺陷估计技术在软件工程界得到了广泛的研究。大多数研究都集中在对各种CRC模型和估计器的性能评估上，而对估计结果可信度的评估却很少，这使得在应用CRC进行缺陷估计时，质量管理的决策存在困难。目的:本研究旨在探索和研究一种可靠和实用的方法来评估基于CRC的缺陷估计的可信度。方法:应用CRC方法的一个基本假设是样本的统计独立性，可以用“协变系数”(CCV)来衡量。我们将CCV作为观测值(即检查员检测到的缺陷)之间的统计依赖性的指标，并通过检查相对误差(Relative Error, RE)与CCV之间的相关性，对SE文献中已发表的数据集的CRC估计结果进行评估。基于观察到的相关性，我们进一步提出CĈV，将未知N(实际缺陷数量)替换为估计数量(N)，以评估CRC估计的可信度。结果:我们发现大多数数据集的ccv都是非零的，其ccv和REs的非线性曲线拟合R2(决定系数)大于0.8。结论:我们的研究表明，在现有的crc相关研究中，检查员之间的统计依赖是普遍存在的。此外，CCV(在实践中通过CĈV)与RE之间的显著相关性可能使基于CCV的评估支持质量管理的评估成为可能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 21st International Conference on Evaluation and Assessment in Software Engineering

自引率

0.00%

发文量