Using blind analysis for software engineering experiments

Proceedings of the 19th International Conference on Evaluation and Assessment in Software Engineering Pub Date : 2015-04-27 DOI:10.1145/2745802.2745832

Boyce Sigweni, M. Shepperd

{"title":"Using blind analysis for software engineering experiments","authors":"Boyce Sigweni, M. Shepperd","doi":"10.1145/2745802.2745832","DOIUrl":null,"url":null,"abstract":"Context: In recent years there has been growing concern about conflicting experimental results in empirical software engineering. This has been paralleled by awareness of how bias can impact research results. Objective: To explore the practicalities of blind analysis of experimental results to reduce bias. Method: We apply blind analysis to a real software engineering experiment that compares three feature weighting approaches with a naïve benchmark (sample mean) to the Finnish software effort data set. We use this experiment as an example to explore blind analysis as a method to reduce researcher bias. Results: Our experience shows that blinding can be a relatively straightforward procedure. We also highlight various statistical analysis decisions which ought not be guided by the hunt for statistical significance and show that results can be inverted merely through a seemingly inconsequential statistical nicety (i.e., the degree of trimming). Conclusion: Whilst there are minor challenges and some limits to the degree of blinding possible, blind analysis is a very practical and easy to implement method that supports more objective analysis of experimental results. Therefore we argue that blind analysis should be the norm for analysing software engineering experiments.","PeriodicalId":324836,"journal":{"name":"Proceedings of the 19th International Conference on Evaluation and Assessment in Software Engineering","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 19th International Conference on Evaluation and Assessment in Software Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2745802.2745832","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 12

Abstract

Context: In recent years there has been growing concern about conflicting experimental results in empirical software engineering. This has been paralleled by awareness of how bias can impact research results. Objective: To explore the practicalities of blind analysis of experimental results to reduce bias. Method: We apply blind analysis to a real software engineering experiment that compares three feature weighting approaches with a naïve benchmark (sample mean) to the Finnish software effort data set. We use this experiment as an example to explore blind analysis as a method to reduce researcher bias. Results: Our experience shows that blinding can be a relatively straightforward procedure. We also highlight various statistical analysis decisions which ought not be guided by the hunt for statistical significance and show that results can be inverted merely through a seemingly inconsequential statistical nicety (i.e., the degree of trimming). Conclusion: Whilst there are minor challenges and some limits to the degree of blinding possible, blind analysis is a very practical and easy to implement method that supports more objective analysis of experimental results. Therefore we argue that blind analysis should be the norm for analysing software engineering experiments.

查看原文本刊更多论文

应用盲分析进行软件工程实验

背景:近年来，人们越来越关注经验软件工程中相互冲突的实验结果。与此同时，人们也意识到偏见会如何影响研究结果。目的:探讨对实验结果进行盲分析以减少偏倚的可行性。方法:我们将盲分析应用于一个真实的软件工程实验，该实验将芬兰软件工作数据集的三种特征加权方法与naïve基准(样本均值)进行比较。我们以这个实验为例，探讨盲法分析作为减少研究者偏倚的方法。结果:我们的经验表明，盲法可以是一个相对简单的程序。我们还强调了各种统计分析决策，这些决策不应该以寻找统计显著性为指导，并表明结果可以仅仅通过看似无关紧要的统计准确性(即修剪程度)来反转。结论:虽然存在一些小的挑战和可能的盲化程度的一些限制，但盲分析是一种非常实用且易于实施的方法，可以支持对实验结果进行更客观的分析。因此，我们认为盲目分析应该成为分析软件工程实验的规范。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 19th International Conference on Evaluation and Assessment in Software Engineering

自引率

0.00%

发文量