复制预测模型的实证研究经验

Proceedings Eighth IEEE Symposium on Software Metrics Pub Date : 2002-06-04 DOI:10.1109/METRIC.2002.1011340

M. C. Ohlsson, P. Runeson

{"title":"复制预测模型的实证研究经验","authors":"M. C. Ohlsson, P. Runeson","doi":"10.1109/METRIC.2002.1011340","DOIUrl":null,"url":null,"abstract":"When conducting empirical studies, replications are important contributors to investigating the generality of the studies. By replicating a study in another context, we investigate what impact the specific environment has, related to the effect of the studied object. In this paper, we define different levels of replication to characterise the similarities and differences between an original study and a replication, with particular focus on prediction models for the identification of fault-prone software components. Further, we derive a set of issues and concerns which are important in order to enable replication of an empirical study and to enable practitioners to use the results. To illustrate the importance of the issues raised, a replication case study is presented in the domain of prediction models for fault-prone software components. It is concluded that the results are very divergent, depending on how different parameters are chosen, which demonstrates the need for well-documented empirical studies to enable replication and use.","PeriodicalId":165815,"journal":{"name":"Proceedings Eighth IEEE Symposium on Software Metrics","volume":"556 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"42","resultStr":"{\"title\":\"Experience from replicating empirical studies on prediction models\",\"authors\":\"M. C. Ohlsson, P. Runeson\",\"doi\":\"10.1109/METRIC.2002.1011340\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"When conducting empirical studies, replications are important contributors to investigating the generality of the studies. By replicating a study in another context, we investigate what impact the specific environment has, related to the effect of the studied object. In this paper, we define different levels of replication to characterise the similarities and differences between an original study and a replication, with particular focus on prediction models for the identification of fault-prone software components. Further, we derive a set of issues and concerns which are important in order to enable replication of an empirical study and to enable practitioners to use the results. To illustrate the importance of the issues raised, a replication case study is presented in the domain of prediction models for fault-prone software components. It is concluded that the results are very divergent, depending on how different parameters are chosen, which demonstrates the need for well-documented empirical studies to enable replication and use.\",\"PeriodicalId\":165815,\"journal\":{\"name\":\"Proceedings Eighth IEEE Symposium on Software Metrics\",\"volume\":\"556 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-06-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"42\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings Eighth IEEE Symposium on Software Metrics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/METRIC.2002.1011340\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings Eighth IEEE Symposium on Software Metrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/METRIC.2002.1011340","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 42

摘要

在进行实证研究时，重复是调查研究普遍性的重要贡献者。通过在另一个环境中复制一项研究，我们调查了与研究对象的影响相关的特定环境的影响。在本文中，我们定义了不同级别的复制，以表征原始研究和复制之间的相似性和差异性，特别关注用于识别易出错软件组件的预测模型。此外，我们得出了一系列重要的问题和关注点，以便能够复制实证研究并使从业者能够使用结果。为了说明所提出问题的重要性，本文在易出错软件组件的预测模型领域提出了一个复制案例研究。结论是，根据选择不同的参数，结果非常不同，这表明需要有充分记录的实证研究来实现复制和使用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Experience from replicating empirical studies on prediction models

When conducting empirical studies, replications are important contributors to investigating the generality of the studies. By replicating a study in another context, we investigate what impact the specific environment has, related to the effect of the studied object. In this paper, we define different levels of replication to characterise the similarities and differences between an original study and a replication, with particular focus on prediction models for the identification of fault-prone software components. Further, we derive a set of issues and concerns which are important in order to enable replication of an empirical study and to enable practitioners to use the results. To illustrate the importance of the issues raised, a replication case study is presented in the domain of prediction models for fault-prone software components. It is concluded that the results are very divergent, depending on how different parameters are chosen, which demonstrates the need for well-documented empirical studies to enable replication and use.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings Eighth IEEE Symposium on Software Metrics

自引率

0.00%

发文量