On measuring Java software

Australasian Computer Science Conference Pub Date : 1900-01-01 DOI:10.1145/1378279.1378283

E. Tempero

{"title":"On measuring Java software","authors":"E. Tempero","doi":"10.1145/1378279.1378283","DOIUrl":null,"url":null,"abstract":"Software metrics have a reputation in industry of not being very useful. I believe one reason for this is that for most metrics one important aspect of them is usually not provided, namely the \"entity population model\". In measurement theory, an entity population model defines the typical values for measurements from a metric for a given set of entities. Having these models is necessary in order to interpret the measurements. For example, without knowing the entity population model for the body temperature of humans we would not know that someone with a temperature of 40 degrees would be a cause for concern. In order for software metrics to be useful we need to have a good understanding of their entity population models. \n \nIn fact, we know very little about the entity population models for software metrics for anything but the simplest forms of measurements. We do have speculations, expectations, and even some theories as to what they should be, but there has been very little data published that can help us know which are correct and which are not. There are various reasons why we do not have this data. Often it is because we do not know how to measure something, reuse for example. Sometimes there is disagreement as to what to measure - there are more than 20 metrics for cohesion of object-oriented software for example. But it is also the case that we simply have not made a consistent and sustained attempt to make and report such measurements. The few empirical studies that do exist suffer from lacking sufficient detail to allow them to be reproduced, or are from such a small sample that little can be determined from them. This is the situation I and others are trying to change. \n \nIn this talk I will discuss my experience in measuring Java software. I have found that just measuring a large collection of software provides interesting insights as to the state of current software development. It seems that no matter what is measured, the results are usually interesting and sometimes surprising. I will present some of these results. I will also discuss the issues involved in doing this kind of research. One such issue is making measurements that are reproducible. To address this issue, I advocate basing software metrics research on the use of standard software corpora, that is, creating collections of software whose contents are well-defined. However creating such a corpus is not just a matter of downloading stuff off the 'net. I discuss some of the difficulties that arise in developing a corpus of open source Java software.","PeriodicalId":136130,"journal":{"name":"Australasian Computer Science Conference","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Australasian Computer Science Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/1378279.1378283","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

Software metrics have a reputation in industry of not being very useful. I believe one reason for this is that for most metrics one important aspect of them is usually not provided, namely the "entity population model". In measurement theory, an entity population model defines the typical values for measurements from a metric for a given set of entities. Having these models is necessary in order to interpret the measurements. For example, without knowing the entity population model for the body temperature of humans we would not know that someone with a temperature of 40 degrees would be a cause for concern. In order for software metrics to be useful we need to have a good understanding of their entity population models. In fact, we know very little about the entity population models for software metrics for anything but the simplest forms of measurements. We do have speculations, expectations, and even some theories as to what they should be, but there has been very little data published that can help us know which are correct and which are not. There are various reasons why we do not have this data. Often it is because we do not know how to measure something, reuse for example. Sometimes there is disagreement as to what to measure - there are more than 20 metrics for cohesion of object-oriented software for example. But it is also the case that we simply have not made a consistent and sustained attempt to make and report such measurements. The few empirical studies that do exist suffer from lacking sufficient detail to allow them to be reproduced, or are from such a small sample that little can be determined from them. This is the situation I and others are trying to change. In this talk I will discuss my experience in measuring Java software. I have found that just measuring a large collection of software provides interesting insights as to the state of current software development. It seems that no matter what is measured, the results are usually interesting and sometimes surprising. I will present some of these results. I will also discuss the issues involved in doing this kind of research. One such issue is making measurements that are reproducible. To address this issue, I advocate basing software metrics research on the use of standard software corpora, that is, creating collections of software whose contents are well-defined. However creating such a corpus is not just a matter of downloading stuff off the 'net. I discuss some of the difficulties that arise in developing a corpus of open source Java software.

查看原文本刊更多论文

关于度量Java软件

软件度量标准在业界被认为不是很有用。我认为造成这种情况的一个原因是，对于大多数指标来说，它们通常没有提供一个重要的方面，即“实体填充模型”。在度量理论中，实体总体模型定义了给定一组实体的度量的典型值。为了解释测量结果，有这些模型是必要的。例如，如果不知道人类体温的实体人口模型，我们就不会知道体温为40度的人会引起关注。为了使软件度量有用，我们需要很好地理解它们的实体填充模型。事实上，除了最简单的度量形式外，我们对软件度量的实体总体模型知之甚少。我们确实有推测，期望，甚至一些理论，关于它们应该是什么，但很少有公布的数据可以帮助我们知道哪些是正确的，哪些是错误的。我们没有这些数据的原因有很多。通常是因为我们不知道如何度量某些东西，例如重用。有时在度量什么方面存在分歧——例如，面向对象软件的内聚性有20多个度量标准。但是，也有一种情况是，我们根本没有做出一致和持续的尝试来进行和报告这种测量。为数不多的实证研究由于缺乏足够的细节而无法复制，或者样本太小，无法从中得出什么结论。这是我和其他人试图改变的情况。在这次演讲中，我将讨论我在度量Java软件方面的经验。我发现，仅仅对大量软件进行度量，就可以对当前软件开发的状态提供有趣的见解。似乎无论测量什么，结果通常都很有趣，有时甚至令人惊讶。我将介绍其中的一些结果。我还将讨论做这种研究所涉及的问题。其中一个问题是进行可重复的测量。为了解决这个问题，我提倡将软件度量研究建立在使用标准软件语料库的基础上，也就是说，创建内容定义良好的软件集合。然而，创建这样一个语料库并不仅仅是从网上下载东西那么简单。我将讨论在开发开源Java软件语料库时出现的一些困难。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Australasian Computer Science Conference

自引率

0.00%

发文量