Rodrigo Tavares Coimbra, Antônio Resende, Ricardo Terra
{"title":"Halstead复杂性度量与其他软件度量的相关性分析","authors":"Rodrigo Tavares Coimbra, Antônio Resende, Ricardo Terra","doi":"10.1109/CLEI.2018.00014","DOIUrl":null,"url":null,"abstract":"Halstead Complexity Measures, proposed in 1977, analyze a software system independently of its underlying programming language (technology) based on the measures number of operators and operands. From these two measures, it calculates other measures namely vocabulary, length, volume, difficulty, programming effort, errors, and testing time. The problem, nevertheless, is that since then the Academy and Industry have been coming up with hundreds of new metrics that differ in their assertions and calculations. Therefore, the objective of this paper is to present a correlation analysis between the eleven Halstead measures and other 27 popular measures proposed over the decades (e.g., LOC, cyclomatic complexity, and efferent coupling) through the inspection of 97 open-source Java systems in order to (i) identify redundancy in measures and (ii) minimize the costs of monitoring and diagnosing software projects, facilitating the task of making measurements. As a result, we identified strong correlations between Halstead measures and other measures, mainly related to size such as quantity of methods, packages, attributes, etc. We also identified direct correlation of Halstead measurements with coupling measures named Afferent and Efferent Coupling, with values ranging from 0.802 to 0.931, which are quite close to the maximum value 1 for a correlation. These results demonstrate that—although there is no perfect correlation—there is enough correlation to hypothesize that there is an overlap of measures with different denominations whose measured results are equivalent.","PeriodicalId":379986,"journal":{"name":"2018 XLIV Latin American Computer Conference (CLEI)","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Correlation Analysis between Halstead Complexity Measures and other Software Measures\",\"authors\":\"Rodrigo Tavares Coimbra, Antônio Resende, Ricardo Terra\",\"doi\":\"10.1109/CLEI.2018.00014\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Halstead Complexity Measures, proposed in 1977, analyze a software system independently of its underlying programming language (technology) based on the measures number of operators and operands. From these two measures, it calculates other measures namely vocabulary, length, volume, difficulty, programming effort, errors, and testing time. The problem, nevertheless, is that since then the Academy and Industry have been coming up with hundreds of new metrics that differ in their assertions and calculations. Therefore, the objective of this paper is to present a correlation analysis between the eleven Halstead measures and other 27 popular measures proposed over the decades (e.g., LOC, cyclomatic complexity, and efferent coupling) through the inspection of 97 open-source Java systems in order to (i) identify redundancy in measures and (ii) minimize the costs of monitoring and diagnosing software projects, facilitating the task of making measurements. As a result, we identified strong correlations between Halstead measures and other measures, mainly related to size such as quantity of methods, packages, attributes, etc. We also identified direct correlation of Halstead measurements with coupling measures named Afferent and Efferent Coupling, with values ranging from 0.802 to 0.931, which are quite close to the maximum value 1 for a correlation. These results demonstrate that—although there is no perfect correlation—there is enough correlation to hypothesize that there is an overlap of measures with different denominations whose measured results are equivalent.\",\"PeriodicalId\":379986,\"journal\":{\"name\":\"2018 XLIV Latin American Computer Conference (CLEI)\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 XLIV Latin American Computer Conference (CLEI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CLEI.2018.00014\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 XLIV Latin American Computer Conference (CLEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CLEI.2018.00014","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
1977年提出的Halstead复杂性度量,基于运算符和操作数的度量数,独立于其底层编程语言(技术)来分析软件系统。从这两个度量中,它计算其他度量,即词汇量、长度、容量、难度、编程工作、错误和测试时间。然而,问题是,从那时起,学术界和工业界已经提出了数百个在断言和计算上不同的新指标。因此,本文的目的是通过对97个开源Java系统的检查,提出11个Halstead度量与过去几十年来提出的其他27个流行度量(例如LOC、圈复杂度和发出耦合)之间的相关性分析,以便(i)识别度量中的冗余,(ii)最小化监视和诊断软件项目的成本,促进进行度量的任务。因此,我们确定了Halstead度量和其他度量之间的强相关性,主要与大小相关,例如方法、包、属性等的数量。我们还确定了Halstead测量值与传入和传出耦合(Afferent and Efferent coupling)的直接相关性,其值范围为0.802至0.931,非常接近相关性的最大值1。这些结果表明,尽管没有完美的相关性,但有足够的相关性来假设,具有不同名称的测量存在重叠,其测量结果是等效的。
A Correlation Analysis between Halstead Complexity Measures and other Software Measures
Halstead Complexity Measures, proposed in 1977, analyze a software system independently of its underlying programming language (technology) based on the measures number of operators and operands. From these two measures, it calculates other measures namely vocabulary, length, volume, difficulty, programming effort, errors, and testing time. The problem, nevertheless, is that since then the Academy and Industry have been coming up with hundreds of new metrics that differ in their assertions and calculations. Therefore, the objective of this paper is to present a correlation analysis between the eleven Halstead measures and other 27 popular measures proposed over the decades (e.g., LOC, cyclomatic complexity, and efferent coupling) through the inspection of 97 open-source Java systems in order to (i) identify redundancy in measures and (ii) minimize the costs of monitoring and diagnosing software projects, facilitating the task of making measurements. As a result, we identified strong correlations between Halstead measures and other measures, mainly related to size such as quantity of methods, packages, attributes, etc. We also identified direct correlation of Halstead measurements with coupling measures named Afferent and Efferent Coupling, with values ranging from 0.802 to 0.931, which are quite close to the maximum value 1 for a correlation. These results demonstrate that—although there is no perfect correlation—there is enough correlation to hypothesize that there is an overlap of measures with different denominations whose measured results are equivalent.