A computer analysis tool for structural decomposition using entropy metrics

1979 International Workshop on Managing Requirements Knowledge (MARK) Pub Date : 1979-06-01 DOI:10.1109/MARK.1979.8817137

A. N. Silver

{"title":"A computer analysis tool for structural decomposition using entropy metrics","authors":"A. N. Silver","doi":"10.1109/MARK.1979.8817137","DOIUrl":null,"url":null,"abstract":"The decomposition of a metric space into successive subregions exhibiting distinctive characteristics is a problem of broad application. In pattern classification, the object is to partition the space such that pattern classes are easily separable; that is, so that each subregion of the partition contains predominantly samples of only one class. In piece-wise-constant approximation the decompositions produced contain samples whose values are sufficiently close to allow approximation with a specified degree of accuracy. In defining software it is quite often necessary to derive a structural model of a computer program which contains modules, i.e., partitions exhibiting the flow relations or connectivities among the elements (statements) in a program. The subsequent analysis and manipulation of the structural model produces useful design alternatives that enhance the operational qualities of the software generated in terms of program control, logic paths, data transfer and other relevant software issues. The basic feasibility of this approach has been demonstrated by numerous investigators. 1 - 5 However, the analytical and diagnostic tools for performing structural decompositions require further refinement and development. For example, the metrics usually used 6 , 7 for defining the topology of a given software structure are primarily single attribute measures. Although the entropy metric proposed in this paper is metrizable in terms of its hypergraph representation, 8 the extension to a multi-attribute unique formulation is, as yet, elusive. This is because an all-purpose problemindependent metric space places unrealizable constraints on the structure it proposes to define. Thus, as Koontz et al. 9 point out, even when a metric is given and a structure well known, the notion of neighboring points can not be rigorously defined for finite point sets from a computational point of view, since the simplest Euclidean distance measure must be scaled by a factor indicating its own respective distance to the nearest neighbor in order to avoid overlapping and ambiguous regions. Although conceptually, the construction of a neighborhood and the determination of the limit point of a sequence of real numbers is a widely used idea, a more fundamental requirement for metrizable hyper-spaces is that of specifying the existence of a limit point of a set. The resultant necessary and sufficient conditions for identifying metrizable spaces is given by Hausdorff. 10 However, equivalent normalizations and the use of discrete semi-metrics over a restricted space have precluded some of these inherent problems in the quest for such a unique, multi-attribute metric. Thus, the primary emphasis is to obtain realizable decompositions using readily-implementable metrics, as well as focus upon suitable partitioning alternatives in terms of identifying mathematically consistent criteria for structural decompositions.","PeriodicalId":341008,"journal":{"name":"1979 International Workshop on Managing Requirements Knowledge (MARK)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1979-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"1979 International Workshop on Managing Requirements Knowledge (MARK)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MARK.1979.8817137","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The decomposition of a metric space into successive subregions exhibiting distinctive characteristics is a problem of broad application. In pattern classification, the object is to partition the space such that pattern classes are easily separable; that is, so that each subregion of the partition contains predominantly samples of only one class. In piece-wise-constant approximation the decompositions produced contain samples whose values are sufficiently close to allow approximation with a specified degree of accuracy. In defining software it is quite often necessary to derive a structural model of a computer program which contains modules, i.e., partitions exhibiting the flow relations or connectivities among the elements (statements) in a program. The subsequent analysis and manipulation of the structural model produces useful design alternatives that enhance the operational qualities of the software generated in terms of program control, logic paths, data transfer and other relevant software issues. The basic feasibility of this approach has been demonstrated by numerous investigators. 1 - 5 However, the analytical and diagnostic tools for performing structural decompositions require further refinement and development. For example, the metrics usually used 6 , 7 for defining the topology of a given software structure are primarily single attribute measures. Although the entropy metric proposed in this paper is metrizable in terms of its hypergraph representation, 8 the extension to a multi-attribute unique formulation is, as yet, elusive. This is because an all-purpose problemindependent metric space places unrealizable constraints on the structure it proposes to define. Thus, as Koontz et al. 9 point out, even when a metric is given and a structure well known, the notion of neighboring points can not be rigorously defined for finite point sets from a computational point of view, since the simplest Euclidean distance measure must be scaled by a factor indicating its own respective distance to the nearest neighbor in order to avoid overlapping and ambiguous regions. Although conceptually, the construction of a neighborhood and the determination of the limit point of a sequence of real numbers is a widely used idea, a more fundamental requirement for metrizable hyper-spaces is that of specifying the existence of a limit point of a set. The resultant necessary and sufficient conditions for identifying metrizable spaces is given by Hausdorff. 10 However, equivalent normalizations and the use of discrete semi-metrics over a restricted space have precluded some of these inherent problems in the quest for such a unique, multi-attribute metric. Thus, the primary emphasis is to obtain realizable decompositions using readily-implementable metrics, as well as focus upon suitable partitioning alternatives in terms of identifying mathematically consistent criteria for structural decompositions.

查看原文本刊更多论文

使用熵度量进行结构分解的计算机分析工具

将度量空间分解为具有不同特征的连续子区域是一个应用广泛的问题。在模式分类中，目标是划分空间，使模式类易于分离;也就是说，分区的每个子区域只包含一个类的主要样本。在分段常数近似中，所产生的分解包含其值足够接近的样本，以允许以特定的精度进行近似。在定义软件时，通常有必要推导一个包含模块的计算机程序的结构模型，即显示程序中元素(语句)之间的流关系或连接的分区。随后对结构模型的分析和操作产生了有用的设计方案，这些方案在程序控制、逻辑路径、数据传输和其他相关软件问题方面提高了所生成软件的操作质量。这种方法的基本可行性已被许多研究者证明。然而，用于进行结构分解的分析和诊断工具需要进一步改进和发展。例如，通常用于定义给定软件结构拓扑的度量标准主要是单个属性度量。虽然本文中提出的熵度量在其超图表示方面是可度量的，但到目前为止，扩展到多属性唯一公式是难以捉摸的。这是因为一个通用的与问题无关的度量空间对它所要定义的结构施加了不可实现的约束。因此，正如Koontz et al. 9所指出的那样，即使给定了度量和已知的结构，从计算的角度来看，对于有限的点集，邻点的概念也不能严格定义，因为最简单的欧几里得距离度量必须通过一个因子来表示其各自与最近邻居的距离，以避免重叠和模糊区域。虽然从概念上讲，一个邻域的构造和实数序列极限点的确定是一个被广泛使用的思想，但对于可度量超空间的一个更基本的要求是指定一个集合的极限点的存在性。Hausdorff给出了识别可度量空间的充分必要条件。然而，在有限空间上的等价归一化和离散半度量的使用，在寻求这样一个唯一的，多属性度量时，排除了一些固有的问题。因此，主要的重点是使用易于实现的度量来获得可实现的分解，以及在为结构分解确定数学上一致的标准方面关注合适的划分替代方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

1979 International Workshop on Managing Requirements Knowledge (MARK)

自引率

0.00%

发文量