A Probabilistic Based Approach towards Software System Clustering

2010 14th European Conference on Software Maintenance and Reengineering Pub Date : 2010-03-15 DOI:10.1109/CSMR.2010.36

A. Corazza, S. Martino, G. Scanniello

引用次数: 64

Abstract

In this paper we present a clustering based approach to partition software systems into meaningful subsystems. In particular, the approach uses lexical information extracted from four zones in Java classes, which may provide a different contribution towards software systems partitioning. To automatically weigh these zones, we introduced a probabilistic model, and applied the Expectation-Maximization (EM) algorithm. To group classes according to the considered lexical information, we customized the well-known K-Medoids algorithm. To assess the approach and the implemented supporting system, we have conducted a case study on six open source software systems.

查看原文本刊更多论文

基于概率的软件系统聚类方法

本文提出了一种基于聚类的方法，将软件系统划分为有意义的子系统。特别是，该方法使用从Java类中的四个区域提取的词法信息，这可能为软件系统分区提供不同的贡献。为了自动权衡这些区域，我们引入了一个概率模型，并应用了期望最大化(EM)算法。为了根据考虑的词汇信息对类进行分组，我们定制了著名的k - mediids算法。为了评估该方法和实施的支援系统，我们对六个开放源码软件系统进行了个案研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2010 14th European Conference on Software Maintenance and Reengineering

自引率

0.00%

发文量