Automated construction of reference model for software remodularization through software evolution

IF 1.7 4区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Journal of Software-Evolution and Process Pub Date : 2024-06-19 DOI:10.1002/smr.2700

Fanyi Meng, Hai Yu, Chun Yong Chong, Ying Wang, Zhiliang Zhu

{"title":"Automated construction of reference model for software remodularization through software evolution","authors":"Fanyi Meng, Hai Yu, Chun Yong Chong, Ying Wang, Zhiliang Zhu","doi":"10.1002/smr.2700","DOIUrl":null,"url":null,"abstract":"The undocumented evolution of a software project and its underlying architecture underscores the need to recover the architecture from the software's implementation-level artifacts. Despite the existence of various software remodularization techniques, they often suffer from inaccuracies, and evaluating their effectiveness is challenging due to the absence of accurate “ground-truth” architectures or reference models. Prior studies on reference model construction are time-consuming and labor-intensive as it heavily relies on manual analysis by domain experts. Besides, other existing approaches that directly utilize the directory or package structure of the latest version can be unreliable, lacking in-depth analysis of the employed software structure. To address the above limitations, in this paper, we propose Automated Construction of Reference Model (ACRM), an approach for automatically constructing reference models by assigning weights to classes for various software projects using the metadata of all software versions and historical maintenance records. We evaluate ACRM through both quantitative and qualitative analyses. The experiment results provide quantitative validation and show that the generated reference models are reasonable, as confirmed by the relationship between proposed reference models and architectural smells or bugs. Furthermore, we conduct a survey among the practitioners from industry, to gain insights from practitioners' practices and further validate the generated reference models. The survey shows that, on average, 87% of the participants agree with the reference models generated by ACRM. Moreover, we propose an improved metric, wc2c, which analyzes the strengths and weaknesses of different types of software clustering techniques using the proposed reference models of the given software. Finally, we discuss the potential benefits of using ACRM in analyzed projects, particularly in terms of improving software quality, reducing maintenance costs, and enhancing developer productivity.","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"36 10","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Software-Evolution and Process","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/smr.2700","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

The undocumented evolution of a software project and its underlying architecture underscores the need to recover the architecture from the software's implementation-level artifacts. Despite the existence of various software remodularization techniques, they often suffer from inaccuracies, and evaluating their effectiveness is challenging due to the absence of accurate “ground-truth” architectures or reference models. Prior studies on reference model construction are time-consuming and labor-intensive as it heavily relies on manual analysis by domain experts. Besides, other existing approaches that directly utilize the directory or package structure of the latest version can be unreliable, lacking in-depth analysis of the employed software structure. To address the above limitations, in this paper, we propose Automated Construction of Reference Model (ACRM), an approach for automatically constructing reference models by assigning weights to classes for various software projects using the metadata of all software versions and historical maintenance records. We evaluate ACRM through both quantitative and qualitative analyses. The experiment results provide quantitative validation and show that the generated reference models are reasonable, as confirmed by the relationship between proposed reference models and architectural smells or bugs. Furthermore, we conduct a survey among the practitioners from industry, to gain insights from practitioners' practices and further validate the generated reference models. The survey shows that, on average, 87% of the participants agree with the reference models generated by ACRM. Moreover, we propose an improved metric, wc2c, which analyzes the strengths and weaknesses of different types of software clustering techniques using the proposed reference models of the given software. Finally, we discuss the potential benefits of using ACRM in analyzed projects, particularly in terms of improving software quality, reducing maintenance costs, and enhancing developer productivity.

查看原文本刊更多论文

通过软件进化自动构建软件重模块化参考模型

软件项目及其底层体系结构的演变无据可查，这就凸显了从软件的实现级工件中恢复体系结构的必要性。尽管存在各种软件重模块化技术，但由于缺乏准确的 "地面实况 "架构或参考模型，这些技术往往存在误差，而且评估这些技术的有效性也具有挑战性。先前关于参考模型构建的研究耗时耗力，因为它严重依赖领域专家的人工分析。此外，现有的其他方法直接利用最新版本的目录或软件包结构，缺乏对所使用软件结构的深入分析，因此并不可靠。针对上述局限性，我们在本文中提出了自动构建参考模型（ACRM），这是一种利用所有软件版本的元数据和历史维护记录为不同软件项目的类分配权重，从而自动构建参考模型的方法。我们通过定量和定性分析对 ACRM 进行了评估。实验结果提供了定量验证，并表明所生成的参考模型是合理的，这一点可以从所提出的参考模型与架构缺陷或错误之间的关系中得到证实。此外，我们还对行业从业人员进行了调查，以深入了解从业人员的实践，进一步验证生成的参考模型。调查显示，平均有 87% 的参与者同意 ACRM 生成的参考模型。此外，我们还提出了一种改进的度量方法--wc2c，它可以利用所提出的给定软件参考模型来分析不同类型软件聚类技术的优缺点。最后，我们讨论了在分析项目中使用 ACRM 的潜在好处，特别是在提高软件质量、降低维护成本和提高开发人员工作效率方面。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Software-Evolution and Process COMPUTER SCIENCE, SOFTWARE ENGINEERING-

自引率

10.00%

发文量

109