REMS: Recommending Extract Method Refactoring Opportunities via Multi-view Representation of Code Property Graph

2023 IEEE/ACM 31st International Conference on Program Comprehension (ICPC) Pub Date : 2023-05-01 DOI:10.1109/ICPC58990.2023.00034

Di Cui, Qiangqiang Wang, Siqi Wang, Jianlei Chi, Jianan Li, Lu Wang, Qingshan Li

{"title":"REMS: Recommending Extract Method Refactoring Opportunities via Multi-view Representation of Code Property Graph","authors":"Di Cui, Qiangqiang Wang, Siqi Wang, Jianlei Chi, Jianan Li, Lu Wang, Qingshan Li","doi":"10.1109/ICPC58990.2023.00034","DOIUrl":null,"url":null,"abstract":"Extract Method is one of the most frequently performed refactoring operations for the decomposition of large and complex methods, which can also be combined with other refactoring operations to remove a variety of design flaws. Several Extract Method refactoring tools have been proposed based on the quantification of extraction criteria. To the best of our knowledge, state-of-the-art related techniques can be broadly divided into two categories: the first line is non-machine-learning-based approaches built on heuristics, and the second line is machine learning-based approaches built on historical data. Most of these approaches characterize the extraction criteria by deriving software metrics from fine-grained code properties. However, in most cases, these metrics can be challenging to concretize, and their selections and thresholds also largely rely on expert knowledge. Thus, in this paper, we propose an approach to automatically recommend Extract Method refactoring opportunities named REMS via mining multi-view representations from code property graph. We fuse various representations together using compact bilinear pooling and further train machine learning classifiers to guide the extraction of suitable lines of code as new method. We evaluate our approach on two publicly available datasets. The results show that our approach outperforms five state-of-the-art refactoring tools including GEMS, JExtract, SEMI, JDeodorant, and Segmentation in effectiveness and usefulness. Our approach demonstrates an increase of 29% in precision, 15% in recall, and 23% in f1-measure. The results also unveil practical suggestions and provide new insights that benefit additional extract-related refactoring techniques.","PeriodicalId":376593,"journal":{"name":"2023 IEEE/ACM 31st International Conference on Program Comprehension (ICPC)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/ACM 31st International Conference on Program Comprehension (ICPC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPC58990.2023.00034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Extract Method is one of the most frequently performed refactoring operations for the decomposition of large and complex methods, which can also be combined with other refactoring operations to remove a variety of design flaws. Several Extract Method refactoring tools have been proposed based on the quantification of extraction criteria. To the best of our knowledge, state-of-the-art related techniques can be broadly divided into two categories: the first line is non-machine-learning-based approaches built on heuristics, and the second line is machine learning-based approaches built on historical data. Most of these approaches characterize the extraction criteria by deriving software metrics from fine-grained code properties. However, in most cases, these metrics can be challenging to concretize, and their selections and thresholds also largely rely on expert knowledge. Thus, in this paper, we propose an approach to automatically recommend Extract Method refactoring opportunities named REMS via mining multi-view representations from code property graph. We fuse various representations together using compact bilinear pooling and further train machine learning classifiers to guide the extraction of suitable lines of code as new method. We evaluate our approach on two publicly available datasets. The results show that our approach outperforms five state-of-the-art refactoring tools including GEMS, JExtract, SEMI, JDeodorant, and Segmentation in effectiveness and usefulness. Our approach demonstrates an increase of 29% in precision, 15% in recall, and 23% in f1-measure. The results also unveil practical suggestions and provide new insights that benefit additional extract-related refactoring techniques.

查看原文本刊更多论文

REMS:通过代码属性图的多视图表示推荐提取方法重构机会

Extract Method是用于分解大型和复杂方法的最常执行的重构操作之一，它也可以与其他重构操作相结合，以消除各种设计缺陷。基于提取标准的量化，提出了几种提取方法重构工具。据我们所知，最先进的相关技术可以大致分为两类:第一类是基于启发式的非机器学习方法，第二类是基于历史数据的基于机器学习的方法。这些方法中的大多数通过从细粒度的代码属性中派生软件度量来描述提取标准。然而，在大多数情况下，这些指标很难具体化，它们的选择和阈值也很大程度上依赖于专家知识。因此，在本文中，我们提出了一种通过从代码属性图中挖掘多视图表示来自动推荐称为REMS的提取方法重构机会的方法。我们使用紧凑双线性池将各种表示融合在一起，并进一步训练机器学习分类器，以指导提取合适的代码行作为新方法。我们在两个公开可用的数据集上评估我们的方法。结果表明，我们的方法在有效性和有用性方面优于五种最先进的重构工具，包括GEMS、JExtract、SEMI、JDeodorant和Segmentation。我们的方法表明，准确率提高了29%，召回率提高了15%，f1-measure提高了23%。研究结果还揭示了一些实用的建议，并提供了新的见解，有助于其他与提取相关的重构技术。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 IEEE/ACM 31st International Conference on Program Comprehension (ICPC)

自引率

0.00%

发文量