语义度量的形式化定义与自动生成:Bug预测的实证研究

2021 IEEE 21st International Working Conference on Source Code Analysis and Manipulation (SCAM) Pub Date : 2021-09-01 DOI:10.1109/SCAM52516.2021.00020

Ting Hu, Ran Mo, Pu Xiong, Zengyang Li, Qiong Feng

{"title":"语义度量的形式化定义与自动生成:Bug预测的实证研究","authors":"Ting Hu, Ran Mo, Pu Xiong, Zengyang Li, Qiong Feng","doi":"10.1109/SCAM52516.2021.00020","DOIUrl":null,"url":null,"abstract":"Bug prediction is helpful for facilitating bug fixes and improving the efficiency in software development and maintenance. In the past decades, researchers have proposed numerous studies on bug prediction by using code metrics. However, most of the existing studies use syntax-based metrics, there exists little work building bug prediction models with semantic metrics from source code. In this paper, we propose a new model, semantic dependency graph (SDG), to represent semantic relationships among source files. Based on the SDG, we formally define a suite of semantic metrics reflecting semantic characteristics of a project’s source files. Moreover, we create a tool to automate the generation of our proposed SDG-based metrics. Through our experimental studies, we have demonstrated that the SDG-based semantic metrics are effective for building bug prediction models, and the SDG-based metrics outperform traditional syntactic metrics on bug prediction. In addition, models using the SDG-based metrics could achieve a better prediction performance than two state-of-the-art models that learn semantic features automatically. Finally, we have also presented that our approach is applicable in practice in terms of execution time and space.","PeriodicalId":380117,"journal":{"name":"2021 IEEE 21st International Working Conference on Source Code Analysis and Manipulation (SCAM)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Formal Definition and Automatic Generation of Semantic Metrics: An Empirical Study on Bug Prediction\",\"authors\":\"Ting Hu, Ran Mo, Pu Xiong, Zengyang Li, Qiong Feng\",\"doi\":\"10.1109/SCAM52516.2021.00020\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Bug prediction is helpful for facilitating bug fixes and improving the efficiency in software development and maintenance. In the past decades, researchers have proposed numerous studies on bug prediction by using code metrics. However, most of the existing studies use syntax-based metrics, there exists little work building bug prediction models with semantic metrics from source code. In this paper, we propose a new model, semantic dependency graph (SDG), to represent semantic relationships among source files. Based on the SDG, we formally define a suite of semantic metrics reflecting semantic characteristics of a project’s source files. Moreover, we create a tool to automate the generation of our proposed SDG-based metrics. Through our experimental studies, we have demonstrated that the SDG-based semantic metrics are effective for building bug prediction models, and the SDG-based metrics outperform traditional syntactic metrics on bug prediction. In addition, models using the SDG-based metrics could achieve a better prediction performance than two state-of-the-art models that learn semantic features automatically. Finally, we have also presented that our approach is applicable in practice in terms of execution time and space.\",\"PeriodicalId\":380117,\"journal\":{\"name\":\"2021 IEEE 21st International Working Conference on Source Code Analysis and Manipulation (SCAM)\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 21st International Working Conference on Source Code Analysis and Manipulation (SCAM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SCAM52516.2021.00020\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 21st International Working Conference on Source Code Analysis and Manipulation (SCAM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCAM52516.2021.00020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

Bug预测有助于简化Bug修复，提高软件开发和维护的效率。在过去的几十年里，研究人员提出了许多关于使用代码度量来预测bug的研究。然而，现有的研究大多使用基于语法的度量，很少有研究利用源代码中的语义度量来构建bug预测模型。在本文中，我们提出了一个新的模型，语义依赖图(SDG)来表示源文件之间的语义关系。基于SDG，我们正式定义了一套反映项目源文件语义特征的语义度量。此外，我们创建了一个工具来自动生成我们提出的基于可持续发展目标的指标。通过实验研究，我们证明了基于sdg的语义度量对于构建bug预测模型是有效的，并且基于sdg的度量在bug预测方面优于传统的句法度量。此外，使用基于sdg的指标的模型可以获得比自动学习语义特征的两种最先进的模型更好的预测性能。最后，我们还展示了我们的方法在执行时间和空间方面在实践中是适用的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Formal Definition and Automatic Generation of Semantic Metrics: An Empirical Study on Bug Prediction

Bug prediction is helpful for facilitating bug fixes and improving the efficiency in software development and maintenance. In the past decades, researchers have proposed numerous studies on bug prediction by using code metrics. However, most of the existing studies use syntax-based metrics, there exists little work building bug prediction models with semantic metrics from source code. In this paper, we propose a new model, semantic dependency graph (SDG), to represent semantic relationships among source files. Based on the SDG, we formally define a suite of semantic metrics reflecting semantic characteristics of a project’s source files. Moreover, we create a tool to automate the generation of our proposed SDG-based metrics. Through our experimental studies, we have demonstrated that the SDG-based semantic metrics are effective for building bug prediction models, and the SDG-based metrics outperform traditional syntactic metrics on bug prediction. In addition, models using the SDG-based metrics could achieve a better prediction performance than two state-of-the-art models that learn semantic features automatically. Finally, we have also presented that our approach is applicable in practice in terms of execution time and space.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 IEEE 21st International Working Conference on Source Code Analysis and Manipulation (SCAM)

自引率

0.00%

发文量