Pre-Trained Model-Based Software Defect Prediction for Edge-Cloud Systems

IF 0.7 4区计算机科学 Q4 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Journal of Web Engineering Pub Date : 2023-03-01 DOI:10.13052/jwe1540-9589.2223

Sunjae Kwon;Sungu Lee;Duksan Ryu;Jongmoon Baik

{"title":"Pre-Trained Model-Based Software Defect Prediction for Edge-Cloud Systems","authors":"Sunjae Kwon;Sungu Lee;Duksan Ryu;Jongmoon Baik","doi":"10.13052/jwe1540-9589.2223","DOIUrl":null,"url":null,"abstract":"Edge-cloud computing is a distributed computing infrastructure that brings computation and data storage with low latency closer to clients. As interest in edge-cloud systems grows, research on testing the systems has also been actively studied. However, as with traditional systems, the amount of resources for testing is always limited. Thus, we suggest a function-level just-in-time (JIT) software defect prediction (SDP) model based on a pre-trained model to address the limitation by prioritizing the limited testing resources for the defect-prone functions. The pre-trained model is a transformer-based deep learning model trained on a large corpus of code snippets, and the fine-tuned pre-trained model can provide the defect proneness for the changed functions at a commit level. We evaluate the performance of the three popular pre-trained models (i.e., CodeBERT, GraphCodeBERT, UniXCoder) on edge-cloud systems in within-project and cross-project environments. To the best of our knowledge, it is the first attempt to analyse the performance of the three pre-trained model-based SDP models for edge-cloud systems. As a result, we can confirm that UniXCoder showed the best performance among the three in the WPDP environment. However, we also confirm that additional research is necessary to apply the SDP models to the CPDP environment.","PeriodicalId":49952,"journal":{"name":"Journal of Web Engineering","volume":"22 2","pages":"255-278"},"PeriodicalIF":0.7000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/iel7/10243554/10243559/10247502.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Web Engineering","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10247502/","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Edge-cloud computing is a distributed computing infrastructure that brings computation and data storage with low latency closer to clients. As interest in edge-cloud systems grows, research on testing the systems has also been actively studied. However, as with traditional systems, the amount of resources for testing is always limited. Thus, we suggest a function-level just-in-time (JIT) software defect prediction (SDP) model based on a pre-trained model to address the limitation by prioritizing the limited testing resources for the defect-prone functions. The pre-trained model is a transformer-based deep learning model trained on a large corpus of code snippets, and the fine-tuned pre-trained model can provide the defect proneness for the changed functions at a commit level. We evaluate the performance of the three popular pre-trained models (i.e., CodeBERT, GraphCodeBERT, UniXCoder) on edge-cloud systems in within-project and cross-project environments. To the best of our knowledge, it is the first attempt to analyse the performance of the three pre-trained model-based SDP models for edge-cloud systems. As a result, we can confirm that UniXCoder showed the best performance among the three in the WPDP environment. However, we also confirm that additional research is necessary to apply the SDP models to the CPDP environment.

查看原文本刊更多论文

基于预训练模型的边缘云系统软件缺陷预测

边缘云计算是一种分布式计算基础设施，它使低延迟的计算和数据存储更接近客户端。随着人们对边缘云系统的兴趣不断增长，对测试系统的研究也得到了积极的研究。然而，与传统系统一样，用于测试的资源量总是有限的。因此，我们提出了一个基于预先训练的模型的功能级实时（JIT）软件缺陷预测（SDP）模型，通过对易出现缺陷的功能的有限测试资源进行优先级排序来解决限制。预训练的模型是在大型代码片段语料库上训练的基于转换器的深度学习模型，并且经过微调的预训练模型可以在提交级别为更改后的函数提供缺陷倾向性。我们评估了三种流行的预训练模型（即CodeBERT、GraphCodeBERT和UniXCoder）在项目内和跨项目环境中对边缘云系统的性能。据我们所知，这是第一次尝试分析边缘云系统的三个预先训练的基于模型的SDP模型的性能。因此，我们可以确认，UniXCoder在WPDP环境中表现出了三者中最好的性能。然而，我们也确认，有必要进行额外的研究，将SDP模型应用于CPDP环境。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Web Engineering 工程技术-计算机：理论方法

CiteScore

1.80

自引率

12.50%

发文量

审稿时长

9 months

期刊介绍： The World Wide Web and its associated technologies have become a major implementation and delivery platform for a large variety of applications, ranging from simple institutional information Web sites to sophisticated supply-chain management systems, financial applications, e-government, distance learning, and entertainment, among others. Such applications, in addition to their intrinsic functionality, also exhibit the more complex behavior of distributed applications.