基于二元关联和逻辑回归分析的阻塞漏洞识别

2022 9th International Conference on Dependable Systems and Their Applications (DSA) Pub Date : 2022-08-01 DOI:10.1109/DSA56465.2022.00052

Zhihua Chen, Xiaolin Ju, Guilong Lu, Xiang Chen

{"title":"基于二元关联和逻辑回归分析的阻塞漏洞识别","authors":"Zhihua Chen, Xiaolin Ju, Guilong Lu, Xiang Chen","doi":"10.1109/DSA56465.2022.00052","DOIUrl":null,"url":null,"abstract":"Blocking bugs, a type of bugs that prevents other bugs from being fixed, significantly increase the fixed time of both themself and the blocked bugs. Thus, these blocking bugs bring a considerable negative impact on software evolution. Therefore, the timely identification of blocking bugs is essential for software maintenance. This paper proposes an approach based on Binary Relevance(BR) and Logistic Regression(LR) analysis, called BR-LR, to predict bugs' blocking and blocked labels. We first filter and build a dataset consisting of two sets with a specific type of blocking relationship based on the ideas of BR. Then, we extract several fields from the bug reports and train the model by applying the logistic regression analysis with the constructed dataset in the first step, resulting in two prediction models for bug blocked and blocking labels. Finally, our approach combines the two prediction results to identify whether the bug is blocking or blocked. We also conduct empirical studies on seven open-source projects to verify the effectiveness of our approach. The final experimental results show that our model performs better from a partially correct perspective and can accurately predict bug labels than benchmarks. Specifically, the average accuracy of our model is 54.86%, and the average F1-measure is 50.61 %.","PeriodicalId":208148,"journal":{"name":"2022 9th International Conference on Dependable Systems and Their Applications (DSA)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Blocking Bugs Identification via Binary Relevance and Logistic Regression Analysis\",\"authors\":\"Zhihua Chen, Xiaolin Ju, Guilong Lu, Xiang Chen\",\"doi\":\"10.1109/DSA56465.2022.00052\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Blocking bugs, a type of bugs that prevents other bugs from being fixed, significantly increase the fixed time of both themself and the blocked bugs. Thus, these blocking bugs bring a considerable negative impact on software evolution. Therefore, the timely identification of blocking bugs is essential for software maintenance. This paper proposes an approach based on Binary Relevance(BR) and Logistic Regression(LR) analysis, called BR-LR, to predict bugs' blocking and blocked labels. We first filter and build a dataset consisting of two sets with a specific type of blocking relationship based on the ideas of BR. Then, we extract several fields from the bug reports and train the model by applying the logistic regression analysis with the constructed dataset in the first step, resulting in two prediction models for bug blocked and blocking labels. Finally, our approach combines the two prediction results to identify whether the bug is blocking or blocked. We also conduct empirical studies on seven open-source projects to verify the effectiveness of our approach. The final experimental results show that our model performs better from a partially correct perspective and can accurately predict bug labels than benchmarks. Specifically, the average accuracy of our model is 54.86%, and the average F1-measure is 50.61 %.\",\"PeriodicalId\":208148,\"journal\":{\"name\":\"2022 9th International Conference on Dependable Systems and Their Applications (DSA)\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 9th International Conference on Dependable Systems and Their Applications (DSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DSA56465.2022.00052\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 9th International Conference on Dependable Systems and Their Applications (DSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DSA56465.2022.00052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

阻塞bug是一种阻止其他bug被修复的bug，它会显著增加自身和被阻塞bug的固定时间。因此，这些阻塞的bug对软件的进化带来了相当大的负面影响。因此，及时识别阻塞bug对于软件维护至关重要。本文提出了一种基于二进制相关性(BR)和逻辑回归(LR)分析的方法，称为BR-LR，用于预测错误的阻塞和阻塞标签。首先，我们基于BR的思想对具有特定类型阻塞关系的两组数据集进行过滤和构建。然后，我们从bug报告中提取几个字段，并对第一步构建的数据集进行逻辑回归分析对模型进行训练，得到bug blocked和blocking标签的两个预测模型。最后，我们的方法将两个预测结果结合起来，以确定bug是阻塞的还是阻塞的。我们还对七个开源项目进行了实证研究，以验证我们方法的有效性。最终的实验结果表明，从部分正确的角度来看，我们的模型表现更好，并且可以比基准测试更准确地预测bug标签。具体来说，我们的模型的平均准确率为54.86%，f1测量的平均准确率为50.61%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Blocking Bugs Identification via Binary Relevance and Logistic Regression Analysis

Blocking bugs, a type of bugs that prevents other bugs from being fixed, significantly increase the fixed time of both themself and the blocked bugs. Thus, these blocking bugs bring a considerable negative impact on software evolution. Therefore, the timely identification of blocking bugs is essential for software maintenance. This paper proposes an approach based on Binary Relevance(BR) and Logistic Regression(LR) analysis, called BR-LR, to predict bugs' blocking and blocked labels. We first filter and build a dataset consisting of two sets with a specific type of blocking relationship based on the ideas of BR. Then, we extract several fields from the bug reports and train the model by applying the logistic regression analysis with the constructed dataset in the first step, resulting in two prediction models for bug blocked and blocking labels. Finally, our approach combines the two prediction results to identify whether the bug is blocking or blocked. We also conduct empirical studies on seven open-source projects to verify the effectiveness of our approach. The final experimental results show that our model performs better from a partially correct perspective and can accurately predict bug labels than benchmarks. Specifically, the average accuracy of our model is 54.86%, and the average F1-measure is 50.61 %.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 9th International Conference on Dependable Systems and Their Applications (DSA)

自引率

0.00%

发文量