Blocking Bug Prediction Based on XGBoost with Enhanced Features

2020 IEEE 44th Annual Computers, Software, and Applications Conference (COMPSAC) Pub Date : 2020-07-01 DOI:10.1109/COMPSAC48688.2020.0-152

Xiao‐Liang Cheng, N. Liu, Lin Guo, Zhou Xu, Tao Zhang

{"title":"Blocking Bug Prediction Based on XGBoost with Enhanced Features","authors":"Xiao‐Liang Cheng, N. Liu, Lin Guo, Zhou Xu, Tao Zhang","doi":"10.1109/COMPSAC48688.2020.0-152","DOIUrl":null,"url":null,"abstract":"With a growing number of software projects, software quality is increasingly crucial. Researchers and engineers in the software engineering field often pay much attention to bug management tasks, such as bug localization, bug triage, and duplicate bug detection. However, there are few researchers to study blocking bug prediction. Blocking bugs prevent other bugs from being fixed and usually need more time to be fixed. Thus, developers need to identify blocking bugs and reduce the impact of blocking bugs. The previous studies utilized supervised algorithms to implement this task. However, they did not consider the dependencies among individual classifiers so that they cannot get the perfect accuracy for blocking bug prediction. In this paper, we propose a new framework XGBlocker that includes two stages. In the first stage, XGBlocker collects more features from bug reports to build an enhanced dataset. In the second stage, XGBlocker exploits XGBoost technique to construct an effective model to perform the prediction task. We conduct experiments on four projects with three evaluation metrics. The experimental results show that our method XGBlocker achieves promising performance compared with baseline methods in most cases. In detail, XGBlocker achieves F1-score, ER@20%, and AUC of up to 0.808, 0.944, and 0.975, respectively. On average across the four projects, XGBlocker improves F1-score, ER@20%, and AUC over the state-of-the-art method ELBlocker by 17.27%, 12.67%, and 4.85%, respectively.","PeriodicalId":430098,"journal":{"name":"2020 IEEE 44th Annual Computers, Software, and Applications Conference (COMPSAC)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 44th Annual Computers, Software, and Applications Conference (COMPSAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COMPSAC48688.2020.0-152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

With a growing number of software projects, software quality is increasingly crucial. Researchers and engineers in the software engineering field often pay much attention to bug management tasks, such as bug localization, bug triage, and duplicate bug detection. However, there are few researchers to study blocking bug prediction. Blocking bugs prevent other bugs from being fixed and usually need more time to be fixed. Thus, developers need to identify blocking bugs and reduce the impact of blocking bugs. The previous studies utilized supervised algorithms to implement this task. However, they did not consider the dependencies among individual classifiers so that they cannot get the perfect accuracy for blocking bug prediction. In this paper, we propose a new framework XGBlocker that includes two stages. In the first stage, XGBlocker collects more features from bug reports to build an enhanced dataset. In the second stage, XGBlocker exploits XGBoost technique to construct an effective model to perform the prediction task. We conduct experiments on four projects with three evaluation metrics. The experimental results show that our method XGBlocker achieves promising performance compared with baseline methods in most cases. In detail, XGBlocker achieves F1-score, ER@20%, and AUC of up to 0.808, 0.944, and 0.975, respectively. On average across the four projects, XGBlocker improves F1-score, ER@20%, and AUC over the state-of-the-art method ELBlocker by 17.27%, 12.67%, and 4.85%, respectively.

查看原文本刊更多论文

基于增强功能的XGBoost阻塞Bug预测

随着软件项目数量的增加，软件质量变得越来越重要。软件工程领域的研究人员和工程师经常关注缺陷管理任务，例如缺陷定位、缺陷分类和重复缺陷检测。然而，目前对阻塞bug预测的研究还很少。阻塞bug会阻碍其他bug的修复，并且通常需要更多的时间来修复。因此，开发人员需要识别阻塞错误并减少阻塞错误的影响。先前的研究利用监督算法来实现这一任务。然而，他们没有考虑各个分类器之间的依赖关系，因此他们无法获得阻止错误预测的完美准确性。在本文中，我们提出了一个新的框架XGBlocker，它包括两个阶段。在第一阶段，XGBlocker从bug报告中收集更多特性，以构建增强的数据集。第二阶段，XGBlocker利用XGBoost技术构建一个有效的模型来执行预测任务。我们在四个项目上用三个评估指标进行实验。实验结果表明，在大多数情况下，与基线方法相比，我们的XGBlocker方法取得了令人满意的性能。其中，XGBlocker达到F1-score, ER@20%， AUC分别高达0.808,0.944,0.975。在四个项目中，平均而言，XGBlocker比最先进的ELBlocker方法分别提高了f1分数，ER@20%和AUC，分别提高了17.27%，12.67%和4.85%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE 44th Annual Computers, Software, and Applications Conference (COMPSAC)

自引率

0.00%

发文量