T-REC: Towards Accurate Bug Triage for Technical Groups

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA) Pub Date : 2019-12-01 DOI:10.1109/ICMLA.2019.00154

Cícero A. L. Pahins, Fabrício D'Morison, Thiago M. Rocha, Larissa M. Almeida, Arthur F. Batista, Diego F. Souza

{"title":"T-REC: Towards Accurate Bug Triage for Technical Groups","authors":"Cícero A. L. Pahins, Fabrício D'Morison, Thiago M. Rocha, Larissa M. Almeida, Arthur F. Batista, Diego F. Souza","doi":"10.1109/ICMLA.2019.00154","DOIUrl":null,"url":null,"abstract":"With ever-larger software development systems involving more people with different skills, it is necessary to think about the process of bug assignment to groups of developers and not just to a developer alone. This work aims to leverage Bug Triage Process by suggesting a list of specialized groups of developers, or Technical Groups (TG's), to be attributed to a new bug report, based on other bugs that are similar and have been resolved by these TG's in the past. In the dataset used in our experiments, the mean time to correctly assign bug reports to their proper TG is 14 days, and just by then, the bug fixing process starts. This is a critical problem for software development and management since issues tend to accumulate a high-resolution time, which compromises developer performance and deliveries. In order to enhance the Bug Triage Process, we propose T-REC, an auxiliary SW Project Management system that accurately and efficiently analyzes similar issues to provide personalized TG recommendation. T-REC is a method that ensemble Machine Learning (ML) and Information Retrieval (IR) algorithms to recommend a list of TG's to handle an issue. Our experiments show that T-REC recommendation reaches an overall Acc@1 of 50.9%, Acc@2 of 63.2%, Acc@5 of 76.1%, Acc@10 of 83.6%, and Acc@20 of 89.7%. To the best of our knowledge, our work is the first to associate multiple machine learning strategies (classifiers, attributes, and training history) on the prediction of specialized groups of developers. We validate our approach on a real-world dataset from a large company that comprises 9.5M mobile-related bug reports from January 2001 to January 2019.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2019.00154","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

With ever-larger software development systems involving more people with different skills, it is necessary to think about the process of bug assignment to groups of developers and not just to a developer alone. This work aims to leverage Bug Triage Process by suggesting a list of specialized groups of developers, or Technical Groups (TG's), to be attributed to a new bug report, based on other bugs that are similar and have been resolved by these TG's in the past. In the dataset used in our experiments, the mean time to correctly assign bug reports to their proper TG is 14 days, and just by then, the bug fixing process starts. This is a critical problem for software development and management since issues tend to accumulate a high-resolution time, which compromises developer performance and deliveries. In order to enhance the Bug Triage Process, we propose T-REC, an auxiliary SW Project Management system that accurately and efficiently analyzes similar issues to provide personalized TG recommendation. T-REC is a method that ensemble Machine Learning (ML) and Information Retrieval (IR) algorithms to recommend a list of TG's to handle an issue. Our experiments show that T-REC recommendation reaches an overall Acc@1 of 50.9%, Acc@2 of 63.2%, Acc@5 of 76.1%, Acc@10 of 83.6%, and Acc@20 of 89.7%. To the best of our knowledge, our work is the first to associate multiple machine learning strategies (classifiers, attributes, and training history) on the prediction of specialized groups of developers. We validate our approach on a real-world dataset from a large company that comprises 9.5M mobile-related bug reports from January 2001 to January 2019.

查看原文本刊更多论文

T-REC:为技术组提供准确的Bug分类

随着越来越大的软件开发系统涉及到更多拥有不同技能的人，有必要考虑将bug分配给开发人员团队的过程，而不仅仅是一个开发人员。这项工作的目的是通过建议一个专门的开发人员小组或技术小组(TG)的列表来利用Bug分类过程，这些小组要归因于一个新的Bug报告，该报告基于其他类似的Bug，并且这些TG在过去已经解决了这些Bug。在我们实验中使用的数据集中，正确地将bug报告分配到适当的TG的平均时间是14天，到那时，bug修复过程就开始了。这是软件开发和管理的一个关键问题，因为问题往往会累积一个高分辨率的时间，这会损害开发人员的性能和交付。为了加强Bug分类流程，我们提出了一个辅助的软件项目管理系统T-REC，它可以准确有效地分析类似问题，提供个性化的TG建议。T-REC是一种集成机器学习(ML)和信息检索(IR)算法来推荐处理问题的TG列表的方法。我们的实验表明，T-REC推荐总体达到Acc@1为50.9%，Acc@2为63.2%，Acc@5为76.1%，Acc@10为83.6%，Acc@20为89.7%。据我们所知，我们的工作是第一个将多个机器学习策略(分类器、属性和训练历史)与专业开发人员群体的预测相关联的工作。我们在一家大公司的真实数据集上验证了我们的方法，该数据集包括2001年1月至2019年1月期间950万份与移动设备相关的bug报告。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)

自引率

0.00%

发文量