Cícero A. L. Pahins, Fabrício D'Morison, Thiago M. Rocha, Larissa M. Almeida, Arthur F. Batista, Diego F. Souza
{"title":"T-REC: Towards Accurate Bug Triage for Technical Groups","authors":"Cícero A. L. Pahins, Fabrício D'Morison, Thiago M. Rocha, Larissa M. Almeida, Arthur F. Batista, Diego F. Souza","doi":"10.1109/ICMLA.2019.00154","DOIUrl":null,"url":null,"abstract":"With ever-larger software development systems involving more people with different skills, it is necessary to think about the process of bug assignment to groups of developers and not just to a developer alone. This work aims to leverage Bug Triage Process by suggesting a list of specialized groups of developers, or Technical Groups (TG's), to be attributed to a new bug report, based on other bugs that are similar and have been resolved by these TG's in the past. In the dataset used in our experiments, the mean time to correctly assign bug reports to their proper TG is 14 days, and just by then, the bug fixing process starts. This is a critical problem for software development and management since issues tend to accumulate a high-resolution time, which compromises developer performance and deliveries. In order to enhance the Bug Triage Process, we propose T-REC, an auxiliary SW Project Management system that accurately and efficiently analyzes similar issues to provide personalized TG recommendation. T-REC is a method that ensemble Machine Learning (ML) and Information Retrieval (IR) algorithms to recommend a list of TG's to handle an issue. Our experiments show that T-REC recommendation reaches an overall Acc@1 of 50.9%, Acc@2 of 63.2%, Acc@5 of 76.1%, Acc@10 of 83.6%, and Acc@20 of 89.7%. To the best of our knowledge, our work is the first to associate multiple machine learning strategies (classifiers, attributes, and training history) on the prediction of specialized groups of developers. We validate our approach on a real-world dataset from a large company that comprises 9.5M mobile-related bug reports from January 2001 to January 2019.","PeriodicalId":436714,"journal":{"name":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2019.00154","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
With ever-larger software development systems involving more people with different skills, it is necessary to think about the process of bug assignment to groups of developers and not just to a developer alone. This work aims to leverage Bug Triage Process by suggesting a list of specialized groups of developers, or Technical Groups (TG's), to be attributed to a new bug report, based on other bugs that are similar and have been resolved by these TG's in the past. In the dataset used in our experiments, the mean time to correctly assign bug reports to their proper TG is 14 days, and just by then, the bug fixing process starts. This is a critical problem for software development and management since issues tend to accumulate a high-resolution time, which compromises developer performance and deliveries. In order to enhance the Bug Triage Process, we propose T-REC, an auxiliary SW Project Management system that accurately and efficiently analyzes similar issues to provide personalized TG recommendation. T-REC is a method that ensemble Machine Learning (ML) and Information Retrieval (IR) algorithms to recommend a list of TG's to handle an issue. Our experiments show that T-REC recommendation reaches an overall Acc@1 of 50.9%, Acc@2 of 63.2%, Acc@5 of 76.1%, Acc@10 of 83.6%, and Acc@20 of 89.7%. To the best of our knowledge, our work is the first to associate multiple machine learning strategies (classifiers, attributes, and training history) on the prediction of specialized groups of developers. We validate our approach on a real-world dataset from a large company that comprises 9.5M mobile-related bug reports from January 2001 to January 2019.