Jie Dai, Qingshan Li, Shenglong Xie, Daizhen Li, Hua Chu
{"title":"PCG: A joint framework of graph collaborative filtering for bug triaging","authors":"Jie Dai, Qingshan Li, Shenglong Xie, Daizhen Li, Hua Chu","doi":"10.1002/smr.2673","DOIUrl":null,"url":null,"abstract":"<p>Bug triaging is a vital process in software maintenance, involving assigning bug reports to developers in the issue tracking system. Current studies predominantly treat automatic bug triaging as a classification task, categorizing bug reports using developers as labels. However, this approach deviates from the essence of triaging, which is establishing bug–developer correlations. These correlations should be explicitly leveraged, offering a more comprehensive and promising paradigm. Our bug triaging model utilizes graph collaborative filtering (GCF), a method known for handling correlations. However, GCF encounters two challenges in bug triaging: data sparsity in bug fixing records and semantic deficiency in exploiting input data. To address them, we propose PCG, an innovative framework that integrates prototype augmentation and contrastive learning with GCF. With bug triaging modeled as predicting links on the bipartite graph of bug–developer correlations, we introduce prototype clustering-based augmentation to mitigate data sparsity and devise a semantic contrastive learning task to overcome semantic deficiency. Extensive experiments against competitive baselines validate the superiority of PCG. This work may open new avenues for investigating correlations in bug triaging and related scenarios.</p>","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"36 9","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2024-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Software-Evolution and Process","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/smr.2673","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Bug triaging is a vital process in software maintenance, involving assigning bug reports to developers in the issue tracking system. Current studies predominantly treat automatic bug triaging as a classification task, categorizing bug reports using developers as labels. However, this approach deviates from the essence of triaging, which is establishing bug–developer correlations. These correlations should be explicitly leveraged, offering a more comprehensive and promising paradigm. Our bug triaging model utilizes graph collaborative filtering (GCF), a method known for handling correlations. However, GCF encounters two challenges in bug triaging: data sparsity in bug fixing records and semantic deficiency in exploiting input data. To address them, we propose PCG, an innovative framework that integrates prototype augmentation and contrastive learning with GCF. With bug triaging modeled as predicting links on the bipartite graph of bug–developer correlations, we introduce prototype clustering-based augmentation to mitigate data sparsity and devise a semantic contrastive learning task to overcome semantic deficiency. Extensive experiments against competitive baselines validate the superiority of PCG. This work may open new avenues for investigating correlations in bug triaging and related scenarios.