Jie Dai, Qingshan Li, Shenglong Xie, Daizhen Li, Hua Chu
{"title":"PCG:用于错误分流的图协同过滤联合框架","authors":"Jie Dai, Qingshan Li, Shenglong Xie, Daizhen Li, Hua Chu","doi":"10.1002/smr.2673","DOIUrl":null,"url":null,"abstract":"<p>Bug triaging is a vital process in software maintenance, involving assigning bug reports to developers in the issue tracking system. Current studies predominantly treat automatic bug triaging as a classification task, categorizing bug reports using developers as labels. However, this approach deviates from the essence of triaging, which is establishing bug–developer correlations. These correlations should be explicitly leveraged, offering a more comprehensive and promising paradigm. Our bug triaging model utilizes graph collaborative filtering (GCF), a method known for handling correlations. However, GCF encounters two challenges in bug triaging: data sparsity in bug fixing records and semantic deficiency in exploiting input data. To address them, we propose PCG, an innovative framework that integrates prototype augmentation and contrastive learning with GCF. With bug triaging modeled as predicting links on the bipartite graph of bug–developer correlations, we introduce prototype clustering-based augmentation to mitigate data sparsity and devise a semantic contrastive learning task to overcome semantic deficiency. Extensive experiments against competitive baselines validate the superiority of PCG. This work may open new avenues for investigating correlations in bug triaging and related scenarios.</p>","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"36 9","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2024-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PCG: A joint framework of graph collaborative filtering for bug triaging\",\"authors\":\"Jie Dai, Qingshan Li, Shenglong Xie, Daizhen Li, Hua Chu\",\"doi\":\"10.1002/smr.2673\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Bug triaging is a vital process in software maintenance, involving assigning bug reports to developers in the issue tracking system. Current studies predominantly treat automatic bug triaging as a classification task, categorizing bug reports using developers as labels. However, this approach deviates from the essence of triaging, which is establishing bug–developer correlations. These correlations should be explicitly leveraged, offering a more comprehensive and promising paradigm. Our bug triaging model utilizes graph collaborative filtering (GCF), a method known for handling correlations. However, GCF encounters two challenges in bug triaging: data sparsity in bug fixing records and semantic deficiency in exploiting input data. To address them, we propose PCG, an innovative framework that integrates prototype augmentation and contrastive learning with GCF. With bug triaging modeled as predicting links on the bipartite graph of bug–developer correlations, we introduce prototype clustering-based augmentation to mitigate data sparsity and devise a semantic contrastive learning task to overcome semantic deficiency. Extensive experiments against competitive baselines validate the superiority of PCG. This work may open new avenues for investigating correlations in bug triaging and related scenarios.</p>\",\"PeriodicalId\":48898,\"journal\":{\"name\":\"Journal of Software-Evolution and Process\",\"volume\":\"36 9\",\"pages\":\"\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2024-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Software-Evolution and Process\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/smr.2673\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Software-Evolution and Process","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/smr.2673","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
PCG: A joint framework of graph collaborative filtering for bug triaging
Bug triaging is a vital process in software maintenance, involving assigning bug reports to developers in the issue tracking system. Current studies predominantly treat automatic bug triaging as a classification task, categorizing bug reports using developers as labels. However, this approach deviates from the essence of triaging, which is establishing bug–developer correlations. These correlations should be explicitly leveraged, offering a more comprehensive and promising paradigm. Our bug triaging model utilizes graph collaborative filtering (GCF), a method known for handling correlations. However, GCF encounters two challenges in bug triaging: data sparsity in bug fixing records and semantic deficiency in exploiting input data. To address them, we propose PCG, an innovative framework that integrates prototype augmentation and contrastive learning with GCF. With bug triaging modeled as predicting links on the bipartite graph of bug–developer correlations, we introduce prototype clustering-based augmentation to mitigate data sparsity and devise a semantic contrastive learning task to overcome semantic deficiency. Extensive experiments against competitive baselines validate the superiority of PCG. This work may open new avenues for investigating correlations in bug triaging and related scenarios.