利用余弦相似度预测缺陷解决时间

Pranjal Ambardekar, Anagha Jamthe, Mandar M. Chincholkar
{"title":"利用余弦相似度预测缺陷解决时间","authors":"Pranjal Ambardekar, Anagha Jamthe, Mandar M. Chincholkar","doi":"10.1109/ICODSE.2017.8285884","DOIUrl":null,"url":null,"abstract":"Defect resolution on time is one of the overriding project goals which cannot be neglected. Often projects suffer from missed deadlines due to open critical defects. This negatively impacts successful delivery of a product, resulting in loss of revenue and customer dissatisfaction. Predicting defect resolution time, though a daunting task, can alleviate this risk of missing targeted milestones. In this paper, the authors propose three supervised learning approaches leveraging cosine similarity measure, progressively improving the prediction for days to resolve (DTR) a defect. The prediction model uses historical defect data to estimate DTR for new similar defects. The first prediction approach leverages Naïve Bayes Classifier (NBC) to assess project risks by answering: Is quicker defect resolution feasible? The outcome of this analysis gives preliminary information on the resolution duration. To gain deeper insights on DTR, second approach utilizes similarity score between two defect summaries to predict DTR. To improve the prediction accuracy further, a third approach is shown, where predictions are based on statistical analysis on DTR of defects having same similarity scores. This approach yields lower error rates in predicting DTR for P2-High and P3-Medium defects, as compared to the second approach. Both the approaches however outperforms the simple approach, not involving supervised learning. These approaches can be applied over both open and closed source projects to reduce defect DTR.","PeriodicalId":366005,"journal":{"name":"2017 International Conference on Data and Software Engineering (ICoDSE)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Predicting defect resolution time using cosine similarity\",\"authors\":\"Pranjal Ambardekar, Anagha Jamthe, Mandar M. Chincholkar\",\"doi\":\"10.1109/ICODSE.2017.8285884\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Defect resolution on time is one of the overriding project goals which cannot be neglected. Often projects suffer from missed deadlines due to open critical defects. This negatively impacts successful delivery of a product, resulting in loss of revenue and customer dissatisfaction. Predicting defect resolution time, though a daunting task, can alleviate this risk of missing targeted milestones. In this paper, the authors propose three supervised learning approaches leveraging cosine similarity measure, progressively improving the prediction for days to resolve (DTR) a defect. The prediction model uses historical defect data to estimate DTR for new similar defects. The first prediction approach leverages Naïve Bayes Classifier (NBC) to assess project risks by answering: Is quicker defect resolution feasible? The outcome of this analysis gives preliminary information on the resolution duration. To gain deeper insights on DTR, second approach utilizes similarity score between two defect summaries to predict DTR. To improve the prediction accuracy further, a third approach is shown, where predictions are based on statistical analysis on DTR of defects having same similarity scores. This approach yields lower error rates in predicting DTR for P2-High and P3-Medium defects, as compared to the second approach. Both the approaches however outperforms the simple approach, not involving supervised learning. These approaches can be applied over both open and closed source projects to reduce defect DTR.\",\"PeriodicalId\":366005,\"journal\":{\"name\":\"2017 International Conference on Data and Software Engineering (ICoDSE)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Data and Software Engineering (ICoDSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICODSE.2017.8285884\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Data and Software Engineering (ICoDSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICODSE.2017.8285884","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

及时解决缺陷是最重要的项目目标之一,不能被忽视。项目经常因为公开的关键缺陷而错过截止日期。这对产品的成功交付产生了负面影响,导致收入损失和客户不满。预测缺陷解决时间,虽然是一项艰巨的任务,但可以减轻错过目标里程碑的风险。在本文中,作者提出了三种利用余弦相似度度量的监督学习方法,逐步提高了对缺陷的预测天数。预测模型使用历史缺陷数据来估计新的相似缺陷的DTR。第一种预测方法利用Naïve贝叶斯分类器(NBC)通过回答以下问题来评估项目风险:更快的缺陷解决是否可行?这一分析的结果提供了有关决议持续时间的初步资料。为了更深入地了解DTR,第二种方法利用两个缺陷摘要之间的相似性评分来预测DTR。为了进一步提高预测精度,本文给出了第三种方法,该方法基于对具有相同相似分数的缺陷的DTR进行统计分析来进行预测。与第二种方法相比,这种方法在预测p2 -高和p3 -中等缺陷的DTR时产生较低的错误率。然而,这两种方法都优于不涉及监督学习的简单方法。这些方法可以应用于开源和闭源项目,以减少缺陷DTR。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting defect resolution time using cosine similarity
Defect resolution on time is one of the overriding project goals which cannot be neglected. Often projects suffer from missed deadlines due to open critical defects. This negatively impacts successful delivery of a product, resulting in loss of revenue and customer dissatisfaction. Predicting defect resolution time, though a daunting task, can alleviate this risk of missing targeted milestones. In this paper, the authors propose three supervised learning approaches leveraging cosine similarity measure, progressively improving the prediction for days to resolve (DTR) a defect. The prediction model uses historical defect data to estimate DTR for new similar defects. The first prediction approach leverages Naïve Bayes Classifier (NBC) to assess project risks by answering: Is quicker defect resolution feasible? The outcome of this analysis gives preliminary information on the resolution duration. To gain deeper insights on DTR, second approach utilizes similarity score between two defect summaries to predict DTR. To improve the prediction accuracy further, a third approach is shown, where predictions are based on statistical analysis on DTR of defects having same similarity scores. This approach yields lower error rates in predicting DTR for P2-High and P3-Medium defects, as compared to the second approach. Both the approaches however outperforms the simple approach, not involving supervised learning. These approaches can be applied over both open and closed source projects to reduce defect DTR.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信