Varying defect prediction approaches during project evolution: A preliminary investigation

2018 IEEE Workshop on Machine Learning Techniques for Software Quality Evaluation (MaLTeSQuE) Pub Date : 2018-03-20 DOI:10.1109/MALTESQUE.2018.8368451

Salvatore Geremia, D. Tamburri

{"title":"Varying defect prediction approaches during project evolution: A preliminary investigation","authors":"Salvatore Geremia, D. Tamburri","doi":"10.1109/MALTESQUE.2018.8368451","DOIUrl":null,"url":null,"abstract":"Defect prediction approaches use various features of software product or process to prioritize testing, analysis and general quality assurance activities. Such approaches require the availability of project's historical data, making them inapplicable in early phase. To cope with this problem, researchers have proposed cross-project and even cross-company prediction models, which use training material from other projects to build the model. Despite such advances, there is limited knowledge of how, as the project evolves, it would be convenient to still keep using data from other projects, and when, instead, it might become convenient to switch towards a local prediction model. This paper empirically investigates, using historical data from four open source projects, on how the performance of various kinds of defect prediction approaches — within-project prediction, local and global cross-project prediction, and mixed (injected local cross) prediction — varies over time. Results of the study are part of a long-term investigation towards supporting the customization of defect prediction models over projects' history.","PeriodicalId":345739,"journal":{"name":"2018 IEEE Workshop on Machine Learning Techniques for Software Quality Evaluation (MaLTeSQuE)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Workshop on Machine Learning Techniques for Software Quality Evaluation (MaLTeSQuE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MALTESQUE.2018.8368451","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

Defect prediction approaches use various features of software product or process to prioritize testing, analysis and general quality assurance activities. Such approaches require the availability of project's historical data, making them inapplicable in early phase. To cope with this problem, researchers have proposed cross-project and even cross-company prediction models, which use training material from other projects to build the model. Despite such advances, there is limited knowledge of how, as the project evolves, it would be convenient to still keep using data from other projects, and when, instead, it might become convenient to switch towards a local prediction model. This paper empirically investigates, using historical data from four open source projects, on how the performance of various kinds of defect prediction approaches — within-project prediction, local and global cross-project prediction, and mixed (injected local cross) prediction — varies over time. Results of the study are part of a long-term investigation towards supporting the customization of defect prediction models over projects' history.

查看原文本刊更多论文

在项目发展过程中变化缺陷预测方法:初步调查

缺陷预测方法使用软件产品或过程的各种特性来确定测试、分析和一般质量保证活动的优先级。这种方法需要项目历史数据的可用性，这使得它们在早期阶段不适用。为了解决这个问题，研究人员提出了跨项目甚至跨公司的预测模型，这些模型使用来自其他项目的培训材料来构建模型。尽管取得了这样的进步，但随着项目的发展，如何方便地继续使用其他项目的数据，以及何时可以方便地切换到本地预测模型，这些知识都是有限的。本文使用来自四个开源项目的历史数据，对各种缺陷预测方法的性能——项目内预测、局部和全局跨项目预测，以及混合(注入局部交叉)预测——如何随时间变化进行了实证调查。研究的结果是一个长期调查的一部分，它支持项目历史上缺陷预测模型的定制。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 IEEE Workshop on Machine Learning Techniques for Software Quality Evaluation (MaLTeSQuE)

自引率

0.00%

发文量