PredictPP: A Rank-Based Weighted Ensemble Model for Prediction of Software Project Productivity

IF 1.8 4区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING

Journal of Software-Evolution and Process Pub Date : 2025-10-08 DOI:10.1002/smr.70059

Suyash Shukla, Sandeep Kumar

{"title":"PredictPP: A Rank-Based Weighted Ensemble Model for Prediction of Software Project Productivity","authors":"Suyash Shukla, Sandeep Kumar","doi":"10.1002/smr.70059","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>Software effort estimation (SEE) determines the effort necessary to develop software. The researchers have been tending to SEE issues since the 1960s, and several methods have been created until the formulation of the function point (FP) and constructive cost estimation (COCOMO) methods. However, these methods are only useful for procedurally developed software, not modern object-oriented (OO) software. Because the use case is the widely used unit of an OO system, particularly in scenarios requiring structured and early-stage effort estimation, using the use case point (UCP) approach will help get accurate results. The UCP approach consists of size estimation (in UCP) and effort estimation with calculated size. This study focuses on effort estimation when the size (in UCP) is already known. The productivity of a project is one of the main components for estimating effort from the given size. The classical SEE models based on UCP utilized a fixed number of productivity values. So, the validity of classical approaches is a subject of disapproval because of static productivity values. Purposefully, we proposed a rank-based weighted ensemble model for productivity prediction that allows us to use flexible productivity values. We used learning techniques such as simple linear regression (SLR), Least Absolute Shrinkage and Selection Operator Regression (LR), ridge regression (RR), elastic net regression (ER), K-nearest neighbor (KNN), decision tree (DT), support vector regression (SVR), multilayer perceptron (MLP), bagging, and adaptive boosting for productivity prediction and compared them with the proposed model. Further, we used existing UCP prediction models and compared the proposed approach with them.</p>\n </div>","PeriodicalId":48898,"journal":{"name":"Journal of Software-Evolution and Process","volume":"37 10","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Software-Evolution and Process","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/smr.70059","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Software effort estimation (SEE) determines the effort necessary to develop software. The researchers have been tending to SEE issues since the 1960s, and several methods have been created until the formulation of the function point (FP) and constructive cost estimation (COCOMO) methods. However, these methods are only useful for procedurally developed software, not modern object-oriented (OO) software. Because the use case is the widely used unit of an OO system, particularly in scenarios requiring structured and early-stage effort estimation, using the use case point (UCP) approach will help get accurate results. The UCP approach consists of size estimation (in UCP) and effort estimation with calculated size. This study focuses on effort estimation when the size (in UCP) is already known. The productivity of a project is one of the main components for estimating effort from the given size. The classical SEE models based on UCP utilized a fixed number of productivity values. So, the validity of classical approaches is a subject of disapproval because of static productivity values. Purposefully, we proposed a rank-based weighted ensemble model for productivity prediction that allows us to use flexible productivity values. We used learning techniques such as simple linear regression (SLR), Least Absolute Shrinkage and Selection Operator Regression (LR), ridge regression (RR), elastic net regression (ER), K-nearest neighbor (KNN), decision tree (DT), support vector regression (SVR), multilayer perceptron (MLP), bagging, and adaptive boosting for productivity prediction and compared them with the proposed model. Further, we used existing UCP prediction models and compared the proposed approach with them.

Abstract Image

查看原文本刊更多论文

基于秩的软件项目生产力预测加权集成模型

软件工作量估算（SEE）决定了开发软件所需的工作量。自20世纪60年代以来，研究人员一直致力于SEE问题，并且在功能点（FP）和建设性成本估算（COCOMO）方法形成之前，已经创建了几种方法。然而，这些方法只适用于程序开发的软件，而不适用于现代面向对象（OO）软件。因为用例是OO系统中广泛使用的单元，特别是在需要结构化和早期工作评估的场景中，使用用例点（UCP）方法将有助于获得准确的结果。UCP方法包括规模估计（在UCP中）和计算规模的工作量估计。本研究关注的是在规模（在UCP中）已知的情况下的工作量估计。项目的生产率是估算给定规模的工作量的主要组成部分之一。基于UCP的经典SEE模型使用了固定数量的生产率值。因此，由于静态生产率值，经典方法的有效性受到了反对。有目的地，我们提出了一个基于等级的加权集成模型用于生产力预测，允许我们使用灵活的生产力值。我们使用简单线性回归（SLR）、最小绝对收缩和选择算子回归（LR）、脊回归（RR）、弹性网络回归（ER）、k近邻回归（KNN）、决策树（DT）、支持向量回归（SVR）、多层感知器（MLP）、装袋和自适应提升等学习技术进行生产率预测，并将它们与提出的模型进行比较。此外，我们使用现有的UCP预测模型，并将所提出的方法与它们进行了比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Software-Evolution and Process COMPUTER SCIENCE, SOFTWARE ENGINEERING-

自引率

10.00%

发文量

109