Machine Learning for Accurate Software Development Cost Estimation in Economically and Technically Limited Environments

IF 2.4 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Mohammad Alauthman, Ahmad al-Qerem, Someah Alangari, Ali Mohd Ali, Ahmad Nabo, Amjad Aldweesh, Issam Jebreen, Ammar Almoman, Brij B. Gupta
{"title":"Machine Learning for Accurate Software Development Cost Estimation in Economically and Technically Limited Environments","authors":"Mohammad Alauthman, Ahmad al-Qerem, Someah Alangari, Ali Mohd Ali, Ahmad Nabo, Amjad Aldweesh, Issam Jebreen, Ammar Almoman, Brij B. Gupta","doi":"10.4018/ijssci.331753","DOIUrl":null,"url":null,"abstract":"Cost estimation for software development is crucial for project planning and management. Several regression models have been developed to predict software development costs, using historical datasets of previous projects. Accurate cost estimation in software development is heavily influenced by the relevance and quality of the cost estimation dataset and its suitability to the software development environment. The currently available cost estimation datasets are limited to North American and European environments, leaving a gap in the representation of other economically and technically constrained software industries. In this article, the authors evaluate the performance of regression models using the SEERA dataset, which highly represents these constrained environments. This study provides insights into selecting regression models for cost estimation in software development. It highlights the importance of using appropriate models based on the specific software development model and dataset used in the estimation process. In the performance evaluations of eight regression models, including elastic net, lasso regression, linear regression, neural network, RANSACRegressor, random forest, ride regression, and SVM, for cost estimation in different software models, along with correlation coefficients and accuracy indicators, were reported. The results showed that SVM and random forest indicated superior performance. However, the elastic net, lasso regression, linear regression, neural network, and RANSACRegressor models also demonstrated exemplary performance in cost estimation.","PeriodicalId":29913,"journal":{"name":"International Journal of Software Science and Computational Intelligence-IJSSCI","volume":null,"pages":null},"PeriodicalIF":2.4000,"publicationDate":"2023-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Software Science and Computational Intelligence-IJSSCI","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/ijssci.331753","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Cost estimation for software development is crucial for project planning and management. Several regression models have been developed to predict software development costs, using historical datasets of previous projects. Accurate cost estimation in software development is heavily influenced by the relevance and quality of the cost estimation dataset and its suitability to the software development environment. The currently available cost estimation datasets are limited to North American and European environments, leaving a gap in the representation of other economically and technically constrained software industries. In this article, the authors evaluate the performance of regression models using the SEERA dataset, which highly represents these constrained environments. This study provides insights into selecting regression models for cost estimation in software development. It highlights the importance of using appropriate models based on the specific software development model and dataset used in the estimation process. In the performance evaluations of eight regression models, including elastic net, lasso regression, linear regression, neural network, RANSACRegressor, random forest, ride regression, and SVM, for cost estimation in different software models, along with correlation coefficients and accuracy indicators, were reported. The results showed that SVM and random forest indicated superior performance. However, the elastic net, lasso regression, linear regression, neural network, and RANSACRegressor models also demonstrated exemplary performance in cost estimation.
机器学习在经济和技术有限的环境中进行准确的软件开发成本估算
软件开发的成本估算对于项目计划和管理是至关重要的。已经开发了几个回归模型来预测软件开发成本,使用以前项目的历史数据集。成本估算数据集的相关性和质量及其对软件开发环境的适应性对软件开发中成本估算的准确性有很大影响。目前可用的成本估算数据集仅限于北美和欧洲环境,在其他经济和技术受限的软件行业的代表性方面留下了空白。在本文中,作者使用SEERA数据集评估了回归模型的性能,该数据集高度代表了这些约束环境。这项研究提供了在软件开发中为成本估算选择回归模型的见解。它强调了在评估过程中使用基于特定软件开发模型和数据集的适当模型的重要性。本文报道了弹性网、lasso回归、线性回归、神经网络、RANSACRegressor、随机森林、ride回归、SVM等8种回归模型在不同软件模型下成本估算的性能评价,并给出了相关系数和精度指标。结果表明,支持向量机和随机森林具有较好的性能。然而,弹性网、套索回归、线性回归、神经网络和RANSACRegressor模型在成本估计方面也表现出了典型的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
27.60%
发文量
34
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信