Machine learning regression for estimating the cost range of building projects

Impact factor: 3.1. JCR quartile: Q2 (Construction & Building Technology)
A. Gurmu, Mani Pourdadash Miri
Journal: Construction Innovation-England. Publication date: 2023-06-22. Publication type: journal article.
DOI: 10.1108/ci-08-2022-0197 (https://doi.org/10.1108/ci-08-2022-0197)

Abstract

Purpose
Several factors influence the cost of buildings. Identifying the cost-significant factors can therefore help improve the accuracy of project cost forecasts during the planning phase. This paper aims to identify the cost-significant parameters and to explore the potential for improving the accuracy of building cost forecasts using machine learning techniques and large data sets.

Design/methodology/approach
Data sets from the Building Authority of the Australian State of Victoria were used. They comprise parameters such as the cost of the buildings, materials used, gross floor area (GFA) and type of building. Five machine learning regression models were applied: decision tree, linear regression, random forest, gradient boosting and k-nearest neighbor.

Findings
Among the chosen models, linear regression gave the worst outcome (R² = 0.38), while decision tree (R² = 0.66) and gradient boosting (R² = 0.62) gave the best outcomes. Among the analyzed features, the class of building explained about 34% of the variation, followed by GFA and walls, which both accounted for 26% of the variation.

Originality/value
The output of this research provides information on the factors that have major impacts on the cost of buildings in the Australian construction industry. The study revealed that the cost of buildings is strongly influenced by their class.
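The model comparison described in the abstract can be illustrated with a minimal sketch. It assumes scikit-learn and NumPy, and it uses synthetic stand-in data, not the Victorian data sets. The feature encodings, the cost formula, the hyperparameters and the train/test split are all hypothetical, so the scores it prints will not match the reported values. The abstract does not say how the share of variation per feature was computed; the impurity-based importance of a random forest is used here only as one possible reading.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor, RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsRegressor
from sklearn.tree import DecisionTreeRegressor

# Synthetic stand-in data (NOT the data used in the paper).
rng = np.random.default_rng(0)
n = 2000
building_class = rng.integers(0, 5, size=n)  # hypothetical integer-coded class
gfa = rng.uniform(50, 2000, size=n)          # gross floor area in m^2
wall_type = rng.integers(0, 3, size=n)       # hypothetical integer-coded wall material

# Assumed cost model: a rate per m^2 set by class and wall type, with noise.
rate = 1500 + 400 * building_class + 150 * wall_type
cost = rate * gfa * rng.lognormal(0.0, 0.3, size=n)

feature_names = ["building_class", "gfa", "wall_type"]
X = np.column_stack([building_class, gfa, wall_type])
X_train, X_test, y_train, y_test = train_test_split(
    X, cost, test_size=0.25, random_state=0
)

# The five model families named in the abstract; hyperparameters are assumptions.
models = {
    "decision tree": DecisionTreeRegressor(max_depth=6, random_state=0),
    "linear regression": LinearRegression(),
    "random forest": RandomForestRegressor(n_estimators=100, random_state=0),
    "gradient boosting": GradientBoostingRegressor(random_state=0),
    "k-nearest neighbor": KNeighborsRegressor(n_neighbors=5),
}

# Fit each model and score it on the held-out split with R^2.
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = r2_score(y_test, model.predict(X_test))

for name, score in sorted(scores.items(), key=lambda kv: kv[1]):
    print(f"{name:20s} R^2 = {score:.2f}")

# Impurity-based feature importances of the fitted random forest (they sum to 1).
importances = dict(
    zip(feature_names, models["random forest"].feature_importances_)
)
for name, value in importances.items():
    print(f"{name:20s} importance = {value:.2f}")
```

Scoring on a held-out split rather than on the training data keeps the comparison from rewarding models that simply memorize the training set.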
Source journal: Construction Innovation-England (Construction & Building Technology). CiteScore: 7.10. Self-citation rate: 12.10%. Articles published: 71.