Predicting construction project compliance with machine learning model: case study using Portuguese procurement data

IF 3.6 2区 工程技术 Q1 ENGINEERING, CIVIL
Luís Jacques de Sousa, João Poças Martins, Luís Sanhudo
{"title":"Predicting construction project compliance with machine learning model: case study using Portuguese procurement data","authors":"Luís Jacques de Sousa, João Poças Martins, Luís Sanhudo","doi":"10.1108/ecam-09-2023-0973","DOIUrl":null,"url":null,"abstract":"<h3>Purpose</h3>\n<p>Factors like bid price, submission time, and number of bidders influence the procurement process in public projects. These factors and the award criteria may impact the project’s financial compliance. Predicting budget compliance in construction projects has been traditionally challenging, but Machine Learning (ML) techniques have revolutionised estimations.</p><!--/ Abstract__block -->\n<h3>Design/methodology/approach</h3>\n<p>In this study, Portuguese Public Procurement Data (PPPData) was utilised as the model’s input. Notably, this dataset exhibited a substantial imbalance in the target feature. To address this issue, the study evaluated three distinct data balancing techniques: oversampling, undersampling, and the SMOTE method. Next, a comprehensive feature selection process was conducted, leading to the testing of five different algorithms for forecasting budget compliance. Finally, a secondary test was conducted, refining the features to include only those elements that procurement technicians can modify while also considering the two most accurate predictors identified in the previous test.</p><!--/ Abstract__block -->\n<h3>Findings</h3>\n<p>The findings indicate that employing the SMOTE method on the scraped data can achieve a balanced dataset. Furthermore, the results demonstrate that the Adam ANN algorithm outperformed others, boasting a precision rate of 68.1%.</p><!--/ Abstract__block -->\n<h3>Practical implications</h3>\n<p>The model can aid procurement technicians during the tendering phase by using historical data and analogous projects to predict performance.</p><!--/ Abstract__block -->\n<h3>Social implications</h3>\n<p>Although the study reveals that ML algorithms cannot accurately predict budget compliance using procurement data, they can still provide project owners with insights into the most suitable criteria, aiding decision-making. Further research should assess the model’s impact and capacity within the procurement workflow.</p><!--/ Abstract__block -->\n<h3>Originality/value</h3>\n<p>Previous research predominantly focused on forecasting budgets by leveraging data from the private construction execution phase. While some investigations incorporated procurement data, this study distinguishes itself by using an imbalanced dataset and anticipating compliance rather than predicting budgetary figures. The model predicts budget compliance by analysing qualitative and quantitative characteristics of public project contracts. The research paper explores various model architectures and data treatment techniques to develop a model to assist the Client in tender definition.</p><!--/ Abstract__block -->","PeriodicalId":11888,"journal":{"name":"Engineering, Construction and Architectural Management","volume":"185 1","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2024-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering, Construction and Architectural Management","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1108/ecam-09-2023-0973","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose

Factors like bid price, submission time, and number of bidders influence the procurement process in public projects. These factors and the award criteria may impact the project’s financial compliance. Predicting budget compliance in construction projects has been traditionally challenging, but Machine Learning (ML) techniques have revolutionised estimations.

Design/methodology/approach

In this study, Portuguese Public Procurement Data (PPPData) was utilised as the model’s input. Notably, this dataset exhibited a substantial imbalance in the target feature. To address this issue, the study evaluated three distinct data balancing techniques: oversampling, undersampling, and the SMOTE method. Next, a comprehensive feature selection process was conducted, leading to the testing of five different algorithms for forecasting budget compliance. Finally, a secondary test was conducted, refining the features to include only those elements that procurement technicians can modify while also considering the two most accurate predictors identified in the previous test.

Findings

The findings indicate that employing the SMOTE method on the scraped data can achieve a balanced dataset. Furthermore, the results demonstrate that the Adam ANN algorithm outperformed others, boasting a precision rate of 68.1%.

Practical implications

The model can aid procurement technicians during the tendering phase by using historical data and analogous projects to predict performance.

Social implications

Although the study reveals that ML algorithms cannot accurately predict budget compliance using procurement data, they can still provide project owners with insights into the most suitable criteria, aiding decision-making. Further research should assess the model’s impact and capacity within the procurement workflow.

Originality/value

Previous research predominantly focused on forecasting budgets by leveraging data from the private construction execution phase. While some investigations incorporated procurement data, this study distinguishes itself by using an imbalanced dataset and anticipating compliance rather than predicting budgetary figures. The model predicts budget compliance by analysing qualitative and quantitative characteristics of public project contracts. The research paper explores various model architectures and data treatment techniques to develop a model to assist the Client in tender definition.

利用机器学习模型预测建筑项目合规性:使用葡萄牙采购数据的案例研究
目的投标价格、提交时间和投标人数量等因素影响着公共项目的采购过程。这些因素和授标标准可能会影响项目的财务合规性。预测建筑项目的预算合规性历来具有挑战性,但机器学习(ML)技术已经彻底改变了估算方法。在本研究中,葡萄牙公共采购数据(PPPData)被用作模型的输入。值得注意的是,该数据集在目标特征方面表现出严重的不平衡。为解决这一问题,研究评估了三种不同的数据平衡技术:超采样、欠采样和 SMOTE 方法。接下来,进行了全面的特征选择过程,从而测试了五种不同的预算合规性预测算法。最后,进行了二次测试,对特征进行了改进,使其仅包括采购技术人员可以修改的元素,同时还考虑了前一次测试中确定的两个最准确的预测因素。此外,结果表明,Adam ANN 算法优于其他算法,精确率高达 68.1%。社会影响虽然该研究表明,ML 算法无法利用采购数据准确预测预算合规性,但仍可为项目所有人提供有关最合适标准的见解,从而有助于决策。进一步的研究应评估该模型在采购工作流程中的影响和能力。原创性/价值以前的研究主要集中在利用私人建筑执行阶段的数据预测预算。虽然有些研究结合了采购数据,但本研究通过使用不平衡数据集和预测预算数字而与众不同。该模型通过分析公共项目合同的定性和定量特征来预测预算合规性。研究论文探讨了各种模型架构和数据处理技术,以开发一个模型来协助客户进行投标定义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Engineering, Construction and Architectural Management
Engineering, Construction and Architectural Management Business, Management and Accounting-General Business,Management and Accounting
CiteScore
8.10
自引率
19.50%
发文量
226
期刊介绍: ECAM publishes original peer-reviewed research papers, case studies, technical notes, book reviews, features, discussions and other contemporary articles that advance research and practice in engineering, construction and architectural management. In particular, ECAM seeks to advance integrated design and construction practices, project lifecycle management, and sustainable construction. The journal’s scope covers all aspects of architectural design, design management, construction/project management, engineering management of major infrastructure projects, and the operation and management of constructed facilities. ECAM also addresses the technological, process, economic/business, environmental/sustainability, political, and social/human developments that influence the construction project delivery process. ECAM strives to establish strong theoretical and empirical debates in the above areas of engineering, architecture, and construction research. Papers should be heavily integrated with the existing and current body of knowledge within the field and develop explicit and novel contributions. Acknowledging the global character of the field, we welcome papers on regional studies but encourage authors to position the work within the broader international context by reviewing and comparing findings from their regional study with studies conducted in other regions or countries whenever possible.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信