Explaining building energy efficiency prediction through architectural and engineering solutions considering environmental impacts using a hybrid model
IF 6.6 2区 工程技术Q1 CONSTRUCTION & BUILDING TECHNOLOGY
{"title":"Explaining building energy efficiency prediction through architectural and engineering solutions considering environmental impacts using a hybrid model","authors":"Semi Emrah Aslay","doi":"10.1016/j.enbuild.2025.115614","DOIUrl":null,"url":null,"abstract":"<div><div>The aim of this study is to investigate building energy efficiency by integrating hybrid modelling approaches and interpretable models into architectural design processes and engineering solutions while considering environmental impacts. At the same time, it is aimed to achieve the most efficient building design possible. Separate analyses were carried out using a total of 6,913 data from Rutland (185) and Salford (6718) cities. The data were grouped into carbon emission information, architectural information, lighting information, personal heating information, and main heating system information to form a dataset. Light Gradient Boosting Machine (LightGBM) was preferred as the base model and Particle Swarm Optimisation (PSO) method was applied for hyperparameter optimisation. The hybrid model created in this way is called PSO-LightGBM. The optimization process was carried out using software in both R Studio and Python environments, utilizing seven different hyperparameters. Apart from the hybrid model used as a method, 2 different SHAP analyses, neural network based, and tree based, were performed to clearly explain the parameter relationships. The PSO-LightGBM hybrid model provided more successful predictions compared to the basic LightGBM model. While R<sup>2</sup> values improved between 0.82 and 0.90 in the Rutland dataset, this value increased from 0.8687 to 0.8901 for test data and from 0.8538 to 0.9091 for training data in the Salford dataset. R<sup>2</sup> values show an improvement of 7% in the Rutland dataset and maximum 6% in the Salford dataset. When the reduction in error rates is evaluated, it is found that the greatest improvement is in the Mean Squared Error (MSE) metric. MSE decreased by 17% in the Rutland dataset and by 4% in the Salford dataset. According to the SHAP Analysis results, CO<sub>2</sub> emissions have the largest impact on energy consumption, while primary fuel types, number heated rooms and individual heating systems are other important parameters. While the tree-based SHAP model is more sensitive to physical parameters, the neural network model is more sensitive to indirect relationships. In both analyses, the communal heating system type has the lowest impact. In order to improve building energy efficiency, high efficiency individual boiler systems should be preferred, architectural approaches that optimise the number of heated rooms and smart heating solutions should be used, and central heating systems should be modernised. The results highlight the effectiveness of hybrid modelling approaches with SHAP analyses based on different baselines to ensure the integration of environmental impacts, architectural design processes and engineering solutions in terms of building energy efficiency. Furthermore, the findings contribute to the importance of interdisciplinary work in buildings to improve energy efficiency. Future studies can focus on the development of building energy performance prediction models that can be explained by integrating various learning algorithms and different optimisation techniques, using regions with high cooling demand, large datasets and different building types.</div></div>","PeriodicalId":11641,"journal":{"name":"Energy and Buildings","volume":"337 ","pages":"Article 115614"},"PeriodicalIF":6.6000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and Buildings","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378778825003445","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The aim of this study is to investigate building energy efficiency by integrating hybrid modelling approaches and interpretable models into architectural design processes and engineering solutions while considering environmental impacts. At the same time, it is aimed to achieve the most efficient building design possible. Separate analyses were carried out using a total of 6,913 data from Rutland (185) and Salford (6718) cities. The data were grouped into carbon emission information, architectural information, lighting information, personal heating information, and main heating system information to form a dataset. Light Gradient Boosting Machine (LightGBM) was preferred as the base model and Particle Swarm Optimisation (PSO) method was applied for hyperparameter optimisation. The hybrid model created in this way is called PSO-LightGBM. The optimization process was carried out using software in both R Studio and Python environments, utilizing seven different hyperparameters. Apart from the hybrid model used as a method, 2 different SHAP analyses, neural network based, and tree based, were performed to clearly explain the parameter relationships. The PSO-LightGBM hybrid model provided more successful predictions compared to the basic LightGBM model. While R2 values improved between 0.82 and 0.90 in the Rutland dataset, this value increased from 0.8687 to 0.8901 for test data and from 0.8538 to 0.9091 for training data in the Salford dataset. R2 values show an improvement of 7% in the Rutland dataset and maximum 6% in the Salford dataset. When the reduction in error rates is evaluated, it is found that the greatest improvement is in the Mean Squared Error (MSE) metric. MSE decreased by 17% in the Rutland dataset and by 4% in the Salford dataset. According to the SHAP Analysis results, CO2 emissions have the largest impact on energy consumption, while primary fuel types, number heated rooms and individual heating systems are other important parameters. While the tree-based SHAP model is more sensitive to physical parameters, the neural network model is more sensitive to indirect relationships. In both analyses, the communal heating system type has the lowest impact. In order to improve building energy efficiency, high efficiency individual boiler systems should be preferred, architectural approaches that optimise the number of heated rooms and smart heating solutions should be used, and central heating systems should be modernised. The results highlight the effectiveness of hybrid modelling approaches with SHAP analyses based on different baselines to ensure the integration of environmental impacts, architectural design processes and engineering solutions in terms of building energy efficiency. Furthermore, the findings contribute to the importance of interdisciplinary work in buildings to improve energy efficiency. Future studies can focus on the development of building energy performance prediction models that can be explained by integrating various learning algorithms and different optimisation techniques, using regions with high cooling demand, large datasets and different building types.
期刊介绍:
An international journal devoted to investigations of energy use and efficiency in buildings
Energy and Buildings is an international journal publishing articles with explicit links to energy use in buildings. The aim is to present new research results, and new proven practice aimed at reducing the energy needs of a building and improving indoor environment quality.