{"title":"基于XGBoost分位回归的可解释建筑能效预测","authors":"Sinem Guler Kangalli Uyar , Bilge Kagan Ozbay , Berker Dal","doi":"10.1016/j.enbuild.2025.115815","DOIUrl":null,"url":null,"abstract":"<div><div>This study focuses on predicting the Building Energy Performance Ratio (BEPR) of 3,594 residential buildings in Istanbul using machine learning algorithms. The main objective is to identify the factors affecting BEPR, examine their influence, and analyze how these factors differ across buildings with low and high energy efficiency. To achieve this, seven machine learning models were evaluated: Multiple Linear Regression (MLR), Decision Tree (DT), K-Nearest Neighbors (KNN), Support Vector Regression (SVR), Artificial Neural Networks (ANN), Random Forest (RF), and Extreme Gradient Boosting (XGBoost). The results show that XGBoost yields the highest accuracy among all models. To improve the interpretability of XGBoost, the Shapley Additive Explanations (SHAP) method was employed, enabling the assessment of the impact of various features (such as wall U-value, window U-value, and building age) on BEPR. The analysis revealed that building thermal properties and age are critical factors in determining BEPR. Additionally, by applying the XGBoost Quantile Regression (XGBoost-QR) algorithm, the distribution of BEPR across different quantiles (low, medium, and high) was analyzed more effectively. This approach demonstrated that the features influencing BEPR vary between buildings with low and high energy efficiency. Specifically, in the lower quantiles, structural features such as wall and window insulation have a greater impact on BEPR, whereas in the higher quantiles, building age and roof insulation become more influential. This research contributes to a better understanding of the determinants of residential energy performance, introduces the integration of XGBoost-QR into energy performance analysis, and offers valuable insights for enhancing energy efficiency strategies.</div></div>","PeriodicalId":11641,"journal":{"name":"Energy and Buildings","volume":"340 ","pages":"Article 115815"},"PeriodicalIF":6.6000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Interpretable building energy performance prediction using XGBoost Quantile Regression\",\"authors\":\"Sinem Guler Kangalli Uyar , Bilge Kagan Ozbay , Berker Dal\",\"doi\":\"10.1016/j.enbuild.2025.115815\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This study focuses on predicting the Building Energy Performance Ratio (BEPR) of 3,594 residential buildings in Istanbul using machine learning algorithms. The main objective is to identify the factors affecting BEPR, examine their influence, and analyze how these factors differ across buildings with low and high energy efficiency. To achieve this, seven machine learning models were evaluated: Multiple Linear Regression (MLR), Decision Tree (DT), K-Nearest Neighbors (KNN), Support Vector Regression (SVR), Artificial Neural Networks (ANN), Random Forest (RF), and Extreme Gradient Boosting (XGBoost). The results show that XGBoost yields the highest accuracy among all models. To improve the interpretability of XGBoost, the Shapley Additive Explanations (SHAP) method was employed, enabling the assessment of the impact of various features (such as wall U-value, window U-value, and building age) on BEPR. The analysis revealed that building thermal properties and age are critical factors in determining BEPR. Additionally, by applying the XGBoost Quantile Regression (XGBoost-QR) algorithm, the distribution of BEPR across different quantiles (low, medium, and high) was analyzed more effectively. This approach demonstrated that the features influencing BEPR vary between buildings with low and high energy efficiency. Specifically, in the lower quantiles, structural features such as wall and window insulation have a greater impact on BEPR, whereas in the higher quantiles, building age and roof insulation become more influential. This research contributes to a better understanding of the determinants of residential energy performance, introduces the integration of XGBoost-QR into energy performance analysis, and offers valuable insights for enhancing energy efficiency strategies.</div></div>\",\"PeriodicalId\":11641,\"journal\":{\"name\":\"Energy and Buildings\",\"volume\":\"340 \",\"pages\":\"Article 115815\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Energy and Buildings\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0378778825005456\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and Buildings","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378778825005456","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
Interpretable building energy performance prediction using XGBoost Quantile Regression
This study focuses on predicting the Building Energy Performance Ratio (BEPR) of 3,594 residential buildings in Istanbul using machine learning algorithms. The main objective is to identify the factors affecting BEPR, examine their influence, and analyze how these factors differ across buildings with low and high energy efficiency. To achieve this, seven machine learning models were evaluated: Multiple Linear Regression (MLR), Decision Tree (DT), K-Nearest Neighbors (KNN), Support Vector Regression (SVR), Artificial Neural Networks (ANN), Random Forest (RF), and Extreme Gradient Boosting (XGBoost). The results show that XGBoost yields the highest accuracy among all models. To improve the interpretability of XGBoost, the Shapley Additive Explanations (SHAP) method was employed, enabling the assessment of the impact of various features (such as wall U-value, window U-value, and building age) on BEPR. The analysis revealed that building thermal properties and age are critical factors in determining BEPR. Additionally, by applying the XGBoost Quantile Regression (XGBoost-QR) algorithm, the distribution of BEPR across different quantiles (low, medium, and high) was analyzed more effectively. This approach demonstrated that the features influencing BEPR vary between buildings with low and high energy efficiency. Specifically, in the lower quantiles, structural features such as wall and window insulation have a greater impact on BEPR, whereas in the higher quantiles, building age and roof insulation become more influential. This research contributes to a better understanding of the determinants of residential energy performance, introduces the integration of XGBoost-QR into energy performance analysis, and offers valuable insights for enhancing energy efficiency strategies.
期刊介绍:
An international journal devoted to investigations of energy use and efficiency in buildings
Energy and Buildings is an international journal publishing articles with explicit links to energy use in buildings. The aim is to present new research results, and new proven practice aimed at reducing the energy needs of a building and improving indoor environment quality.