Qi Liu, Hongjuan Hou, Zekai Zhou, Lengge Si, Xi Wang, Yan Jia, Eric Hu
{"title":"一种基于casda驱动的无监督聚类热负荷预测新方法","authors":"Qi Liu, Hongjuan Hou, Zekai Zhou, Lengge Si, Xi Wang, Yan Jia, Eric Hu","doi":"10.1016/j.jobe.2025.113457","DOIUrl":null,"url":null,"abstract":"Accurate heat load forecasting is essential for improving the operational efficiency and intelligent management of district heating systems, particularly in addressing the mismatch between heat supply and demand caused by spatiotemporal variability. While Artificial Neural Network (ANN) have shown promise in this domain, their performance is highly sensitive to the quality and volume of training data. Given the limitations of data availability and the cost of large-scale data acquisition, enhancing data quality through preprocessing has become a practical alternative. This study proposes an improved data preprocessing strategy, termed Cluster Analysis based on Similar Day Approach (CASDA), to enhance ANN training for heat load forecasting. Unlike traditional Similar Day Approach (SDA) that relies primarily on weather similarity, CASDA clusters historical data based on heat load patterns, providing a more representative training dataset. Clusters are labeled by dominant weather types and used to train distinct ANN models. For forecasting, the most appropriate model is selected by evaluating the similarity between the forecast day and each cluster using a combination of Grey Relational Analysis (GRA) and Pearson correlation. A case study on a district heating substation in Beijing demonstrates that CASDA significantly improves forecasting accuracy across multiple ANN architectures, including Recurrent Neural Network (RNN), Convolutional Neural Network (CNN) and Transformers, and reduces MAPE by 15.2% (ConvGRU-GRU) compared with traditional SDA methods. Notably, the Transformer-based model (CMT) achieved the best performance, with an average validation R<ce:sup loc=\"post\">2</ce:sup> of 0.77, outperforming both traditional SDA models and models trained on unprocessed data. Moreover, CASDA offers reduced modeling costs by optimizing data utilization.","PeriodicalId":15064,"journal":{"name":"Journal of building engineering","volume":"109 1","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Novel Heat Load Forecasting Method Based on CASDA-Driven Unsupervised Clustering\",\"authors\":\"Qi Liu, Hongjuan Hou, Zekai Zhou, Lengge Si, Xi Wang, Yan Jia, Eric Hu\",\"doi\":\"10.1016/j.jobe.2025.113457\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate heat load forecasting is essential for improving the operational efficiency and intelligent management of district heating systems, particularly in addressing the mismatch between heat supply and demand caused by spatiotemporal variability. While Artificial Neural Network (ANN) have shown promise in this domain, their performance is highly sensitive to the quality and volume of training data. Given the limitations of data availability and the cost of large-scale data acquisition, enhancing data quality through preprocessing has become a practical alternative. This study proposes an improved data preprocessing strategy, termed Cluster Analysis based on Similar Day Approach (CASDA), to enhance ANN training for heat load forecasting. Unlike traditional Similar Day Approach (SDA) that relies primarily on weather similarity, CASDA clusters historical data based on heat load patterns, providing a more representative training dataset. Clusters are labeled by dominant weather types and used to train distinct ANN models. For forecasting, the most appropriate model is selected by evaluating the similarity between the forecast day and each cluster using a combination of Grey Relational Analysis (GRA) and Pearson correlation. A case study on a district heating substation in Beijing demonstrates that CASDA significantly improves forecasting accuracy across multiple ANN architectures, including Recurrent Neural Network (RNN), Convolutional Neural Network (CNN) and Transformers, and reduces MAPE by 15.2% (ConvGRU-GRU) compared with traditional SDA methods. Notably, the Transformer-based model (CMT) achieved the best performance, with an average validation R<ce:sup loc=\\\"post\\\">2</ce:sup> of 0.77, outperforming both traditional SDA models and models trained on unprocessed data. Moreover, CASDA offers reduced modeling costs by optimizing data utilization.\",\"PeriodicalId\":15064,\"journal\":{\"name\":\"Journal of building engineering\",\"volume\":\"109 1\",\"pages\":\"\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2025-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of building engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jobe.2025.113457\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of building engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.jobe.2025.113457","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
A Novel Heat Load Forecasting Method Based on CASDA-Driven Unsupervised Clustering
Accurate heat load forecasting is essential for improving the operational efficiency and intelligent management of district heating systems, particularly in addressing the mismatch between heat supply and demand caused by spatiotemporal variability. While Artificial Neural Network (ANN) have shown promise in this domain, their performance is highly sensitive to the quality and volume of training data. Given the limitations of data availability and the cost of large-scale data acquisition, enhancing data quality through preprocessing has become a practical alternative. This study proposes an improved data preprocessing strategy, termed Cluster Analysis based on Similar Day Approach (CASDA), to enhance ANN training for heat load forecasting. Unlike traditional Similar Day Approach (SDA) that relies primarily on weather similarity, CASDA clusters historical data based on heat load patterns, providing a more representative training dataset. Clusters are labeled by dominant weather types and used to train distinct ANN models. For forecasting, the most appropriate model is selected by evaluating the similarity between the forecast day and each cluster using a combination of Grey Relational Analysis (GRA) and Pearson correlation. A case study on a district heating substation in Beijing demonstrates that CASDA significantly improves forecasting accuracy across multiple ANN architectures, including Recurrent Neural Network (RNN), Convolutional Neural Network (CNN) and Transformers, and reduces MAPE by 15.2% (ConvGRU-GRU) compared with traditional SDA methods. Notably, the Transformer-based model (CMT) achieved the best performance, with an average validation R2 of 0.77, outperforming both traditional SDA models and models trained on unprocessed data. Moreover, CASDA offers reduced modeling costs by optimizing data utilization.
期刊介绍:
The Journal of Building Engineering is an interdisciplinary journal that covers all aspects of science and technology concerned with the whole life cycle of the built environment; from the design phase through to construction, operation, performance, maintenance and its deterioration.