Xinyi Lin , Zhe Tian , Adrian Chong , Yakai Lu , Jide Niu , Na Deng
{"title":"建筑热动力学灰盒模型的数据信息量评价方法","authors":"Xinyi Lin , Zhe Tian , Adrian Chong , Yakai Lu , Jide Niu , Na Deng","doi":"10.1016/j.enbuild.2026.117103","DOIUrl":null,"url":null,"abstract":"<div><div>Grey-box modeling has been widely used in building thermal modeling due to its adaptability and interpretability. The identification of model parameters mainly depends on the measured dataset, and its optimal construction is critical for ensuring model accuracy. Existing studies commonly discuss the influence of training data quantity on the model accuracy. However, the training data informativeness is always ignored, which reflects the quality and richness of information within the data samples and informs the estimates of model parameter values. Notably, the informativeness level may vary among samples, and the quantity of data does not necessarily correlate with its informativeness. Here, we propose a data informativeness evaluation method that can well select informative training data for grey-box models under different scenarios. The method establishes two evaluation criteria based on the characteristics of grey-box model: one describes the consistency between training and forecasting data distributions, and the other outlines the distribution variations within the training data. The effectiveness of the proposed method is demonstrated using data from experiment case. The results indicate that the proposed data informativeness index reflects the quality of the dataset well and has a high correlation with prediction accuracy (The Pearson correlation coefficient varies from −0.6 to −0.8). This evaluation method will be of great significance for optimizing the dataset construction of grey-box model of building thermal dynamics.</div></div>","PeriodicalId":11641,"journal":{"name":"Energy and Buildings","volume":"357 ","pages":"Article 117103"},"PeriodicalIF":7.1000,"publicationDate":"2026-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A data informativeness evaluation method for grey-box modeling of building thermal dynamics\",\"authors\":\"Xinyi Lin , Zhe Tian , Adrian Chong , Yakai Lu , Jide Niu , Na Deng\",\"doi\":\"10.1016/j.enbuild.2026.117103\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Grey-box modeling has been widely used in building thermal modeling due to its adaptability and interpretability. The identification of model parameters mainly depends on the measured dataset, and its optimal construction is critical for ensuring model accuracy. Existing studies commonly discuss the influence of training data quantity on the model accuracy. However, the training data informativeness is always ignored, which reflects the quality and richness of information within the data samples and informs the estimates of model parameter values. Notably, the informativeness level may vary among samples, and the quantity of data does not necessarily correlate with its informativeness. Here, we propose a data informativeness evaluation method that can well select informative training data for grey-box models under different scenarios. The method establishes two evaluation criteria based on the characteristics of grey-box model: one describes the consistency between training and forecasting data distributions, and the other outlines the distribution variations within the training data. The effectiveness of the proposed method is demonstrated using data from experiment case. The results indicate that the proposed data informativeness index reflects the quality of the dataset well and has a high correlation with prediction accuracy (The Pearson correlation coefficient varies from −0.6 to −0.8). This evaluation method will be of great significance for optimizing the dataset construction of grey-box model of building thermal dynamics.</div></div>\",\"PeriodicalId\":11641,\"journal\":{\"name\":\"Energy and Buildings\",\"volume\":\"357 \",\"pages\":\"Article 117103\"},\"PeriodicalIF\":7.1000,\"publicationDate\":\"2026-04-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Energy and Buildings\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0378778826001635\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2026/2/4 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CONSTRUCTION & BUILDING TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and Buildings","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378778826001635","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/2/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
A data informativeness evaluation method for grey-box modeling of building thermal dynamics
Grey-box modeling has been widely used in building thermal modeling due to its adaptability and interpretability. The identification of model parameters mainly depends on the measured dataset, and its optimal construction is critical for ensuring model accuracy. Existing studies commonly discuss the influence of training data quantity on the model accuracy. However, the training data informativeness is always ignored, which reflects the quality and richness of information within the data samples and informs the estimates of model parameter values. Notably, the informativeness level may vary among samples, and the quantity of data does not necessarily correlate with its informativeness. Here, we propose a data informativeness evaluation method that can well select informative training data for grey-box models under different scenarios. The method establishes two evaluation criteria based on the characteristics of grey-box model: one describes the consistency between training and forecasting data distributions, and the other outlines the distribution variations within the training data. The effectiveness of the proposed method is demonstrated using data from experiment case. The results indicate that the proposed data informativeness index reflects the quality of the dataset well and has a high correlation with prediction accuracy (The Pearson correlation coefficient varies from −0.6 to −0.8). This evaluation method will be of great significance for optimizing the dataset construction of grey-box model of building thermal dynamics.
期刊介绍:
An international journal devoted to investigations of energy use and efficiency in buildings
Energy and Buildings is an international journal publishing articles with explicit links to energy use in buildings. The aim is to present new research results, and new proven practice aimed at reducing the energy needs of a building and improving indoor environment quality.