A data informativeness evaluation method for grey-box modeling of building thermal dynamics

IF 7.1 2区 工程技术 Q1 CONSTRUCTION & BUILDING TECHNOLOGY
Energy and Buildings Pub Date : 2026-04-15 Epub Date: 2026-02-04 DOI:10.1016/j.enbuild.2026.117103
Xinyi Lin , Zhe Tian , Adrian Chong , Yakai Lu , Jide Niu , Na Deng
{"title":"A data informativeness evaluation method for grey-box modeling of building thermal dynamics","authors":"Xinyi Lin ,&nbsp;Zhe Tian ,&nbsp;Adrian Chong ,&nbsp;Yakai Lu ,&nbsp;Jide Niu ,&nbsp;Na Deng","doi":"10.1016/j.enbuild.2026.117103","DOIUrl":null,"url":null,"abstract":"<div><div>Grey-box modeling has been widely used in building thermal modeling due to its adaptability and interpretability. The identification of model parameters mainly depends on the measured dataset, and its optimal construction is critical for ensuring model accuracy. Existing studies commonly discuss the influence of training data quantity on the model accuracy. However, the training data informativeness is always ignored, which reflects the quality and richness of information within the data samples and informs the estimates of model parameter values. Notably, the informativeness level may vary among samples, and the quantity of data does not necessarily correlate with its informativeness. Here, we propose a data informativeness evaluation method that can well select informative training data for grey-box models under different scenarios. The method establishes two evaluation criteria based on the characteristics of grey-box model: one describes the consistency between training and forecasting data distributions, and the other outlines the distribution variations within the training data. The effectiveness of the proposed method is demonstrated using data from experiment case. The results indicate that the proposed data informativeness index reflects the quality of the dataset well and has a high correlation with prediction accuracy (The Pearson correlation coefficient varies from −0.6 to −0.8). This evaluation method will be of great significance for optimizing the dataset construction of grey-box model of building thermal dynamics.</div></div>","PeriodicalId":11641,"journal":{"name":"Energy and Buildings","volume":"357 ","pages":"Article 117103"},"PeriodicalIF":7.1000,"publicationDate":"2026-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy and Buildings","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378778826001635","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/2/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Grey-box modeling has been widely used in building thermal modeling due to its adaptability and interpretability. The identification of model parameters mainly depends on the measured dataset, and its optimal construction is critical for ensuring model accuracy. Existing studies commonly discuss the influence of training data quantity on the model accuracy. However, the training data informativeness is always ignored, which reflects the quality and richness of information within the data samples and informs the estimates of model parameter values. Notably, the informativeness level may vary among samples, and the quantity of data does not necessarily correlate with its informativeness. Here, we propose a data informativeness evaluation method that can well select informative training data for grey-box models under different scenarios. The method establishes two evaluation criteria based on the characteristics of grey-box model: one describes the consistency between training and forecasting data distributions, and the other outlines the distribution variations within the training data. The effectiveness of the proposed method is demonstrated using data from experiment case. The results indicate that the proposed data informativeness index reflects the quality of the dataset well and has a high correlation with prediction accuracy (The Pearson correlation coefficient varies from −0.6 to −0.8). This evaluation method will be of great significance for optimizing the dataset construction of grey-box model of building thermal dynamics.
建筑热动力学灰盒模型的数据信息量评价方法
灰盒模型以其适应性和可解释性在建筑热建模中得到了广泛的应用。模型参数的识别主要依赖于实测数据集,其优化构造是保证模型精度的关键。现有研究普遍讨论训练数据量对模型精度的影响。然而,训练数据的信息量往往被忽略,它反映了数据样本中信息的质量和丰富程度,并为模型参数值的估计提供了信息。值得注意的是,样本的信息水平可能会有所不同,数据的数量并不一定与其信息相关。本文提出了一种数据信息量评价方法,可以很好地为灰盒模型在不同场景下选择信息量大的训练数据。该方法基于灰盒模型的特点,建立了两个评价标准:一个描述训练数据与预测数据分布的一致性,另一个描述训练数据内部分布的变化。实验数据验证了该方法的有效性。结果表明,所提出的数据信息量指数较好地反映了数据集的质量,并且与预测精度具有较高的相关性(Pearson相关系数在−0.6 ~−0.8之间)。该评价方法对优化建筑热动力学灰盒模型的数据集构建具有重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Energy and Buildings
Energy and Buildings 工程技术-工程:土木
CiteScore
12.70
自引率
11.90%
发文量
863
审稿时长
38 days
期刊介绍: An international journal devoted to investigations of energy use and efficiency in buildings Energy and Buildings is an international journal publishing articles with explicit links to energy use in buildings. The aim is to present new research results, and new proven practice aimed at reducing the energy needs of a building and improving indoor environment quality.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书