利用信息熵选择训练数据:应用于中国北方农村居民采暖负荷建模

IF 0.7 4区 地球科学 Q4 GEOCHEMISTRY & GEOPHYSICS
Li-gai Kang, Hao Li, Zhi-chao Wang, Dong-xiang Sun, Jin-zhu Wang, Yang Yang, Xu Zhang
{"title":"利用信息熵选择训练数据:应用于中国北方农村居民采暖负荷建模","authors":"Li-gai Kang, Hao Li, Zhi-chao Wang, Dong-xiang Sun, Jin-zhu Wang, Yang Yang, Xu Zhang","doi":"10.1007/s11770-024-1120-9","DOIUrl":null,"url":null,"abstract":"<p>The selection of input variables and their amount has been an important issue in big data load forecasting. Taking heating load forecasting as an example, this paper proposed a method for data filtering based on information entropy. First, the heating data from an air source heat pump system adopted by a rural residence in northern China were employed. Moreover, the training data were classified based on linear or nonlinear variations of outdoor temperature and its changing ranges, while the validation data included three different types of weather conditions, namely, cold, cool, and mild. Then, the information entropy under 2-h, 4-h, 6-h and 8-h training window was quantified to be 1.811, 1.839, 1.877 and 1.856, respectively. For the employed rural residence, an equivalent three-resistance and two-capacity model was established to validate the effectiveness of the training window. Using the derived optimal thermal resistance and capacity, the various selection of outdoor temperature variation trend and range were compared and optimized. Results showed that 6 h of training data had the maximum information entropy and the most abundant information, the minimum errors between actual and forecasting data were observed under 6 h of training data, linear change, and lower outdoor temperature. The mean absolute percentage errors for the load forecasting of three typical days were 5.63%, 8.46%, and 12.10%, respectively.</p>","PeriodicalId":55500,"journal":{"name":"Applied Geophysics","volume":"18 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Training data selection using information entropy: Application to heating load modeling of rural residence in northern China\",\"authors\":\"Li-gai Kang, Hao Li, Zhi-chao Wang, Dong-xiang Sun, Jin-zhu Wang, Yang Yang, Xu Zhang\",\"doi\":\"10.1007/s11770-024-1120-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>The selection of input variables and their amount has been an important issue in big data load forecasting. Taking heating load forecasting as an example, this paper proposed a method for data filtering based on information entropy. First, the heating data from an air source heat pump system adopted by a rural residence in northern China were employed. Moreover, the training data were classified based on linear or nonlinear variations of outdoor temperature and its changing ranges, while the validation data included three different types of weather conditions, namely, cold, cool, and mild. Then, the information entropy under 2-h, 4-h, 6-h and 8-h training window was quantified to be 1.811, 1.839, 1.877 and 1.856, respectively. For the employed rural residence, an equivalent three-resistance and two-capacity model was established to validate the effectiveness of the training window. Using the derived optimal thermal resistance and capacity, the various selection of outdoor temperature variation trend and range were compared and optimized. Results showed that 6 h of training data had the maximum information entropy and the most abundant information, the minimum errors between actual and forecasting data were observed under 6 h of training data, linear change, and lower outdoor temperature. The mean absolute percentage errors for the load forecasting of three typical days were 5.63%, 8.46%, and 12.10%, respectively.</p>\",\"PeriodicalId\":55500,\"journal\":{\"name\":\"Applied Geophysics\",\"volume\":\"18 1\",\"pages\":\"\"},\"PeriodicalIF\":0.7000,\"publicationDate\":\"2024-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Geophysics\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://doi.org/10.1007/s11770-024-1120-9\",\"RegionNum\":4,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"GEOCHEMISTRY & GEOPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Geophysics","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s11770-024-1120-9","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0

摘要

输入变量及其数量的选择一直是大数据负荷预测中的一个重要问题。本文以采暖负荷预测为例,提出了一种基于信息熵的数据过滤方法。首先,采用了中国北方农村居民采用的空气源热泵系统的采暖数据。此外,训练数据根据室外温度的线性或非线性变化及其变化范围进行分类,而验证数据则包括三种不同的天气条件,即寒冷、凉爽和温和。然后,将 2 小时、4 小时、6 小时和 8 小时训练窗口下的信息熵分别量化为 1.811、1.839、1.877 和 1.856。针对所采用的农村住宅,建立了一个等效的三阻力和两容量模型,以验证训练窗口的有效性。利用得出的最佳热阻和热容量,对室外温度变化趋势和范围的各种选择进行了比较和优化。结果表明,6 小时的训练数据具有最大的信息熵和最丰富的信息,在 6 小时训练数据、线性变化和室外温度较低的情况下,实际数据与预测数据之间的误差最小。三个典型日负荷预测的平均绝对百分比误差分别为 5.63%、8.46% 和 12.10%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Training data selection using information entropy: Application to heating load modeling of rural residence in northern China

The selection of input variables and their amount has been an important issue in big data load forecasting. Taking heating load forecasting as an example, this paper proposed a method for data filtering based on information entropy. First, the heating data from an air source heat pump system adopted by a rural residence in northern China were employed. Moreover, the training data were classified based on linear or nonlinear variations of outdoor temperature and its changing ranges, while the validation data included three different types of weather conditions, namely, cold, cool, and mild. Then, the information entropy under 2-h, 4-h, 6-h and 8-h training window was quantified to be 1.811, 1.839, 1.877 and 1.856, respectively. For the employed rural residence, an equivalent three-resistance and two-capacity model was established to validate the effectiveness of the training window. Using the derived optimal thermal resistance and capacity, the various selection of outdoor temperature variation trend and range were compared and optimized. Results showed that 6 h of training data had the maximum information entropy and the most abundant information, the minimum errors between actual and forecasting data were observed under 6 h of training data, linear change, and lower outdoor temperature. The mean absolute percentage errors for the load forecasting of three typical days were 5.63%, 8.46%, and 12.10%, respectively.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Applied Geophysics
Applied Geophysics 地学-地球化学与地球物理
CiteScore
1.50
自引率
14.30%
发文量
912
审稿时长
2 months
期刊介绍: The journal is designed to provide an academic realm for a broad blend of academic and industry papers to promote rapid communication and exchange of ideas between Chinese and world-wide geophysicists. The publication covers the applications of geoscience, geophysics, and related disciplines in the fields of energy, resources, environment, disaster, engineering, information, military, and surveying.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信