集成学习增强的逐步聚类分析在河流融冰日期预测中的应用

W. Sun, Q. Shi, Y. Huang, Y. Lv
{"title":"集成学习增强的逐步聚类分析在河流融冰日期预测中的应用","authors":"W. Sun, Q. Shi, Y. Huang, Y. Lv","doi":"10.3808/JEIL.201900005","DOIUrl":null,"url":null,"abstract":"Frequently occurring ice jams often cause concern in northern regions. Breakup timing is directly related to emergency responses preparation and thus its early accurate forecasting is beneficial to ice-related flooding management. The stepwise cluster analysis (SCA) is a non-parameter regression method, which generates a classification tree in the sense of probability through cutting or merging operations according to certain statistic criteria. To enhance SCA’s predictive performance, a SCA ensemble (SCAE) method is developed and applied to forecasting of annual river ice breakup dates (BDs). In detail, the SCA is employed as a base model at the lower level while the simple average method is selected as combining models at the upper level. The SCA base models are selected according to different performance selection criteria and searched for further combination. A site on a representative river prone to river ice flooding in Alberta, Canada is selected to demonstrate the effectiveness of the proposed SCAE. The results mainly show that: the SCA base models with multiple combinations of inputs and internal parameters are able to predict the BDs with good performances (the highest average of correlation coefficients for training can be 0.958); the optimal SCA base model has three inputs, which indicates that the temperatures before breakup and just after freeze-up as well as the maximum of water flow in March are relatively important indicators of BD. The optimal SCAE, including base models from different performance selection criteria, has the lowest average of root mean squared error, which improves upon the optimal SCA base model by 25.3%. It indicates the different model selection criteria do improve the diversity and thus further help to improve the performance of ensemble models. This first application of the SCAE to river ice forecasting highlights the possibility of using the ensemble learning paradigm to enhance the SCA. The potential applications of the SCAE to other forecasting problems are expected.","PeriodicalId":143718,"journal":{"name":"Journal of Environmental Informatics Letters","volume":"59 3-4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Ensemble Learning Enhanced Stepwise Cluster Analysis for River Ice Breakup Date Forecasting\",\"authors\":\"W. Sun, Q. Shi, Y. Huang, Y. Lv\",\"doi\":\"10.3808/JEIL.201900005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Frequently occurring ice jams often cause concern in northern regions. Breakup timing is directly related to emergency responses preparation and thus its early accurate forecasting is beneficial to ice-related flooding management. The stepwise cluster analysis (SCA) is a non-parameter regression method, which generates a classification tree in the sense of probability through cutting or merging operations according to certain statistic criteria. To enhance SCA’s predictive performance, a SCA ensemble (SCAE) method is developed and applied to forecasting of annual river ice breakup dates (BDs). In detail, the SCA is employed as a base model at the lower level while the simple average method is selected as combining models at the upper level. The SCA base models are selected according to different performance selection criteria and searched for further combination. A site on a representative river prone to river ice flooding in Alberta, Canada is selected to demonstrate the effectiveness of the proposed SCAE. The results mainly show that: the SCA base models with multiple combinations of inputs and internal parameters are able to predict the BDs with good performances (the highest average of correlation coefficients for training can be 0.958); the optimal SCA base model has three inputs, which indicates that the temperatures before breakup and just after freeze-up as well as the maximum of water flow in March are relatively important indicators of BD. The optimal SCAE, including base models from different performance selection criteria, has the lowest average of root mean squared error, which improves upon the optimal SCA base model by 25.3%. It indicates the different model selection criteria do improve the diversity and thus further help to improve the performance of ensemble models. This first application of the SCAE to river ice forecasting highlights the possibility of using the ensemble learning paradigm to enhance the SCA. The potential applications of the SCAE to other forecasting problems are expected.\",\"PeriodicalId\":143718,\"journal\":{\"name\":\"Journal of Environmental Informatics Letters\",\"volume\":\"59 3-4\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Environmental Informatics Letters\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3808/JEIL.201900005\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Environmental Informatics Letters","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3808/JEIL.201900005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

频繁发生的冰堵塞经常引起北方地区的关注。崩解时间与应急准备直接相关,早期准确预测崩解时间有利于冰害洪水管理。逐步聚类分析(SCA)是一种无参数回归方法,它根据一定的统计准则,通过切割或合并操作生成概率意义上的分类树。为了提高SCA的预测能力,提出了一种SCA集合(SCAE)方法,并将其应用于河流冰崩解日期的预测。其中,底层采用SCA作为基础模型,上层采用简单平均方法作为组合模型。根据不同的性能选择标准选择SCA基本模型,并搜索进一步的组合。选取加拿大艾伯塔省一条易发生河冰洪水的代表性河流上的一个地点,以证明拟议的SCAE的有效性。结果主要表明:具有多种输入和内部参数组合的SCA基础模型能够较好地预测BDs(训练相关系数的最高平均值可达0.958);最优SCA基础模型有3个输入,表明破碎前和冻结后的温度以及3月份的最大水流量是BD较为重要的指标。包括不同性能选择标准的基础模型的最优SCAE的均方根误差平均值最低,比最优SCA基础模型提高了25.3%。表明不同的模型选择标准确实提高了集成模型的多样性,从而有助于进一步提高集成模型的性能。SCAE在河冰预报中的首次应用突出了使用集成学习范式来增强SCA的可能性。预计SCAE在其他预报问题上的潜在应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Ensemble Learning Enhanced Stepwise Cluster Analysis for River Ice Breakup Date Forecasting
Frequently occurring ice jams often cause concern in northern regions. Breakup timing is directly related to emergency responses preparation and thus its early accurate forecasting is beneficial to ice-related flooding management. The stepwise cluster analysis (SCA) is a non-parameter regression method, which generates a classification tree in the sense of probability through cutting or merging operations according to certain statistic criteria. To enhance SCA’s predictive performance, a SCA ensemble (SCAE) method is developed and applied to forecasting of annual river ice breakup dates (BDs). In detail, the SCA is employed as a base model at the lower level while the simple average method is selected as combining models at the upper level. The SCA base models are selected according to different performance selection criteria and searched for further combination. A site on a representative river prone to river ice flooding in Alberta, Canada is selected to demonstrate the effectiveness of the proposed SCAE. The results mainly show that: the SCA base models with multiple combinations of inputs and internal parameters are able to predict the BDs with good performances (the highest average of correlation coefficients for training can be 0.958); the optimal SCA base model has three inputs, which indicates that the temperatures before breakup and just after freeze-up as well as the maximum of water flow in March are relatively important indicators of BD. The optimal SCAE, including base models from different performance selection criteria, has the lowest average of root mean squared error, which improves upon the optimal SCA base model by 25.3%. It indicates the different model selection criteria do improve the diversity and thus further help to improve the performance of ensemble models. This first application of the SCAE to river ice forecasting highlights the possibility of using the ensemble learning paradigm to enhance the SCA. The potential applications of the SCAE to other forecasting problems are expected.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信