模型结构复杂度和数据预处理对水文过程模拟人工神经网络预测性能的影响

M. Y. Otache, J. Musa, I. Kuti, Mustapha Mohammed, Lydia Ezekiel Pam
{"title":"模型结构复杂度和数据预处理对水文过程模拟人工神经网络预测性能的影响","authors":"M. Y. Otache, J. Musa, I. Kuti, Mustapha Mohammed, Lydia Ezekiel Pam","doi":"10.4236/OJMH.2021.111001","DOIUrl":null,"url":null,"abstract":"The choice of a particular Artificial Neural Network (ANN) structure is a seemingly difficult task; worthy of relevance is that there is no systematic way for establishing a suitable architecture. In view of this, the study looked at the effects of ANN structural complexity and data pre-processing regime on its forecast performance. To address this aim, two ANN structural configurations: 1) Single-hidden layer, and 2) Double-hidden layer feed-forward back propagation network were employed. Results obtained revealed generally that: a) ANN comprised of double hidden layers tends to be less robust and converges with less accuracy than its single-hidden layer counterpart under identical situations; b) for a univariate time series, phase-space reconstruction using embedding dimension which is based on dynamical systems theory is an effective way for determining the appropriate number of ANN input neurons, and c) data pre-processing via the scaling approach excessively limits the output range of the transfer function. In specific terms considering extreme flow prediction capability on the basis of effective correlation: Percent maximum and minimum correlation coefficient (Rmax% and Rmin%), on the average for one-day ahead forecast during the training and validation phases respectively for the adopted network structures: 8 7 5 (i.e., 8 input nodes, 7 nodes in the hidden layer, and 5 output nodes in the output layer), 8 5 2 5 (8 nodes in the input layer, 5 nodes in the first hidden layer, 2 nodes in the second hidden layer, and 5 nodes in the output layer), and 8 4 3 5 (8 nodes in the input layer, 4 nodes in the first hidden layer, 3 nodes in the second hidden layer, and 5 nodes in the output layer) gave: 101.2, 99.4; 100.2, 218.3; 93.7, 95.0 in all instances irrespective of the training algorithm (i.e., pooled). On the other hand, in terms of percent of correct event prediction, the respective performances of the models for both low and high flows during the training and validation phases, respectively were: 0.78, 0.96: 0.65, 0.87; 0.76, 0.93: 0.61, 0.83; and 0.79, 0.96: 0.65, 0.87. Thus, it suffices to note that on the basis of coherence or regularity of prediction consistency, the ANN model: 8 4 3 5 performed better. This implies that though the adoption of large hidden layers vis-a-vis corresponding large neuronal signatures could be counter-productive because of network over-fitting, however, it may provide additional representational power. Based on the findings, it is imperative to note that ANN model is by no means a substitute for conceptual watershed modelling, therefore, exogenous variables should be incorporated in streamflow modelling and forecasting exercise because of their hydrologic evolutions.","PeriodicalId":70695,"journal":{"name":"现代水文学期刊(英文)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Effects of Model Structural Complexity and Data Pre-Processing on Artificial Neural Network (ANN) Forecast Performance for Hydrological Process Modelling\",\"authors\":\"M. Y. Otache, J. Musa, I. Kuti, Mustapha Mohammed, Lydia Ezekiel Pam\",\"doi\":\"10.4236/OJMH.2021.111001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The choice of a particular Artificial Neural Network (ANN) structure is a seemingly difficult task; worthy of relevance is that there is no systematic way for establishing a suitable architecture. In view of this, the study looked at the effects of ANN structural complexity and data pre-processing regime on its forecast performance. To address this aim, two ANN structural configurations: 1) Single-hidden layer, and 2) Double-hidden layer feed-forward back propagation network were employed. Results obtained revealed generally that: a) ANN comprised of double hidden layers tends to be less robust and converges with less accuracy than its single-hidden layer counterpart under identical situations; b) for a univariate time series, phase-space reconstruction using embedding dimension which is based on dynamical systems theory is an effective way for determining the appropriate number of ANN input neurons, and c) data pre-processing via the scaling approach excessively limits the output range of the transfer function. In specific terms considering extreme flow prediction capability on the basis of effective correlation: Percent maximum and minimum correlation coefficient (Rmax% and Rmin%), on the average for one-day ahead forecast during the training and validation phases respectively for the adopted network structures: 8 7 5 (i.e., 8 input nodes, 7 nodes in the hidden layer, and 5 output nodes in the output layer), 8 5 2 5 (8 nodes in the input layer, 5 nodes in the first hidden layer, 2 nodes in the second hidden layer, and 5 nodes in the output layer), and 8 4 3 5 (8 nodes in the input layer, 4 nodes in the first hidden layer, 3 nodes in the second hidden layer, and 5 nodes in the output layer) gave: 101.2, 99.4; 100.2, 218.3; 93.7, 95.0 in all instances irrespective of the training algorithm (i.e., pooled). On the other hand, in terms of percent of correct event prediction, the respective performances of the models for both low and high flows during the training and validation phases, respectively were: 0.78, 0.96: 0.65, 0.87; 0.76, 0.93: 0.61, 0.83; and 0.79, 0.96: 0.65, 0.87. Thus, it suffices to note that on the basis of coherence or regularity of prediction consistency, the ANN model: 8 4 3 5 performed better. This implies that though the adoption of large hidden layers vis-a-vis corresponding large neuronal signatures could be counter-productive because of network over-fitting, however, it may provide additional representational power. Based on the findings, it is imperative to note that ANN model is by no means a substitute for conceptual watershed modelling, therefore, exogenous variables should be incorporated in streamflow modelling and forecasting exercise because of their hydrologic evolutions.\",\"PeriodicalId\":70695,\"journal\":{\"name\":\"现代水文学期刊(英文)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"现代水文学期刊(英文)\",\"FirstCategoryId\":\"1089\",\"ListUrlMain\":\"https://doi.org/10.4236/OJMH.2021.111001\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"现代水文学期刊(英文)","FirstCategoryId":"1089","ListUrlMain":"https://doi.org/10.4236/OJMH.2021.111001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

选择特定的人工神经网络(ANN)结构是一项看似困难的任务;值得关注的是,没有系统的方法来建立合适的体系结构。鉴于此,本研究考察了人工神经网络结构复杂性和数据预处理制度对其预测性能的影响。为了解决这一问题,采用了两种ANN结构配置:1)单隐层和2)双隐层前馈反传播网络。结果表明:a)在相同的情况下,由双隐层组成的人工神经网络的鲁棒性和收敛精度都低于单隐层的人工神经网络;b)对于单变量时间序列,基于动力系统理论的嵌入维数相空间重构是确定ANN输入神经元数量的有效方法;c)缩放方法的数据预处理过度限制了传递函数的输出范围。具体考虑基于有效相关的极端流量预测能力:所采用的网络结构在训练和验证阶段的最大最小相关系数百分比(Rmax%和Rmin%)分别为前一天预测的平均值:8 7 5(即8输入节点,7个隐层节点,和5个输出节点在输出层),5 2 5 8(8节点输入层,5第一隐层节点,2在第二隐层节点,节点和5在输出层),和8 4 3 5(8节点的输入层,4个节点在第一隐层,3第二隐层节点,节点和5在输出层)为:101.2,99.4;100.2、218.3;93.7, 95.0在所有的情况下,无论训练算法(即,池)。另一方面,在训练和验证阶段,低流量和高流量模型的事件预测正确率分别为:0.78、0.96、0.65、0.87;0.76, 0.93; 0.61, 0.83;0.79, 0.96, 0.65, 0.87。因此,值得注意的是,基于预测一致性的一致性或规律性,ANN模型:8 4 3 5表现更好。这意味着,尽管由于网络过度拟合,相对于相应的大型神经元特征采用大型隐藏层可能会适得其反,但它可能提供额外的表征能力。基于这些发现,必须注意到,人工神经网络模型绝不是概念性流域建模的替代品,因此,由于外生变量的水文演变,应将其纳入流量建模和预测工作中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Effects of Model Structural Complexity and Data Pre-Processing on Artificial Neural Network (ANN) Forecast Performance for Hydrological Process Modelling
The choice of a particular Artificial Neural Network (ANN) structure is a seemingly difficult task; worthy of relevance is that there is no systematic way for establishing a suitable architecture. In view of this, the study looked at the effects of ANN structural complexity and data pre-processing regime on its forecast performance. To address this aim, two ANN structural configurations: 1) Single-hidden layer, and 2) Double-hidden layer feed-forward back propagation network were employed. Results obtained revealed generally that: a) ANN comprised of double hidden layers tends to be less robust and converges with less accuracy than its single-hidden layer counterpart under identical situations; b) for a univariate time series, phase-space reconstruction using embedding dimension which is based on dynamical systems theory is an effective way for determining the appropriate number of ANN input neurons, and c) data pre-processing via the scaling approach excessively limits the output range of the transfer function. In specific terms considering extreme flow prediction capability on the basis of effective correlation: Percent maximum and minimum correlation coefficient (Rmax% and Rmin%), on the average for one-day ahead forecast during the training and validation phases respectively for the adopted network structures: 8 7 5 (i.e., 8 input nodes, 7 nodes in the hidden layer, and 5 output nodes in the output layer), 8 5 2 5 (8 nodes in the input layer, 5 nodes in the first hidden layer, 2 nodes in the second hidden layer, and 5 nodes in the output layer), and 8 4 3 5 (8 nodes in the input layer, 4 nodes in the first hidden layer, 3 nodes in the second hidden layer, and 5 nodes in the output layer) gave: 101.2, 99.4; 100.2, 218.3; 93.7, 95.0 in all instances irrespective of the training algorithm (i.e., pooled). On the other hand, in terms of percent of correct event prediction, the respective performances of the models for both low and high flows during the training and validation phases, respectively were: 0.78, 0.96: 0.65, 0.87; 0.76, 0.93: 0.61, 0.83; and 0.79, 0.96: 0.65, 0.87. Thus, it suffices to note that on the basis of coherence or regularity of prediction consistency, the ANN model: 8 4 3 5 performed better. This implies that though the adoption of large hidden layers vis-a-vis corresponding large neuronal signatures could be counter-productive because of network over-fitting, however, it may provide additional representational power. Based on the findings, it is imperative to note that ANN model is by no means a substitute for conceptual watershed modelling, therefore, exogenous variables should be incorporated in streamflow modelling and forecasting exercise because of their hydrologic evolutions.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
84
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信