The Impact of Preprocessing Approaches on Neural Network Performance: A Case Study on Evaporation in Adana, a Mediterranean Climate

O. Katipoğlu, M. Peki̇n, Sercan Akil
{"title":"The Impact of Preprocessing Approaches on Neural Network Performance: A Case Study on Evaporation in Adana, a Mediterranean Climate","authors":"O. Katipoğlu, M. Peki̇n, Sercan Akil","doi":"10.52562/injoes.2023.821","DOIUrl":null,"url":null,"abstract":"The application of artificial intelligence (AI) technologies is quickly expanding in water management. Additionally, the artificial neural network methodology has an advantage over traditional statistical approaches in that it does not need assumptions about the distribution of data and variables. These methods can be used if there is a large enough data collection and criteria relevant to the nature of the problem. Preprocessing data before utilizing it improves the performance of the AI model. Evaporation matters in water management, agriculture processes and soil science. It is critical to ensure proper estimation of evaporation losses for effective water resource planning and management particularly in drought-prone areas such as Adana. Artificial intelligence approaches can be applied successfully in evaporation calculation. In this research, we used the Standard scaler, power transformer, robust scaler quantile transformer (Uniform) and quantile transformer (Normal), and min-max scaler preprocessing techniques to preprocess the multilayer perceptron neural network (MLPNN). We also trained the MLPNN using unprocessed data and compared it to the results of the preprocessed model. In the setup of the model, daily temperature, pressure, wind, sunny hours, and humidity parameters covering the years 2018-2021 were presented as input to the MLPNN model. Consequently, we pinpoint that all preprocessing approaches produce better outcomes than unscaled. Although all models produced statistically high accuracy predictions according to statistical criteria, the MLPNN model established without transformation (test phase: r2: 0.93, NSE : 0.927, SMAPE: 10.77, RMSE: 1.2, MAE: 0.9) exhibited the lowest accuracy. The evaporation prediction model that was developed using the MLPNN-based standard scalar optimization algorithm exhibited the highest level of accuracy  (test phase: r2: 0.978, NSE: 0.977, SMAPE: 5.93, RMSE: 0.68, MAE: 0.48). Power Transformer (test phase: r2: 0.978, NSE: 0.977, SMAPE: 5.81, RMSE: 0.67, MAE: 0.49) showed second-degree promising results. It can be concluded from these results that the estimation of meteorological variables requires the scaling and presentation of data in a uniform structure. Therefore, improving efficiency and productivity in water management or agricultural processes can be enhanced by making more accurate evaporation estimates.","PeriodicalId":197067,"journal":{"name":"Indonesian Journal of Earth Sciences","volume":"8 13","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Indonesian Journal of Earth Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52562/injoes.2023.821","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The application of artificial intelligence (AI) technologies is quickly expanding in water management. Additionally, the artificial neural network methodology has an advantage over traditional statistical approaches in that it does not need assumptions about the distribution of data and variables. These methods can be used if there is a large enough data collection and criteria relevant to the nature of the problem. Preprocessing data before utilizing it improves the performance of the AI model. Evaporation matters in water management, agriculture processes and soil science. It is critical to ensure proper estimation of evaporation losses for effective water resource planning and management particularly in drought-prone areas such as Adana. Artificial intelligence approaches can be applied successfully in evaporation calculation. In this research, we used the Standard scaler, power transformer, robust scaler quantile transformer (Uniform) and quantile transformer (Normal), and min-max scaler preprocessing techniques to preprocess the multilayer perceptron neural network (MLPNN). We also trained the MLPNN using unprocessed data and compared it to the results of the preprocessed model. In the setup of the model, daily temperature, pressure, wind, sunny hours, and humidity parameters covering the years 2018-2021 were presented as input to the MLPNN model. Consequently, we pinpoint that all preprocessing approaches produce better outcomes than unscaled. Although all models produced statistically high accuracy predictions according to statistical criteria, the MLPNN model established without transformation (test phase: r2: 0.93, NSE : 0.927, SMAPE: 10.77, RMSE: 1.2, MAE: 0.9) exhibited the lowest accuracy. The evaporation prediction model that was developed using the MLPNN-based standard scalar optimization algorithm exhibited the highest level of accuracy  (test phase: r2: 0.978, NSE: 0.977, SMAPE: 5.93, RMSE: 0.68, MAE: 0.48). Power Transformer (test phase: r2: 0.978, NSE: 0.977, SMAPE: 5.81, RMSE: 0.67, MAE: 0.49) showed second-degree promising results. It can be concluded from these results that the estimation of meteorological variables requires the scaling and presentation of data in a uniform structure. Therefore, improving efficiency and productivity in water management or agricultural processes can be enhanced by making more accurate evaporation estimates.
预处理方法对神经网络性能的影响:地中海气候阿达纳蒸发量案例研究
人工智能(AI)技术在水资源管理领域的应用正在迅速扩大。此外,与传统的统计方法相比,人工神经网络方法的优势在于不需要假设数据和变量的分布。如果有足够多的数据收集和与问题性质相关的标准,就可以使用这些方法。在利用数据之前对其进行预处理,可以提高人工智能模型的性能。蒸发在水资源管理、农业生产过程和土壤科学中非常重要。确保正确估算蒸发损失对于有效的水资源规划和管理至关重要,尤其是在阿达纳等干旱多发地区。人工智能方法可成功应用于蒸发计算。在这项研究中,我们使用了标准标度器、功率变换器、鲁棒性标度器量化变换器(均匀)和量化变换器(正常)以及最小-最大标度器预处理技术,对多层感知器神经网络(MLPNN)进行预处理。我们还使用未经处理的数据对 MLPNN 进行了训练,并将其与预处理模型的结果进行了比较。在模型的设置中,2018-2021 年的日气温、气压、风力、日照时数和湿度参数被作为 MLPNN 模型的输入。因此,我们确定所有预处理方法都比未缩放方法产生更好的结果。虽然根据统计标准,所有模型都产生了较高的预测精度,但未进行转换的 MLPNN 模型(测试阶段:r2:0.93,NSE:0.927,SMAPE:10.77,RMSE:1.2,MAE:0.9)的精度最低。使用基于 MLPNN 的标准标量优化算法开发的蒸发预测模型准确度最高(测试阶段:r2:0.978,NSE:0.977,SMAPE:5.93,RMSE:0.68,MAE:0.48)。电力变压器(测试阶段:r2: 0.978,NSE: 0.977,SMAPE: 5.81,RMSE: 0.67,MAE: 0.49)显示出二级良好结果。从这些结果中可以得出结论,气象变量的估算需要以统一的结构对数据进行缩放和展示。因此,通过更准确地估算蒸发量,可以提高水资源管理或农业生产过程的效率和生产力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信