Comparison of temporal and spatial patterns of water quality parameters in Anzali Wetland (southwest of the Caspian Sea) using Support vector machine model
M. Fallah, A. P. Zefrehei, Seyyed Aliakbar Hedayati, T. Bagheri
{"title":"Comparison of temporal and spatial patterns of water quality parameters in Anzali Wetland (southwest of the Caspian Sea) using Support vector machine model","authors":"M. Fallah, A. P. Zefrehei, Seyyed Aliakbar Hedayati, T. Bagheri","doi":"10.22124/CJES.2021.4500","DOIUrl":null,"url":null,"abstract":"Urgent is growing to have reliable information from the country's water resources. In recent years, data mining models such as artificial neural network (ANN), gene expression programming, Bayesian network, machine algorithms, such as a support vector machine (SVM), and Random Forest have found widespread use in the field of simulation and prediction of components in aquatic ecosystems. Variables vary greatly on water quality parameters (due to nonlinear and complex relationships). Therefore, conventional methods are not eligible to solve water resource quality management problems. The aim of this study was to investigate the possibility of simulating the spatial and temporal alterations in water quality parameters during the period 1985-2014 in Anzali Wetland using a SVM model. Based on principal components analysis (PCA), the parameters EC, TDS, pH and BOD5 were selected for analysis in this study. Spearman correlation was calculated to determine the inputs of the model and the correlation coefficient(CC) between the water quality parameters. According to the results of the correlation table analysis, 8 types of structures including different inputs were used to predict the parameters with machine vector. In the next stage, 70% of the data were used to train, while the rest were used for analyzing the models. Criteria for determination coefficient (R2) and root mean square error (RMSE) were used for evaluation and model performance. The results revealed that in verification stage among different used models, the pH had the highest accuracy (0.95), while the lowest RMSE (0.20). Trend of alterations for optimal model of each parameter on a time scale, indicated an adequate estimation at most points. In general, the results exhibited the appropriate accuracy and acceptable performance of the SVM model in simulating water parameters.","PeriodicalId":9640,"journal":{"name":"caspian journal of environmental sciences","volume":"93 1","pages":"95-104"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"caspian journal of environmental sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22124/CJES.2021.4500","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
引用次数: 10
Abstract
Urgent is growing to have reliable information from the country's water resources. In recent years, data mining models such as artificial neural network (ANN), gene expression programming, Bayesian network, machine algorithms, such as a support vector machine (SVM), and Random Forest have found widespread use in the field of simulation and prediction of components in aquatic ecosystems. Variables vary greatly on water quality parameters (due to nonlinear and complex relationships). Therefore, conventional methods are not eligible to solve water resource quality management problems. The aim of this study was to investigate the possibility of simulating the spatial and temporal alterations in water quality parameters during the period 1985-2014 in Anzali Wetland using a SVM model. Based on principal components analysis (PCA), the parameters EC, TDS, pH and BOD5 were selected for analysis in this study. Spearman correlation was calculated to determine the inputs of the model and the correlation coefficient(CC) between the water quality parameters. According to the results of the correlation table analysis, 8 types of structures including different inputs were used to predict the parameters with machine vector. In the next stage, 70% of the data were used to train, while the rest were used for analyzing the models. Criteria for determination coefficient (R2) and root mean square error (RMSE) were used for evaluation and model performance. The results revealed that in verification stage among different used models, the pH had the highest accuracy (0.95), while the lowest RMSE (0.20). Trend of alterations for optimal model of each parameter on a time scale, indicated an adequate estimation at most points. In general, the results exhibited the appropriate accuracy and acceptable performance of the SVM model in simulating water parameters.