Prediction of carbon dioxide emissions from Atlantic Canadian potato fields using advanced hybridized machine learning algorithms – Nexus of field data and modelling
{"title":"Prediction of carbon dioxide emissions from Atlantic Canadian potato fields using advanced hybridized machine learning algorithms – Nexus of field data and modelling","authors":"","doi":"10.1016/j.atech.2024.100559","DOIUrl":null,"url":null,"abstract":"<div><p>In this study, three novel machine learning algorithms of additive regression-random forest (AR-RF), Iterative Classifier Optimizer (ICO-AR-RF), and multi-scheme (MS-RF) were explored for carbon dioxide (CO<sub>2</sub>) flux rate prediction from three agricultural fields. To build the dataset, 401 samples were collected from two fields in Prince Edward Island (PEI) and 122 samples from the New Brunswick (NB), Canada. In addition, soil moisture (SM), temperature (ST), and electrical conductivity (EC), alongside eight climatic variables including wind speed (WS), solar radiation (SR), relative humidity (RH), precipitation (PCP), air temperature (AT), dew point (DP), vapour pressure difference (VPD) and reference evapotranspiration (ET<sub>o</sub>) were also collected. Greedy stepwise (GS) approach was implemented for feature selection. Finally, different qualitative (scatter plot, line graph, Taylor diagram, box plot, and Rug plot), and quantitative (uncertainty analysis, root mean square error (RMSE), percent of BIAS (PBIAS), Nash Sutcliff efficiency (NSE) and RMSE-observations standard deviation ratio (RSR)) techniques were used for model evaluation and comparison. Results of feature selection approaches revealed that DP, AT, SM, and ST are the four most effective variables at CO<sub>2</sub> prediction in PEI, while AT, RH, DP, and ST are the most effective in the NB study area. For optimum input scenario, the GS algorithm was applied, and results showed that a combination of DP, AT, ST, SM, and ET<sub>o</sub> was the best for the PEI study area, while for NB, all input variables should be involved. Our analysis, for prediction of CO<sub>2</sub> fluxes, confirmed that the ICO-AR-RF model performed the best at both PEI (RMSE=0.70, NSE=0.76, PBIAS=-5.11, RSR=0.48) and NB (RMSE=0.74, NSE=0.75, PBIAS=3.23, RSR=0.50), followed by MS-RF and AR-RF. Uncertainty analysis showed that CO<sub>2</sub> prediction is more sensitive to input scenario selection than models in both study areas. Results revealed that climatic variables are more effective in CO<sub>2</sub> prediction than soil characteristics and the developed hybrid model ICO-AR-RF can be a promising tool for decision-makers and beneficial for stakeholders.</p></div>","PeriodicalId":74813,"journal":{"name":"Smart agricultural technology","volume":null,"pages":null},"PeriodicalIF":6.3000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772375524001643/pdfft?md5=b1067ceb74ec6307b3844c44064c8b87&pid=1-s2.0-S2772375524001643-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart agricultural technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772375524001643","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
In this study, three novel machine learning algorithms of additive regression-random forest (AR-RF), Iterative Classifier Optimizer (ICO-AR-RF), and multi-scheme (MS-RF) were explored for carbon dioxide (CO2) flux rate prediction from three agricultural fields. To build the dataset, 401 samples were collected from two fields in Prince Edward Island (PEI) and 122 samples from the New Brunswick (NB), Canada. In addition, soil moisture (SM), temperature (ST), and electrical conductivity (EC), alongside eight climatic variables including wind speed (WS), solar radiation (SR), relative humidity (RH), precipitation (PCP), air temperature (AT), dew point (DP), vapour pressure difference (VPD) and reference evapotranspiration (ETo) were also collected. Greedy stepwise (GS) approach was implemented for feature selection. Finally, different qualitative (scatter plot, line graph, Taylor diagram, box plot, and Rug plot), and quantitative (uncertainty analysis, root mean square error (RMSE), percent of BIAS (PBIAS), Nash Sutcliff efficiency (NSE) and RMSE-observations standard deviation ratio (RSR)) techniques were used for model evaluation and comparison. Results of feature selection approaches revealed that DP, AT, SM, and ST are the four most effective variables at CO2 prediction in PEI, while AT, RH, DP, and ST are the most effective in the NB study area. For optimum input scenario, the GS algorithm was applied, and results showed that a combination of DP, AT, ST, SM, and ETo was the best for the PEI study area, while for NB, all input variables should be involved. Our analysis, for prediction of CO2 fluxes, confirmed that the ICO-AR-RF model performed the best at both PEI (RMSE=0.70, NSE=0.76, PBIAS=-5.11, RSR=0.48) and NB (RMSE=0.74, NSE=0.75, PBIAS=3.23, RSR=0.50), followed by MS-RF and AR-RF. Uncertainty analysis showed that CO2 prediction is more sensitive to input scenario selection than models in both study areas. Results revealed that climatic variables are more effective in CO2 prediction than soil characteristics and the developed hybrid model ICO-AR-RF can be a promising tool for decision-makers and beneficial for stakeholders.