Prediction of carbon dioxide emissions from Atlantic Canadian potato fields using advanced hybridized machine learning algorithms – Nexus of field data and modelling

IF 6.3 Q1 AGRICULTURAL ENGINEERING
{"title":"Prediction of carbon dioxide emissions from Atlantic Canadian potato fields using advanced hybridized machine learning algorithms – Nexus of field data and modelling","authors":"","doi":"10.1016/j.atech.2024.100559","DOIUrl":null,"url":null,"abstract":"<div><p>In this study, three novel machine learning algorithms of additive regression-random forest (AR-RF), Iterative Classifier Optimizer (ICO-AR-RF), and multi-scheme (MS-RF) were explored for carbon dioxide (CO<sub>2</sub>) flux rate prediction from three agricultural fields. To build the dataset, 401 samples were collected from two fields in Prince Edward Island (PEI) and 122 samples from the New Brunswick (NB), Canada. In addition, soil moisture (SM), temperature (ST), and electrical conductivity (EC), alongside eight climatic variables including wind speed (WS), solar radiation (SR), relative humidity (RH), precipitation (PCP), air temperature (AT), dew point (DP), vapour pressure difference (VPD) and reference evapotranspiration (ET<sub>o</sub>) were also collected. Greedy stepwise (GS) approach was implemented for feature selection. Finally, different qualitative (scatter plot, line graph, Taylor diagram, box plot, and Rug plot), and quantitative (uncertainty analysis, root mean square error (RMSE), percent of BIAS (PBIAS), Nash Sutcliff efficiency (NSE) and RMSE-observations standard deviation ratio (RSR)) techniques were used for model evaluation and comparison. Results of feature selection approaches revealed that DP, AT, SM, and ST are the four most effective variables at CO<sub>2</sub> prediction in PEI, while AT, RH, DP, and ST are the most effective in the NB study area. For optimum input scenario, the GS algorithm was applied, and results showed that a combination of DP, AT, ST, SM, and ET<sub>o</sub> was the best for the PEI study area, while for NB, all input variables should be involved. Our analysis, for prediction of CO<sub>2</sub> fluxes, confirmed that the ICO-AR-RF model performed the best at both PEI (RMSE=0.70, NSE=0.76, PBIAS=-5.11, RSR=0.48) and NB (RMSE=0.74, NSE=0.75, PBIAS=3.23, RSR=0.50), followed by MS-RF and AR-RF. Uncertainty analysis showed that CO<sub>2</sub> prediction is more sensitive to input scenario selection than models in both study areas. Results revealed that climatic variables are more effective in CO<sub>2</sub> prediction than soil characteristics and the developed hybrid model ICO-AR-RF can be a promising tool for decision-makers and beneficial for stakeholders.</p></div>","PeriodicalId":74813,"journal":{"name":"Smart agricultural technology","volume":null,"pages":null},"PeriodicalIF":6.3000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772375524001643/pdfft?md5=b1067ceb74ec6307b3844c44064c8b87&pid=1-s2.0-S2772375524001643-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart agricultural technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772375524001643","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0

Abstract

In this study, three novel machine learning algorithms of additive regression-random forest (AR-RF), Iterative Classifier Optimizer (ICO-AR-RF), and multi-scheme (MS-RF) were explored for carbon dioxide (CO2) flux rate prediction from three agricultural fields. To build the dataset, 401 samples were collected from two fields in Prince Edward Island (PEI) and 122 samples from the New Brunswick (NB), Canada. In addition, soil moisture (SM), temperature (ST), and electrical conductivity (EC), alongside eight climatic variables including wind speed (WS), solar radiation (SR), relative humidity (RH), precipitation (PCP), air temperature (AT), dew point (DP), vapour pressure difference (VPD) and reference evapotranspiration (ETo) were also collected. Greedy stepwise (GS) approach was implemented for feature selection. Finally, different qualitative (scatter plot, line graph, Taylor diagram, box plot, and Rug plot), and quantitative (uncertainty analysis, root mean square error (RMSE), percent of BIAS (PBIAS), Nash Sutcliff efficiency (NSE) and RMSE-observations standard deviation ratio (RSR)) techniques were used for model evaluation and comparison. Results of feature selection approaches revealed that DP, AT, SM, and ST are the four most effective variables at CO2 prediction in PEI, while AT, RH, DP, and ST are the most effective in the NB study area. For optimum input scenario, the GS algorithm was applied, and results showed that a combination of DP, AT, ST, SM, and ETo was the best for the PEI study area, while for NB, all input variables should be involved. Our analysis, for prediction of CO2 fluxes, confirmed that the ICO-AR-RF model performed the best at both PEI (RMSE=0.70, NSE=0.76, PBIAS=-5.11, RSR=0.48) and NB (RMSE=0.74, NSE=0.75, PBIAS=3.23, RSR=0.50), followed by MS-RF and AR-RF. Uncertainty analysis showed that CO2 prediction is more sensitive to input scenario selection than models in both study areas. Results revealed that climatic variables are more effective in CO2 prediction than soil characteristics and the developed hybrid model ICO-AR-RF can be a promising tool for decision-makers and beneficial for stakeholders.

Abstract Image

利用先进的混合机器学习算法预测加拿大大西洋马铃薯田的二氧化碳排放量--田间数据与建模的结合
本研究探索了三种新型机器学习算法,即加法回归-随机森林(AR-RF)、迭代分类优化器(ICO-AR-RF)和多方案(MS-RF),用于预测三块农田的二氧化碳(CO2)通量率。为建立数据集,从爱德华王子岛(PEI)的两块田地采集了 401 个样本,从加拿大新不伦瑞克(NB)采集了 122 个样本。此外,还收集了土壤湿度 (SM)、温度 (ST) 和导电率 (EC) 以及八个气候变量,包括风速 (WS)、太阳辐射 (SR)、相对湿度 (RH)、降水量 (PCP)、气温 (AT)、露点 (DP)、蒸汽压差 (VPD) 和参考蒸散量 (ETo)。采用贪婪逐步法(GS)进行特征选择。最后,采用了不同的定性(散点图、折线图、泰勒图、方框图和鲁格图)和定量(不确定性分析、均方根误差(RMSE)、BIAS 百分比(PBIAS)、纳什-苏特克利夫效率(NSE)和 RMSE-观测值标准偏差比(RSR))技术对模型进行评估和比较。特征选择方法的结果表明,DP、AT、SM 和 ST 是 PEI 预测二氧化碳最有效的四个变量,而 AT、RH、DP 和 ST 则是 NB 研究区最有效的变量。对于最佳输入方案,应用了 GS 算法,结果显示 DP、AT、ST、SM 和 ETo 的组合对于 PEI 研究区域是最佳的,而对于 NB,所有输入变量都应参与。我们对二氧化碳通量的预测分析表明,ICO-AR-RF 模型在 PEI(RMSE=0.70,NSE=0.76,PBIAS=-5.11,RSR=0.48)和 NB(RMSE=0.74,NSE=0.75,PBIAS=3.23,RSR=0.50)的表现最好,其次是 MS-RF 和 AR-RF。不确定性分析表明,在这两个研究地区,二氧化碳预测对输入情景选择的敏感性高于模型。结果表明,气候变量比土壤特性对二氧化碳预测更有效,所开发的 ICO-AR-RF 混合模型可成为决策者的有效工具,并为利益相关者带来益处。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.20
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信