{"title":"Improving carbon flux estimation in tea plantation ecosystems: A machine learning ensemble approach","authors":"Ali Raza, Yongguang Hu, Yongzong Lu","doi":"10.1016/j.eja.2024.127297","DOIUrl":null,"url":null,"abstract":"<div><p>Tea plant (<em>Camellia sinensis</em>) is a major global crop consumed as a drink after water. Quantifying carbon flux, specifically the net ecosystem exchange (NEE), in tea plantations is essential for determining carbon sequestration and ecosystem carbon balance. The Eddy covariance (EC) system is widely used for continuous monitoring of carbon flux but high costs associated with installation and maintenance limit its widespread adoption. In addition, EC flux data is often discarded due to malfunction of instruments caused by adverse weather conditions. Therefore, additional approaches for estimating NEE are necessary to overcome these challenges and ensure accurate NEE measurement. For this purpose, three standalone tree-based machine learning (ML) models were used for NEE estimation using EC flux data collected from tea ecosystem located in subtropical region (Danyang county of Zhenjiang city) of China. To address the accuracy limitations inherent in standalone ML models, the ensemble mechanism based on voting regressor method was proposed. In addition, k-fold cross-validation based on early stopping process was also used to enhance the performance of standalone ML models. Based on visual plots (scatter diagram, heatMap, Taylor diagram) and performing indices (root-mean-square error (RMSE), determination coefficient (R<sup>2</sup>), mean absolute error (MAE), mean absolute percentage error (MAPE), Nash–Sutcliffe efficiency (NSE), correlation coefficient (r), Kling Gupta Efficiency (KGE) and index of agreement (d)), the findings indicated that non-linear ensemble-generalized regression neural network (NLE-GRNN) significantly improved standalone ML model's results. In current study, the highest NSE, r and d in case of standalone ML model (DT) achieved 0.49, 0.73 and 0.75 respectively while our proposed NLE-GRNN model improved 48 % in NSE value (NSE = 0.97), 25 % in r value (r = 0.98) and 24 % in d value (d = 0.99). Likewise, NLE-GRNN significantly reduce errors (MAE, MAPE and RMSE) and provides NEE estimate closet to the observed value. The impact of climatic variables on NEE using shapley additive explanations (SHAP) analysis revealed that Rg (solar radiation) and Tair (air temperature) were the prime factors controlling NEE variation in the tea ecosystem. Considering the high accuracy and stability of the studied ML models, it is recommended to apply developed ensemble ML model (NLE-GRNN) for significant improvement of NEE estimate in the tea biomes or other ecosystems.</p></div>","PeriodicalId":51045,"journal":{"name":"European Journal of Agronomy","volume":"160 ","pages":"Article 127297"},"PeriodicalIF":4.5000,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Agronomy","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1161030124002181","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
Abstract
Tea plant (Camellia sinensis) is a major global crop consumed as a drink after water. Quantifying carbon flux, specifically the net ecosystem exchange (NEE), in tea plantations is essential for determining carbon sequestration and ecosystem carbon balance. The Eddy covariance (EC) system is widely used for continuous monitoring of carbon flux but high costs associated with installation and maintenance limit its widespread adoption. In addition, EC flux data is often discarded due to malfunction of instruments caused by adverse weather conditions. Therefore, additional approaches for estimating NEE are necessary to overcome these challenges and ensure accurate NEE measurement. For this purpose, three standalone tree-based machine learning (ML) models were used for NEE estimation using EC flux data collected from tea ecosystem located in subtropical region (Danyang county of Zhenjiang city) of China. To address the accuracy limitations inherent in standalone ML models, the ensemble mechanism based on voting regressor method was proposed. In addition, k-fold cross-validation based on early stopping process was also used to enhance the performance of standalone ML models. Based on visual plots (scatter diagram, heatMap, Taylor diagram) and performing indices (root-mean-square error (RMSE), determination coefficient (R2), mean absolute error (MAE), mean absolute percentage error (MAPE), Nash–Sutcliffe efficiency (NSE), correlation coefficient (r), Kling Gupta Efficiency (KGE) and index of agreement (d)), the findings indicated that non-linear ensemble-generalized regression neural network (NLE-GRNN) significantly improved standalone ML model's results. In current study, the highest NSE, r and d in case of standalone ML model (DT) achieved 0.49, 0.73 and 0.75 respectively while our proposed NLE-GRNN model improved 48 % in NSE value (NSE = 0.97), 25 % in r value (r = 0.98) and 24 % in d value (d = 0.99). Likewise, NLE-GRNN significantly reduce errors (MAE, MAPE and RMSE) and provides NEE estimate closet to the observed value. The impact of climatic variables on NEE using shapley additive explanations (SHAP) analysis revealed that Rg (solar radiation) and Tair (air temperature) were the prime factors controlling NEE variation in the tea ecosystem. Considering the high accuracy and stability of the studied ML models, it is recommended to apply developed ensemble ML model (NLE-GRNN) for significant improvement of NEE estimate in the tea biomes or other ecosystems.
期刊介绍:
The European Journal of Agronomy, the official journal of the European Society for Agronomy, publishes original research papers reporting experimental and theoretical contributions to field-based agronomy and crop science. The journal will consider research at the field level for agricultural, horticultural and tree crops, that uses comprehensive and explanatory approaches. The EJA covers the following topics:
crop physiology
crop production and management including irrigation, fertilization and soil management
agroclimatology and modelling
plant-soil relationships
crop quality and post-harvest physiology
farming and cropping systems
agroecosystems and the environment
crop-weed interactions and management
organic farming
horticultural crops
papers from the European Society for Agronomy bi-annual meetings
In determining the suitability of submitted articles for publication, particular scrutiny is placed on the degree of novelty and significance of the research and the extent to which it adds to existing knowledge in agronomy.