{"title":"Unlocking vegetation health: optimizing GEDI data for accurate chlorophyll content estimation.","authors":"Cuifen Xia, Wenwu Zhou, Qingtai Shu, Zaikun Wu, Mingxing Wang, Li Xu, Zhengdao Yang, Jinge Yu, Hanyue Song, Dandan Duan","doi":"10.3389/fpls.2024.1492560","DOIUrl":null,"url":null,"abstract":"<p><p>Chlorophyll content is a vital indicator for evaluating vegetation health and estimating productivity. This study addresses the issue of Global Ecosystem Dynamics Investigation (GEDI) data discreteness and explores its potential in estimating chlorophyll content. This study used the empirical Bayesian Kriging regression prediction (EBKRP) method to obtain the continuous distribution of GEDI spot parameters in an unknown space. Initially, 52 measured sample data were employed to screen the modeling parameters with the Pearson and RF methods. Next, the Bayesian optimization (BO) algorithm was applied to optimize the KNN regression model, RFR model, and Gradient Boosting Regression Tree (GBRT) model. These steps were taken to establish the most effective RS estimation model for chlorophyll content in <i>Dendrocalamus giganteus</i> (<i>D. giganteus</i>). The results showed that: (1) The <i>R</i> <sup>2</sup> of the EBKRP method was 0.34~0.99, RMSE was 0.012~3,134.005, rRMSE was 0.011~0.854, and CRPS was 965.492~1,626.887. (2) The Pearson method selects five parameters (cover, pai, fhd_normal, rv, and rx_energy_a3) with a correlation greater than 0.37. The RF method opts for five parameters (cover, fhd_normal, sensitivity, rh100, and modis_nonvegetated) with a contribution threshold greater than 5.5%. (3) The BO-GBRT model in the RF method was used as the best estimation model (<i>R</i> <sup>2</sup> = 0.86, RMSE = 0.219 g/m<sup>2</sup>, rRMSE = 0.167 g/m<sup>2</sup>, <i>p</i> = 84.13%) to estimate and map the chlorophyll content of <i>D. giganteus</i> in the study area. The distribution range is 0.20~2.50 g/m<sup>2</sup>. The findings aligned with the distribution of <i>D. giganteus</i> in the experimental area, indicating the reliability of estimating forest biochemical parameters using GEDI data.</p>","PeriodicalId":12632,"journal":{"name":"Frontiers in Plant Science","volume":"15 ","pages":"1492560"},"PeriodicalIF":4.1000,"publicationDate":"2024-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11638747/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Plant Science","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fpls.2024.1492560","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"PLANT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Chlorophyll content is a vital indicator for evaluating vegetation health and estimating productivity. This study addresses the issue of Global Ecosystem Dynamics Investigation (GEDI) data discreteness and explores its potential in estimating chlorophyll content. This study used the empirical Bayesian Kriging regression prediction (EBKRP) method to obtain the continuous distribution of GEDI spot parameters in an unknown space. Initially, 52 measured sample data were employed to screen the modeling parameters with the Pearson and RF methods. Next, the Bayesian optimization (BO) algorithm was applied to optimize the KNN regression model, RFR model, and Gradient Boosting Regression Tree (GBRT) model. These steps were taken to establish the most effective RS estimation model for chlorophyll content in Dendrocalamus giganteus (D. giganteus). The results showed that: (1) The R2 of the EBKRP method was 0.34~0.99, RMSE was 0.012~3,134.005, rRMSE was 0.011~0.854, and CRPS was 965.492~1,626.887. (2) The Pearson method selects five parameters (cover, pai, fhd_normal, rv, and rx_energy_a3) with a correlation greater than 0.37. The RF method opts for five parameters (cover, fhd_normal, sensitivity, rh100, and modis_nonvegetated) with a contribution threshold greater than 5.5%. (3) The BO-GBRT model in the RF method was used as the best estimation model (R2 = 0.86, RMSE = 0.219 g/m2, rRMSE = 0.167 g/m2, p = 84.13%) to estimate and map the chlorophyll content of D. giganteus in the study area. The distribution range is 0.20~2.50 g/m2. The findings aligned with the distribution of D. giganteus in the experimental area, indicating the reliability of estimating forest biochemical parameters using GEDI data.
期刊介绍:
In an ever changing world, plant science is of the utmost importance for securing the future well-being of humankind. Plants provide oxygen, food, feed, fibers, and building materials. In addition, they are a diverse source of industrial and pharmaceutical chemicals. Plants are centrally important to the health of ecosystems, and their understanding is critical for learning how to manage and maintain a sustainable biosphere. Plant science is extremely interdisciplinary, reaching from agricultural science to paleobotany, and molecular physiology to ecology. It uses the latest developments in computer science, optics, molecular biology and genomics to address challenges in model systems, agricultural crops, and ecosystems. Plant science research inquires into the form, function, development, diversity, reproduction, evolution and uses of both higher and lower plants and their interactions with other organisms throughout the biosphere. Frontiers in Plant Science welcomes outstanding contributions in any field of plant science from basic to applied research, from organismal to molecular studies, from single plant analysis to studies of populations and whole ecosystems, and from molecular to biophysical to computational approaches.
Frontiers in Plant Science publishes articles on the most outstanding discoveries across a wide research spectrum of Plant Science. The mission of Frontiers in Plant Science is to bring all relevant Plant Science areas together on a single platform.