{"title":"A multimodal and meta-learning approach for improved estimation of 3D vegetation structure from satellite imagery","authors":"Ram C. Sharma","doi":"10.1007/s12518-025-00619-5","DOIUrl":null,"url":null,"abstract":"<div><p>This research presents a multimodal and meta-learning approach that integrates multi-source satellite sensor and field plot-level data for enhanced retrieval of 3D vegetation structure. Specifically, the combined effect of integrating multispectral data from Landsat 8 OLI and Sentinel-2 MSI with radar data from Sentinel-1 CSAR was examined. For the utilization of multi-source inputs, the synergistic integration was implemented using efficient machine learning regressors—Random Forest Regressor (RFR) and Extreme Gradient Boosting Regressor (GBR)—ensembled within a meta-learning framework. Three meta-model layers—Multiple Linear Regressor (MLR), K-Nearest Neighbors Regressor (KNR), and RFR—were employed and evaluated. As a subroutine of this integration, a model-specific and data-type-specific feature selection method was employed, which involved training each model on a unique subset of features identified through permutation importance. The idea of the multimodal and meta-learning approach was implemented using extensive plot-wise data from diverse forest types in the New England region utilizing a rich dataset comprising spectral, spectral indices, and backscattering characteristics to capture the variability of forest biomass. The efficacy of multiple ensembling strategies was evaluated, specifically ensembling across data types or regressors, as well as meta-learning across both data types and regressors. Ensembling across data types, which leverages the strengths of both spectral and backscattering information, demonstrated a higher predictive ability, achieving an R2 of 0.68 and an RMSE of 54.21 Mg/ha. This was higher than the ensembling strategy across regressors using the same data type, which yielded an R2 of 0.59 and an RMSE of 61.4 Mg/ha. Nevertheless, the multimodal and meta-learning approach, which collectively leverages both data types and machine learning regressors, achieved superior performance, with an R2 of 0.82 and an RMSE of 40.5 Mg/ha. This was significantly greater than a conventional ensemble method, which lacked the meta-layer integration. Additionally, the meta-model layer using RFR yielded better results compared to the KNR or MLR layers, demonstrating the capability of RFR in handling complex interactions across variables. These results highlight the superior accuracy and reliability of the multimodal and meta-learning approach, indicating its substantial potential to enhance precision in ecological monitoring and carbon management.</p></div>","PeriodicalId":46286,"journal":{"name":"Applied Geomatics","volume":"17 2","pages":"255 - 268"},"PeriodicalIF":2.3000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Geomatics","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1007/s12518-025-00619-5","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"REMOTE SENSING","Score":null,"Total":0}
引用次数: 0
Abstract
This research presents a multimodal and meta-learning approach that integrates multi-source satellite sensor and field plot-level data for enhanced retrieval of 3D vegetation structure. Specifically, the combined effect of integrating multispectral data from Landsat 8 OLI and Sentinel-2 MSI with radar data from Sentinel-1 CSAR was examined. For the utilization of multi-source inputs, the synergistic integration was implemented using efficient machine learning regressors—Random Forest Regressor (RFR) and Extreme Gradient Boosting Regressor (GBR)—ensembled within a meta-learning framework. Three meta-model layers—Multiple Linear Regressor (MLR), K-Nearest Neighbors Regressor (KNR), and RFR—were employed and evaluated. As a subroutine of this integration, a model-specific and data-type-specific feature selection method was employed, which involved training each model on a unique subset of features identified through permutation importance. The idea of the multimodal and meta-learning approach was implemented using extensive plot-wise data from diverse forest types in the New England region utilizing a rich dataset comprising spectral, spectral indices, and backscattering characteristics to capture the variability of forest biomass. The efficacy of multiple ensembling strategies was evaluated, specifically ensembling across data types or regressors, as well as meta-learning across both data types and regressors. Ensembling across data types, which leverages the strengths of both spectral and backscattering information, demonstrated a higher predictive ability, achieving an R2 of 0.68 and an RMSE of 54.21 Mg/ha. This was higher than the ensembling strategy across regressors using the same data type, which yielded an R2 of 0.59 and an RMSE of 61.4 Mg/ha. Nevertheless, the multimodal and meta-learning approach, which collectively leverages both data types and machine learning regressors, achieved superior performance, with an R2 of 0.82 and an RMSE of 40.5 Mg/ha. This was significantly greater than a conventional ensemble method, which lacked the meta-layer integration. Additionally, the meta-model layer using RFR yielded better results compared to the KNR or MLR layers, demonstrating the capability of RFR in handling complex interactions across variables. These results highlight the superior accuracy and reliability of the multimodal and meta-learning approach, indicating its substantial potential to enhance precision in ecological monitoring and carbon management.
期刊介绍:
Applied Geomatics (AGMJ) is the official journal of SIFET the Italian Society of Photogrammetry and Topography and covers all aspects and information on scientific and technical advances in the geomatics sciences. The Journal publishes innovative contributions in geomatics applications ranging from the integration of instruments, methodologies and technologies and their use in the environmental sciences, engineering and other natural sciences.
The areas of interest include many research fields such as: remote sensing, close range and videometric photogrammetry, image analysis, digital mapping, land and geographic information systems, geographic information science, integrated geodesy, spatial data analysis, heritage recording; network adjustment and numerical processes. Furthermore, Applied Geomatics is open to articles from all areas of deformation measurements and analysis, structural engineering, mechanical engineering and all trends in earth and planetary survey science and space technology. The Journal also contains notices of conferences and international workshops, industry news, and information on new products. It provides a useful forum for professional and academic scientists involved in geomatics science and technology.
Information on Open Research Funding and Support may be found here: https://www.springernature.com/gp/open-research/institutional-agreements