A deep learning framework for mapping evergreen conifer fractional cover at 30 m resolution using fused bi-temporal WorldView and time-series Landsat imagery in mixed mountain forests
Xiao Zhu , Tiejun Wang , Andrew K. Skidmore , Isla Duporge
{"title":"A deep learning framework for mapping evergreen conifer fractional cover at 30 m resolution using fused bi-temporal WorldView and time-series Landsat imagery in mixed mountain forests","authors":"Xiao Zhu , Tiejun Wang , Andrew K. Skidmore , Isla Duporge","doi":"10.1016/j.rse.2025.115055","DOIUrl":null,"url":null,"abstract":"<div><div>Evergreen conifers are key components of temperate broadleaf and mixed forests, playing a significant role in shaping ecosystem structure, function, and resilience to climate change. While very high-resolution (VHR) satellite imagery enables accurate classification of evergreen conifers and creation of reference fractional cover maps, scaling this capability to regional levels using coarser-resolution time-series satellite data remains challenging. Traditional machine learning approaches are limited by their inability to fully exploit the spatial detail of VHR imagery and capture sequential patterns in satellite time series. To address these limitations, we developed a deep learning-based framework for mapping evergreen conifer fractional cover at 30 m resolution in mountainous forests. The framework integrates a 3D U-Net model to extract spatial and spectral features from bi-temporal WorldView imagery—while mitigating terrain shadows—and a long short-term memory (LSTM) network to learn sequential dependencies from Landsat time series for regression. We compared our framework against a random forest baseline. Independent spatial and temporal transferability assessments showed that our approach achieved an R<sup>2</sup> of 0.71 and an RMSE of 0.14, outperforming the benchmark method. To further interpret the spatial predictions, we quantified the spatial configuration of evergreen conifers using landscape metrics across areas with varying conifer cover. Our findings demonstrate the value of combining multi-source, multi-resolution imagery with deep learning models tailored for spatial and temporal complexity. This framework improves the accuracy and transferability of fractional cover mapping and offers a scalable solution for ecosystem monitoring in topographically complex forested landscapes.</div></div>","PeriodicalId":417,"journal":{"name":"Remote Sensing of Environment","volume":"331 ","pages":"Article 115055"},"PeriodicalIF":11.4000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing of Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0034425725004596","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Evergreen conifers are key components of temperate broadleaf and mixed forests, playing a significant role in shaping ecosystem structure, function, and resilience to climate change. While very high-resolution (VHR) satellite imagery enables accurate classification of evergreen conifers and creation of reference fractional cover maps, scaling this capability to regional levels using coarser-resolution time-series satellite data remains challenging. Traditional machine learning approaches are limited by their inability to fully exploit the spatial detail of VHR imagery and capture sequential patterns in satellite time series. To address these limitations, we developed a deep learning-based framework for mapping evergreen conifer fractional cover at 30 m resolution in mountainous forests. The framework integrates a 3D U-Net model to extract spatial and spectral features from bi-temporal WorldView imagery—while mitigating terrain shadows—and a long short-term memory (LSTM) network to learn sequential dependencies from Landsat time series for regression. We compared our framework against a random forest baseline. Independent spatial and temporal transferability assessments showed that our approach achieved an R2 of 0.71 and an RMSE of 0.14, outperforming the benchmark method. To further interpret the spatial predictions, we quantified the spatial configuration of evergreen conifers using landscape metrics across areas with varying conifer cover. Our findings demonstrate the value of combining multi-source, multi-resolution imagery with deep learning models tailored for spatial and temporal complexity. This framework improves the accuracy and transferability of fractional cover mapping and offers a scalable solution for ecosystem monitoring in topographically complex forested landscapes.
期刊介绍:
Remote Sensing of Environment (RSE) serves the Earth observation community by disseminating results on the theory, science, applications, and technology that contribute to advancing the field of remote sensing. With a thoroughly interdisciplinary approach, RSE encompasses terrestrial, oceanic, and atmospheric sensing.
The journal emphasizes biophysical and quantitative approaches to remote sensing at local to global scales, covering a diverse range of applications and techniques.
RSE serves as a vital platform for the exchange of knowledge and advancements in the dynamic field of remote sensing.