A transformer-based model for detecting land surface phenology from the irregular harmonized Landsat and Sentinel-2 time series across the United States
Khuong H. Tran , Xiaoyang Zhang , Hankui K. Zhang , Yu Shen , Yongchang Ye , Yuxia Liu , Shuai Gao , Shuai An
{"title":"A transformer-based model for detecting land surface phenology from the irregular harmonized Landsat and Sentinel-2 time series across the United States","authors":"Khuong H. Tran , Xiaoyang Zhang , Hankui K. Zhang , Yu Shen , Yongchang Ye , Yuxia Liu , Shuai Gao , Shuai An","doi":"10.1016/j.rse.2025.114656","DOIUrl":null,"url":null,"abstract":"<div><div>Land surface phenology (LSP) has been widely generated using traditional methods of fitting satellite-based time series of vegetation indices over the past two decades. However, these methods are highly vulnerable to the presence of temporal gaps and the use of specific smoothing or gap-filling algorithms. Several attempts have recently used Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) to detect phenology, which produce an uncertainty of larger than three weeks and as large as two months. Further, these deep learning methods cannot handle missing data in time series and still need temporal gap-filling, which results in low to moderate accuracy in phenology detection. This study proposed a novel Transformer-based model to detect 30 m phenological events across the United States for the years 2019 and 2020. Specifically, we adapted the Transformer architecture to handle irregular time series and capture long-range relationships among all high-quality observations in the satellite time series. The Transformer-based model was trained using reference time series samples extracted from the high-quality LSP product, which was recently produced by fusing the HLS (Harmonized Landsat and Sentinel-2) observations with near-surface PhenoCam time series (HLS-PhenoCam LSP). The accuracy of the Transformer-based model for LSP detection using the two-band Enhanced Vegetation Index (EVI2) was investigated and compared with the most widely used Hybrid Piecewise Logistic Model based Land Surface Phenology Detection (HPLM-LSPD) algorithm and one dimensional (1D) CNN model. The phenological patterns detected from the Transformer-based model and the HPLM-LSPD algorithm were mostly comparable, despite the occasional differences in magnitude and local details when compared to the reference HLS-PhenoCam LSP product. The accuracy metrics indicated that the Transformer-based model produced overall higher accuracy than the HPLM-LSPD algorithm and the 1D-CNN model, with a correlation coefficient (R) of 0.74–0.88, a mean absolute difference (MAD) of 9.6–15 days, a root mean squared error (RMSE) of 13.5–20.6 days, and a mean systematic bias (MSB) of 0.7–4.7 days. The statistical analyses also showed that the Transformer-based model outperformed the HPLM-LSPD algorithm across all vegetation types and the HLS time series with different numbers of high-quality observations. Further, the accuracy of the Transformer-based model is contingent upon the proportion of high-quality observations. The model could achieve consistently high accuracy if the high-quality observations exceed 25 %. This study paved an effective way from traditional methodologies to machine learning methods for detecting phenological transition dates of vegetation development across various ecosystems.</div></div>","PeriodicalId":417,"journal":{"name":"Remote Sensing of Environment","volume":"320 ","pages":"Article 114656"},"PeriodicalIF":11.1000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing of Environment","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0034425725000604","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Land surface phenology (LSP) has been widely generated using traditional methods of fitting satellite-based time series of vegetation indices over the past two decades. However, these methods are highly vulnerable to the presence of temporal gaps and the use of specific smoothing or gap-filling algorithms. Several attempts have recently used Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) to detect phenology, which produce an uncertainty of larger than three weeks and as large as two months. Further, these deep learning methods cannot handle missing data in time series and still need temporal gap-filling, which results in low to moderate accuracy in phenology detection. This study proposed a novel Transformer-based model to detect 30 m phenological events across the United States for the years 2019 and 2020. Specifically, we adapted the Transformer architecture to handle irregular time series and capture long-range relationships among all high-quality observations in the satellite time series. The Transformer-based model was trained using reference time series samples extracted from the high-quality LSP product, which was recently produced by fusing the HLS (Harmonized Landsat and Sentinel-2) observations with near-surface PhenoCam time series (HLS-PhenoCam LSP). The accuracy of the Transformer-based model for LSP detection using the two-band Enhanced Vegetation Index (EVI2) was investigated and compared with the most widely used Hybrid Piecewise Logistic Model based Land Surface Phenology Detection (HPLM-LSPD) algorithm and one dimensional (1D) CNN model. The phenological patterns detected from the Transformer-based model and the HPLM-LSPD algorithm were mostly comparable, despite the occasional differences in magnitude and local details when compared to the reference HLS-PhenoCam LSP product. The accuracy metrics indicated that the Transformer-based model produced overall higher accuracy than the HPLM-LSPD algorithm and the 1D-CNN model, with a correlation coefficient (R) of 0.74–0.88, a mean absolute difference (MAD) of 9.6–15 days, a root mean squared error (RMSE) of 13.5–20.6 days, and a mean systematic bias (MSB) of 0.7–4.7 days. The statistical analyses also showed that the Transformer-based model outperformed the HPLM-LSPD algorithm across all vegetation types and the HLS time series with different numbers of high-quality observations. Further, the accuracy of the Transformer-based model is contingent upon the proportion of high-quality observations. The model could achieve consistently high accuracy if the high-quality observations exceed 25 %. This study paved an effective way from traditional methodologies to machine learning methods for detecting phenological transition dates of vegetation development across various ecosystems.
期刊介绍:
Remote Sensing of Environment (RSE) serves the Earth observation community by disseminating results on the theory, science, applications, and technology that contribute to advancing the field of remote sensing. With a thoroughly interdisciplinary approach, RSE encompasses terrestrial, oceanic, and atmospheric sensing.
The journal emphasizes biophysical and quantitative approaches to remote sensing at local to global scales, covering a diverse range of applications and techniques.
RSE serves as a vital platform for the exchange of knowledge and advancements in the dynamic field of remote sensing.