A temporal-spatial deep learning network for winter wheat mapping using time-series Sentinel-2 imagery
Lingling Fan, Lang Xia, Jing Yang, Xiao Sun, Shangrong Wu, Bingwen Qiu, Jin Chen, Wenbin Wu, Peng Yang
ISPRS Journal of Photogrammetry and Remote Sensing, Volume 214, Pages 48-64. Published 2024-06-14.
DOI: 10.1016/j.isprsjprs.2024.06.005
URL: https://www.sciencedirect.com/science/article/pii/S0924271624002417
Impact factor: 12.2 (JCR Q1, Geography, Physical)
Citations: 0
Abstract
Accurate mapping of winter wheat provides essential information for food security and ecosystem protection. Deep learning approaches have achieved promising crop discrimination performance based on multitemporal satellite imagery. However, due to the high dimensionality of the data, sequential relations, and complex semantic information in time-series imagery, effective methods that can automatically capture temporal-spatial features with high separability and generalizability have received less attention. In this study, we propose a U-shaped CNN-Transformer hybrid framework based on an attention mechanism, named the U-Temporal-Spatial-Transformer network (UTS-Former), for winter wheat mapping using Sentinel-2 imagery. The model includes an “encoder-decoder” structure for multiscale information mining of time-series images and a temporal-spatial transformer (TST) module for learning comprehensive temporal sequence features and spatial semantic information. Results from two study areas indicate that UTS-Former achieved the best accuracy, with a mean Matthews correlation coefficient (MCC) of 0.928 and an F1-score of 0.950, and that it also outperformed other popular time-series methods across different band combinations. Using only the RGB bands, the MCC of UTS-Former decreased by 4.53% relative to the MCC obtained with all bands (MCC/All), whereas it decreased by 13.36% and 35.02% for UNet2d-LSTM and CNN-BiLSTM, respectively. The comparison demonstrates that UTS-Former captures more global temporal-spatial information from winter wheat fields and achieves greater precision in local details than the other methods, resulting in high-quality mapping. An analysis of attention scores for the available acquisition dates revealed significant contributions from images at both the beginning and the end of the growth period, which is valuable for selecting appropriate image dates. These findings suggest that the proposed approach has great potential for accurate, cost-effective, and high-quality winter wheat mapping.
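The two accuracy measures quoted in the abstract, MCC and F1-score, can be illustrated with a minimal Python sketch. It assumes a binary labelling (1 = winter wheat, 0 = other), which the abstract does not state explicitly, and all function names are hypothetical. The helper `relative_decrease_percent` reflects one plausible reading of the reported MCC decreases (a drop relative to the all-band MCC); the paper's exact definition may differ.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels (assumed: 1 = winter wheat, 0 = other)."""
    tp = tn = fp = fn = 0
    for t, p in zip(y_true, y_pred):
        if t == 1 and p == 1:
            tp += 1
        elif t == 0 and p == 0:
            tn += 1
        elif t == 0 and p == 1:
            fp += 1
        else:
            fn += 1
    return tp, tn, fp, fn


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient; returns 0.0 when the denominator is zero."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


def f1_score(tp, fp, fn):
    """F1-score, the harmonic mean of precision and recall."""
    denom = 2 * tp + fp + fn
    return 0.0 if denom == 0 else 2 * tp / denom


def relative_decrease_percent(reference, value):
    """Percentage drop of `value` relative to `reference` (assumed interpretation)."""
    return 100.0 * (reference - value) / reference


# Toy labels for illustration only; these are not data from the paper.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]
tp, tn, fp, fn = confusion_counts(y_true, y_pred)
print(mcc(tp, tn, fp, fn), f1_score(tp, fp, fn))
```

Unlike overall accuracy, MCC uses all four cells of the confusion matrix, which makes it informative when wheat and non-wheat pixels are imbalanced.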
About the journal:
The ISPRS Journal of Photogrammetry and Remote Sensing (P&RS) serves as the official journal of the International Society for Photogrammetry and Remote Sensing (ISPRS). It acts as a platform for scientists and professionals worldwide who are involved in various disciplines that utilize photogrammetry, remote sensing, spatial information systems, computer vision, and related fields. The journal aims to facilitate communication and dissemination of advancements in these disciplines, while also acting as a comprehensive source of reference and archive.
P&RS endeavors to publish high-quality, peer-reviewed research papers that are preferably original and have not been published before. These papers can cover scientific/research, technological development, or application/practical aspects. Additionally, the journal welcomes papers that are based on presentations from ISPRS meetings, as long as they are considered significant contributions to the aforementioned fields.
In particular, P&RS encourages the submission of papers that are of broad scientific interest, showcase innovative applications (especially in emerging fields), have an interdisciplinary focus, discuss topics that have received limited attention in P&RS or related journals, or explore new directions in scientific or professional realms. It is preferred that theoretical papers include practical applications, while papers focusing on systems and applications should include a theoretical background.