{"title":"GF-2 Remote Sensing-Based Winter Wheat Extraction With Multitask Learning Vision Transformer","authors":"Zhihao Zhao;Zihan Liu;Heng Luo;Hui Yang;Biao Wang;Yixin Jiang;Yanqi Liu;Yanlan Wu","doi":"10.1109/JSTARS.2025.3564680","DOIUrl":null,"url":null,"abstract":"Accurate mapping of winter wheat is essential for the advancement of precision agriculture and food security. However, classical semantic segmentation models frequently encounter difficulties in precise edge extraction, omission, and classification due to the presence of dense distributions and intraclass diversity. This study proposes a novel method for the extraction of winter wheat from remote sensing data using the GF-2 satellite. The method incorporates a multitask learning framework-Vision Transformer-based model (namely MCFormer) that combines semantic segmentation and boundary detection. Furthermore, the normalized difference vegetation index (NDVI) and land surface temperature (LST) derived from Landsat 8 images was included to enhance the representation of winter wheat's spectral characteristics. The method is evaluated in comparison to frequently used U-Net-, SegNet-, SegFormer-, and MANet-based winter wheat extraction methods in northern Anhui Province. The results indicate that the MCFormer-based method achieves the intersection over union (IoU), F1 score, recall, precision and overall accuracy (OA) of 0.9790, 0.9893, 0.9953, 0.9835, and 0.9900, respectively, outperforming the U-Net-, SegNet-, SegFormer-, and MANet-based methods. The incorporation of multitask learning with NDVI and LST data has been demonstrated to enhance several key performance metrics, including improvements in the IoU, F1 score, recall, precision, and OA by 5.95%, 3.65%, 3.75%, 2.79%, and 2.24%, respectively. Our proposed approach improves the accuracy of winter wheat extraction from remote sensing images, which has the potential to facilitate precision agriculture and enhance food security.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"18 ","pages":"12454-12469"},"PeriodicalIF":4.7000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10979367","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10979367/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate mapping of winter wheat is essential for the advancement of precision agriculture and food security. However, classical semantic segmentation models frequently encounter difficulties in precise edge extraction, omission, and classification due to the presence of dense distributions and intraclass diversity. This study proposes a novel method for the extraction of winter wheat from remote sensing data using the GF-2 satellite. The method incorporates a multitask learning framework-Vision Transformer-based model (namely MCFormer) that combines semantic segmentation and boundary detection. Furthermore, the normalized difference vegetation index (NDVI) and land surface temperature (LST) derived from Landsat 8 images was included to enhance the representation of winter wheat's spectral characteristics. The method is evaluated in comparison to frequently used U-Net-, SegNet-, SegFormer-, and MANet-based winter wheat extraction methods in northern Anhui Province. The results indicate that the MCFormer-based method achieves the intersection over union (IoU), F1 score, recall, precision and overall accuracy (OA) of 0.9790, 0.9893, 0.9953, 0.9835, and 0.9900, respectively, outperforming the U-Net-, SegNet-, SegFormer-, and MANet-based methods. The incorporation of multitask learning with NDVI and LST data has been demonstrated to enhance several key performance metrics, including improvements in the IoU, F1 score, recall, precision, and OA by 5.95%, 3.65%, 3.75%, 2.79%, and 2.24%, respectively. Our proposed approach improves the accuracy of winter wheat extraction from remote sensing images, which has the potential to facilitate precision agriculture and enhance food security.
期刊介绍:
The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.