抽样方法与模型结构的结合：基于Sentinel-1图像的数据驱动机器学习水稻制图的关键因素

IF 4.7 2区地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Pub Date : 2025-03-11 DOI:10.1109/JSTARS.2025.3550109

Pengliang Wei;Jiao Guo;Jiaqian Lian;Chaoyang Wang

{"title":"抽样方法与模型结构的结合：基于Sentinel-1图像的数据驱动机器学习水稻制图的关键因素","authors":"Pengliang Wei;Jiao Guo;Jiaqian Lian;Chaoyang Wang","doi":"10.1109/JSTARS.2025.3550109","DOIUrl":null,"url":null,"abstract":"Agricultural remote sensing community is increasingly focusing on enhancing crop mapping accuracy by improving data-driven machine-learning model structures, yet ignoring impact of sampling–model structure combination on it, which may prevent full utilization of input data, especially for synthetic aperture radar images with fewer crop prior features. Consequently, this article took rice as target crop, and systematically performed rice mapping experiments based on Sentinel-1 images to assess mapping accuracies, model learning results, and model uncertainty under different sampling–model structures combinations. The sampling methods included pixel sampling in buffer or nonbuffer mode with equal proportion and equal quantity (pixel sample), as well as panoramic information sampling (image sample). The included model structures mainly focused on the models commonly used in rice mapping [i.e., Random Forest (RF) and Unet as traditional pixel and image data-driven machine-learning models], and related advanced model structures (i.e., popular transformer and Unet's variant, TransUnet, served as advanced model structures compared to the corresponding model structures commonly used in rice mapping). The experimental results showed that, when image sample was annotated well, both Unet and TransUnet were more suitable for rice mapping based on Sentinel-1 images, and their overall accuracies could reach 95% as sample size increased. Otherwise, when pixel sample size exceeded 100 000-level, nonbuffer equal proportion sampling–advanced transformer combination could be the currently optimal selection over the combination of this sampling method and RF, and its overall accuracy could reach 91% as sample size increased. Besides, it was worth noting that for data-driven machine-learning models commonly used in rice mapping, key factors for pixel data-driven ones to improve mapping accuracy was model structure upgrade, while for image data-driven ones, richness of image samples was more important.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"18 ","pages":"8340-8359"},"PeriodicalIF":4.7000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10921717","citationCount":"0","resultStr":"{\"title\":\"Combination Manner of Sampling Method and Model Structure: The Key Factor for Rice Mapping Based on Sentinel-1 Images Using Data-Driven Machine Learning\",\"authors\":\"Pengliang Wei;Jiao Guo;Jiaqian Lian;Chaoyang Wang\",\"doi\":\"10.1109/JSTARS.2025.3550109\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Agricultural remote sensing community is increasingly focusing on enhancing crop mapping accuracy by improving data-driven machine-learning model structures, yet ignoring impact of sampling–model structure combination on it, which may prevent full utilization of input data, especially for synthetic aperture radar images with fewer crop prior features. Consequently, this article took rice as target crop, and systematically performed rice mapping experiments based on Sentinel-1 images to assess mapping accuracies, model learning results, and model uncertainty under different sampling–model structures combinations. The sampling methods included pixel sampling in buffer or nonbuffer mode with equal proportion and equal quantity (pixel sample), as well as panoramic information sampling (image sample). The included model structures mainly focused on the models commonly used in rice mapping [i.e., Random Forest (RF) and Unet as traditional pixel and image data-driven machine-learning models], and related advanced model structures (i.e., popular transformer and Unet's variant, TransUnet, served as advanced model structures compared to the corresponding model structures commonly used in rice mapping). The experimental results showed that, when image sample was annotated well, both Unet and TransUnet were more suitable for rice mapping based on Sentinel-1 images, and their overall accuracies could reach 95% as sample size increased. Otherwise, when pixel sample size exceeded 100 000-level, nonbuffer equal proportion sampling–advanced transformer combination could be the currently optimal selection over the combination of this sampling method and RF, and its overall accuracy could reach 91% as sample size increased. Besides, it was worth noting that for data-driven machine-learning models commonly used in rice mapping, key factors for pixel data-driven ones to improve mapping accuracy was model structure upgrade, while for image data-driven ones, richness of image samples was more important.\",\"PeriodicalId\":13116,\"journal\":{\"name\":\"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing\",\"volume\":\"18 \",\"pages\":\"8340-8359\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2025-03-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10921717\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10921717/\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10921717/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

农业遥感界越来越关注通过改进数据驱动的机器学习模型结构来提高作物制图精度，而忽略了采样-模型结构组合对其的影响，这可能会影响输入数据的充分利用，特别是对于作物先验特征较少的合成孔径雷达图像。因此，本文以水稻为目标作物，系统地开展了基于Sentinel-1图像的水稻制图实验，以评估不同采样-模型结构组合下的制图精度、模型学习结果和模型不确定性。采样方式包括缓冲或非缓冲方式等比例等量像素采样（像素样本）和全景信息采样（图像样本）。所包含的模型结构主要集中在水稻绘图中常用的模型[即随机森林（Random Forest， RF）和Unet作为传统的像素和图像数据驱动的机器学习模型]，以及相关的高级模型结构（即流行的transformer和Unet的变体TransUnet，作为与水稻绘图中常用的相应模型结构相比的高级模型结构）。实验结果表明，当图像样本标注良好时，Unet和TransUnet更适合基于Sentinel-1图像的水稻制图，随着样本量的增加，两者的总体精度可达到95%。否则，当像素样本量超过10万级时，非缓冲等比例采样-先进变压器组合是目前该采样方法与射频组合的最优选择，随着样本量的增加，其总体精度可达91%。此外，值得注意的是，对于水稻制图中常用的数据驱动的机器学习模型，像素数据驱动的机器学习模型提高制图精度的关键因素是模型结构的升级，而图像数据驱动的机器学习模型更重要的是图像样本的丰富度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Combination Manner of Sampling Method and Model Structure: The Key Factor for Rice Mapping Based on Sentinel-1 Images Using Data-Driven Machine Learning

Agricultural remote sensing community is increasingly focusing on enhancing crop mapping accuracy by improving data-driven machine-learning model structures, yet ignoring impact of sampling–model structure combination on it, which may prevent full utilization of input data, especially for synthetic aperture radar images with fewer crop prior features. Consequently, this article took rice as target crop, and systematically performed rice mapping experiments based on Sentinel-1 images to assess mapping accuracies, model learning results, and model uncertainty under different sampling–model structures combinations. The sampling methods included pixel sampling in buffer or nonbuffer mode with equal proportion and equal quantity (pixel sample), as well as panoramic information sampling (image sample). The included model structures mainly focused on the models commonly used in rice mapping [i.e., Random Forest (RF) and Unet as traditional pixel and image data-driven machine-learning models], and related advanced model structures (i.e., popular transformer and Unet's variant, TransUnet, served as advanced model structures compared to the corresponding model structures commonly used in rice mapping). The experimental results showed that, when image sample was annotated well, both Unet and TransUnet were more suitable for rice mapping based on Sentinel-1 images, and their overall accuracies could reach 95% as sample size increased. Otherwise, when pixel sample size exceeded 100 000-level, nonbuffer equal proportion sampling–advanced transformer combination could be the currently optimal selection over the combination of this sampling method and RF, and its overall accuracy could reach 91% as sample size increased. Besides, it was worth noting that for data-driven machine-learning models commonly used in rice mapping, key factors for pixel data-driven ones to improve mapping accuracy was model structure upgrade, while for image data-driven ones, richness of image samples was more important.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 地学-成像科学与照相技术

CiteScore

9.30

自引率

10.90%

发文量

563

审稿时长

4.7 months

期刊介绍： The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.