面向细粒度语义分割的受控sam和上下文提升网络

IF 4.7 2区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Jinglin Zhang;Yuxia Li;Lei He;Bowei Zhang;Zhenye Niu;Yonghui Zhang;Shiyu Luo
{"title":"面向细粒度语义分割的受控sam和上下文提升网络","authors":"Jinglin Zhang;Yuxia Li;Lei He;Bowei Zhang;Zhenye Niu;Yonghui Zhang;Shiyu Luo","doi":"10.1109/JSTARS.2025.3581620","DOIUrl":null,"url":null,"abstract":"Fine-grained semantic segmentation of remote sensing imagery is critical for applications such as land use analysis and agricultural monitoring. However, it remains challenging due to the subtle inter-class differences between visually similar objects, which often result in misclassifications. This challenge becomes particularly evident in distinguishing classes such as rivers, ponds, and fishponds, which share similar spectral and spatial characteristics. To address these challenges, we propose CSCPNet, a novel framework optimized for fine-grained feature extraction and segmentation accuracy. CSCPNet features the controlled-segment anything model (SAM) encoder and the context promoting decoder. The controlled SAM encoder, by using shallow and deep feature fusion modules, integrates multiscale features from both a pretrained SAM encoder and a lightweight encoder, excelling in capturing detailed fine-grained features. The context promoting decoder with context attention is designed to iteratively refine feature maps through multistep decoding, effectively incorporating contextual information. Extensive experiments on FBP and ShengTeng datasets with fine-grained classes demonstrate that CSCPNet achieves state-of-the-art performance in fine-grained semantic segmentation. On the FBP dataset with 24 fine-grained classes, CSCPNet improves overall accuracy (OA), mean intersection over union (mIoU), and mF1 by 4.4%, 6.7%, and 9.3%, respectively. Similarly, on the ShengTeng dataset with 47 fine-grained classes, it achieves gains of 5.5% in OA, 7.3% in mIoU, and 7.9% in mF1. Meanwhile, CSCPNet maintains competitive accuracy in normal segmentation datasets such as Potsdam dataset and CZWZ dataset. These results demonstrate that CSCPNet excels at capturing fine-grained details and effectively distinguishing visually similar classes, making it a robust and efficient solution for fine-grained semantic segmentation of remote sensing images.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"18 ","pages":"15707-15724"},"PeriodicalIF":4.7000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11045311","citationCount":"0","resultStr":"{\"title\":\"Controlled-SAM and Context Promoting Network for Fine-Grained Semantic Segmentation\",\"authors\":\"Jinglin Zhang;Yuxia Li;Lei He;Bowei Zhang;Zhenye Niu;Yonghui Zhang;Shiyu Luo\",\"doi\":\"10.1109/JSTARS.2025.3581620\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Fine-grained semantic segmentation of remote sensing imagery is critical for applications such as land use analysis and agricultural monitoring. However, it remains challenging due to the subtle inter-class differences between visually similar objects, which often result in misclassifications. This challenge becomes particularly evident in distinguishing classes such as rivers, ponds, and fishponds, which share similar spectral and spatial characteristics. To address these challenges, we propose CSCPNet, a novel framework optimized for fine-grained feature extraction and segmentation accuracy. CSCPNet features the controlled-segment anything model (SAM) encoder and the context promoting decoder. The controlled SAM encoder, by using shallow and deep feature fusion modules, integrates multiscale features from both a pretrained SAM encoder and a lightweight encoder, excelling in capturing detailed fine-grained features. The context promoting decoder with context attention is designed to iteratively refine feature maps through multistep decoding, effectively incorporating contextual information. Extensive experiments on FBP and ShengTeng datasets with fine-grained classes demonstrate that CSCPNet achieves state-of-the-art performance in fine-grained semantic segmentation. On the FBP dataset with 24 fine-grained classes, CSCPNet improves overall accuracy (OA), mean intersection over union (mIoU), and mF1 by 4.4%, 6.7%, and 9.3%, respectively. Similarly, on the ShengTeng dataset with 47 fine-grained classes, it achieves gains of 5.5% in OA, 7.3% in mIoU, and 7.9% in mF1. Meanwhile, CSCPNet maintains competitive accuracy in normal segmentation datasets such as Potsdam dataset and CZWZ dataset. These results demonstrate that CSCPNet excels at capturing fine-grained details and effectively distinguishing visually similar classes, making it a robust and efficient solution for fine-grained semantic segmentation of remote sensing images.\",\"PeriodicalId\":13116,\"journal\":{\"name\":\"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing\",\"volume\":\"18 \",\"pages\":\"15707-15724\"},\"PeriodicalIF\":4.7000,\"publicationDate\":\"2025-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11045311\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11045311/\",\"RegionNum\":2,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11045311/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

遥感图像的细粒度语义分割对于土地利用分析和农业监测等应用至关重要。然而,由于视觉上相似的对象之间存在微妙的类间差异,这通常会导致错误分类,因此仍然具有挑战性。这一挑战在区分河流、池塘和鱼塘等具有相似光谱和空间特征的类别时尤为明显。为了解决这些挑战,我们提出了一种新的框架CSCPNet,该框架针对细粒度特征提取和分割精度进行了优化。CSCPNet的特点是控制段任意模型(SAM)编码器和上下文提升解码器。可控的SAM编码器通过使用浅特征和深特征融合模块,集成了预训练SAM编码器和轻量级编码器的多尺度特征,擅长捕获详细的细粒度特征。基于上下文关注的上下文促进解码器旨在通过多步解码迭代优化特征映射,有效地融合上下文信息。在带有细粒度类的FBP和ShengTeng数据集上进行的大量实验表明,CSCPNet在细粒度语义分割方面达到了最先进的性能。在包含24个细粒度类的FBP数据集上,CSCPNet将总体精度(OA)、平均交联(mIoU)和mF1分别提高了4.4%、6.7%和9.3%。同样,在具有47个细粒度类的ShengTeng数据集上,它在OA上实现了5.5%的增益,在mIoU上实现了7.3%的增益,在mF1上实现了7.9%的增益。同时,CSCPNet在波茨坦数据集和CZWZ数据集等常规分割数据集上保持了相当的精度。这些结果表明,CSCPNet在捕获细粒度细节和有效区分视觉相似类方面表现出色,是一种鲁棒且高效的遥感图像细粒度语义分割解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Controlled-SAM and Context Promoting Network for Fine-Grained Semantic Segmentation
Fine-grained semantic segmentation of remote sensing imagery is critical for applications such as land use analysis and agricultural monitoring. However, it remains challenging due to the subtle inter-class differences between visually similar objects, which often result in misclassifications. This challenge becomes particularly evident in distinguishing classes such as rivers, ponds, and fishponds, which share similar spectral and spatial characteristics. To address these challenges, we propose CSCPNet, a novel framework optimized for fine-grained feature extraction and segmentation accuracy. CSCPNet features the controlled-segment anything model (SAM) encoder and the context promoting decoder. The controlled SAM encoder, by using shallow and deep feature fusion modules, integrates multiscale features from both a pretrained SAM encoder and a lightweight encoder, excelling in capturing detailed fine-grained features. The context promoting decoder with context attention is designed to iteratively refine feature maps through multistep decoding, effectively incorporating contextual information. Extensive experiments on FBP and ShengTeng datasets with fine-grained classes demonstrate that CSCPNet achieves state-of-the-art performance in fine-grained semantic segmentation. On the FBP dataset with 24 fine-grained classes, CSCPNet improves overall accuracy (OA), mean intersection over union (mIoU), and mF1 by 4.4%, 6.7%, and 9.3%, respectively. Similarly, on the ShengTeng dataset with 47 fine-grained classes, it achieves gains of 5.5% in OA, 7.3% in mIoU, and 7.9% in mF1. Meanwhile, CSCPNet maintains competitive accuracy in normal segmentation datasets such as Potsdam dataset and CZWZ dataset. These results demonstrate that CSCPNet excels at capturing fine-grained details and effectively distinguishing visually similar classes, making it a robust and efficient solution for fine-grained semantic segmentation of remote sensing images.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
9.30
自引率
10.90%
发文量
563
审稿时长
4.7 months
期刊介绍: The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信