RSProtoSemiSeg:基于概率分布原型的高空间分辨率遥感图像半监督语义分割

IF 12.2 1区 地球科学 Q1 GEOGRAPHY, PHYSICAL
Wenjie Sun , Yujie Lei , Danfeng Hong , Zhongwen Hu , Qingquan Li , Jie Zhang
{"title":"RSProtoSemiSeg:基于概率分布原型的高空间分辨率遥感图像半监督语义分割","authors":"Wenjie Sun ,&nbsp;Yujie Lei ,&nbsp;Danfeng Hong ,&nbsp;Zhongwen Hu ,&nbsp;Qingquan Li ,&nbsp;Jie Zhang","doi":"10.1016/j.isprsjprs.2025.07.040","DOIUrl":null,"url":null,"abstract":"<div><div>Semi-supervised semantic segmentation of high spatial resolution remote sensing images aims to mitigate the reliance on labeled data by using limited labeled data alongside extensive unlabeled data. This approach significantly reduces the dependency on labeled images and annotation costs and mitigates the challenge of obtaining large-scale labeled datasets. Current semi-supervised methods for remote sensing images semantic segmentation mainly focus on improving pseudo-label quality. However, the inherent noise in pseudo-labels remains a critical issue, leading to persistent inaccuracies. Furthermore, the high intra-class variance in such image complicate pixel-wise label propagation, exacerbating pseudo-label errors and substantially constraining segmentation accuracy. To tackle these limitations, we propose a contrastive learning framework based on a mixture of Gaussian mixture distributions. Our approach uses a mixture probability distribution prototype predictor to adaptively regulate the influence of intra-class prototypes on pixel representations, mapping features to a multivariate Gaussian model to mitigate pseudo-label inaccuracies. We also introduce a novel mixture contrastive loss function to guide pseudo-labeled pixels towards intra-class prototypes and away from inter-class prototypes while repelling, thereby enhancing representation fidelity. Extensive experiments on three remote sensing semantic segmentation datasets demonstrate the efficacy of our approach. Compared to baseline models, our method achieves mIoU improvements ranging from 0.26% to 1.86%. Compared to the state-of-the-art DWL model, it achieves even higher mIoU improvements, ranging from 0.37% to 2.36%. These results confirm the effectiveness of our approach over existing semi-supervised segmentation methods. The code will be available at <span><span>https://github.com/aitointerp/rsprotosemiseg</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"228 ","pages":"Pages 771-784"},"PeriodicalIF":12.2000,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"RSProtoSemiSeg: Semi-supervised semantic segmentation of high spatial resolution remote sensing images with probabilistic distribution prototypes\",\"authors\":\"Wenjie Sun ,&nbsp;Yujie Lei ,&nbsp;Danfeng Hong ,&nbsp;Zhongwen Hu ,&nbsp;Qingquan Li ,&nbsp;Jie Zhang\",\"doi\":\"10.1016/j.isprsjprs.2025.07.040\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Semi-supervised semantic segmentation of high spatial resolution remote sensing images aims to mitigate the reliance on labeled data by using limited labeled data alongside extensive unlabeled data. This approach significantly reduces the dependency on labeled images and annotation costs and mitigates the challenge of obtaining large-scale labeled datasets. Current semi-supervised methods for remote sensing images semantic segmentation mainly focus on improving pseudo-label quality. However, the inherent noise in pseudo-labels remains a critical issue, leading to persistent inaccuracies. Furthermore, the high intra-class variance in such image complicate pixel-wise label propagation, exacerbating pseudo-label errors and substantially constraining segmentation accuracy. To tackle these limitations, we propose a contrastive learning framework based on a mixture of Gaussian mixture distributions. Our approach uses a mixture probability distribution prototype predictor to adaptively regulate the influence of intra-class prototypes on pixel representations, mapping features to a multivariate Gaussian model to mitigate pseudo-label inaccuracies. We also introduce a novel mixture contrastive loss function to guide pseudo-labeled pixels towards intra-class prototypes and away from inter-class prototypes while repelling, thereby enhancing representation fidelity. Extensive experiments on three remote sensing semantic segmentation datasets demonstrate the efficacy of our approach. Compared to baseline models, our method achieves mIoU improvements ranging from 0.26% to 1.86%. Compared to the state-of-the-art DWL model, it achieves even higher mIoU improvements, ranging from 0.37% to 2.36%. These results confirm the effectiveness of our approach over existing semi-supervised segmentation methods. The code will be available at <span><span>https://github.com/aitointerp/rsprotosemiseg</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50269,\"journal\":{\"name\":\"ISPRS Journal of Photogrammetry and Remote Sensing\",\"volume\":\"228 \",\"pages\":\"Pages 771-784\"},\"PeriodicalIF\":12.2000,\"publicationDate\":\"2025-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ISPRS Journal of Photogrammetry and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0924271625003107\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GEOGRAPHY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISPRS Journal of Photogrammetry and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0924271625003107","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

摘要

高空间分辨率遥感图像的半监督语义分割旨在通过使用有限的标记数据和大量的未标记数据来减轻对标记数据的依赖。该方法显著降低了对标记图像的依赖和标注成本,减轻了获取大规模标记数据集的挑战。目前用于遥感图像语义分割的半监督方法主要集中在提高伪标签质量上。然而,伪标签中固有的噪声仍然是一个关键问题,导致持续的不准确。此外,此类图像的高类内方差使像素级标签传播复杂化,加剧了伪标签错误并极大地限制了分割精度。为了解决这些限制,我们提出了一个基于高斯混合分布的对比学习框架。我们的方法使用混合概率分布原型预测器自适应调节类内原型对像素表示的影响,将特征映射到多变量高斯模型以减轻伪标签不准确性。我们还引入了一种新的混合对比损失函数来引导伪标记像素在排斥时接近类内原型并远离类间原型,从而提高了表示保真度。在三个遥感语义分割数据集上的大量实验证明了该方法的有效性。与基线模型相比,我们的方法实现了0.26%至1.86%的mIoU改进。与最先进的DWL模型相比,它实现了更高的mIoU改进,范围从0.37%到2.36%。这些结果证实了我们的方法比现有的半监督分割方法的有效性。代码可在https://github.com/aitointerp/rsprotosemiseg上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
RSProtoSemiSeg: Semi-supervised semantic segmentation of high spatial resolution remote sensing images with probabilistic distribution prototypes
Semi-supervised semantic segmentation of high spatial resolution remote sensing images aims to mitigate the reliance on labeled data by using limited labeled data alongside extensive unlabeled data. This approach significantly reduces the dependency on labeled images and annotation costs and mitigates the challenge of obtaining large-scale labeled datasets. Current semi-supervised methods for remote sensing images semantic segmentation mainly focus on improving pseudo-label quality. However, the inherent noise in pseudo-labels remains a critical issue, leading to persistent inaccuracies. Furthermore, the high intra-class variance in such image complicate pixel-wise label propagation, exacerbating pseudo-label errors and substantially constraining segmentation accuracy. To tackle these limitations, we propose a contrastive learning framework based on a mixture of Gaussian mixture distributions. Our approach uses a mixture probability distribution prototype predictor to adaptively regulate the influence of intra-class prototypes on pixel representations, mapping features to a multivariate Gaussian model to mitigate pseudo-label inaccuracies. We also introduce a novel mixture contrastive loss function to guide pseudo-labeled pixels towards intra-class prototypes and away from inter-class prototypes while repelling, thereby enhancing representation fidelity. Extensive experiments on three remote sensing semantic segmentation datasets demonstrate the efficacy of our approach. Compared to baseline models, our method achieves mIoU improvements ranging from 0.26% to 1.86%. Compared to the state-of-the-art DWL model, it achieves even higher mIoU improvements, ranging from 0.37% to 2.36%. These results confirm the effectiveness of our approach over existing semi-supervised segmentation methods. The code will be available at https://github.com/aitointerp/rsprotosemiseg.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ISPRS Journal of Photogrammetry and Remote Sensing
ISPRS Journal of Photogrammetry and Remote Sensing 工程技术-成像科学与照相技术
CiteScore
21.00
自引率
6.30%
发文量
273
审稿时长
40 days
期刊介绍: The ISPRS Journal of Photogrammetry and Remote Sensing (P&RS) serves as the official journal of the International Society for Photogrammetry and Remote Sensing (ISPRS). It acts as a platform for scientists and professionals worldwide who are involved in various disciplines that utilize photogrammetry, remote sensing, spatial information systems, computer vision, and related fields. The journal aims to facilitate communication and dissemination of advancements in these disciplines, while also acting as a comprehensive source of reference and archive. P&RS endeavors to publish high-quality, peer-reviewed research papers that are preferably original and have not been published before. These papers can cover scientific/research, technological development, or application/practical aspects. Additionally, the journal welcomes papers that are based on presentations from ISPRS meetings, as long as they are considered significant contributions to the aforementioned fields. In particular, P&RS encourages the submission of papers that are of broad scientific interest, showcase innovative applications (especially in emerging fields), have an interdisciplinary focus, discuss topics that have received limited attention in P&RS or related journals, or explore new directions in scientific or professional realms. It is preferred that theoretical papers include practical applications, while papers focusing on systems and applications should include a theoretical background.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信