{"title":"PAMSNet: A point annotation-driven multi-source network for remote sensing semantic segmentation","authors":"Yuanhao Zhao , Mingming Jia , Genyun Sun , Aizhu Zhang","doi":"10.1016/j.isprsjprs.2025.07.035","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-source data semantic segmentation has proven to be an effective means of improving classification accuracy in remote sensing. With the rapid development of deep learning, the demand for large amounts of high-quality labeled samples has become a major bottleneck, limiting the broader application of these techniques. Weakly supervised learning has attracted increasing attention by reducing annotation costs. However, existing weakly supervised methods often suffer from limited accuracy. Effectively exploiting complementary information from multi-source remote sensing data using only a small number of labeled points remains a significant challenge. In this paper, we propose a novel architecture, named Point Annotation- Driven Multi-source Segmentation Network (PAMSNet), which leverages point annotations to effectively capture and integrate complementary features from multi-source remote sensing data. PAMSNet includes a Multi-source Feature Encoder and a Cross-Resolution Feature Integration (CRFI) module. The Multi-source Feature Encoder captures complementary global and local features using lightweight convolutional Global-Local Multi-source (GLMS) modules. And the boundary and spectral detail robustness are improved through Spectral-Edge Enhancement (SEE) modules, which effectively mitigate the impact of noise on segmentation accuracy. The CRFI module replaces conventional decoding structures by combining convolutional and Transformer mechanisms, enabling efficient cross-scale feature integration and improving the ability to identify multi-scale objects with reduced computational demands. Extensive experiments on the Vaihingen, WHU-IS, and WHU-OPT-SAR datasets validate the effectiveness of PAMSNet for weakly supervised multi-source segmentation as well as the validity of the proposed module. PAMSNet achieves state-of-the-art performance, with MIoU improvements of 2.4%, 2.1%, and 3.16% on three datasets, using only 0.01% point annotations. Additionally, PAMSNet can effectively balance the performance as well as the operational efficiency of the model compared to existing methods, which further promotes the application of deep learning in remote sensing image mapping.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"229 ","pages":"Pages 1-16"},"PeriodicalIF":12.2000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISPRS Journal of Photogrammetry and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0924271625003041","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}
Citations: 0
Abstract
Multi-source data semantic segmentation has proven to be an effective means of improving classification accuracy in remote sensing. However, with the rapid development of deep learning, the demand for large amounts of high-quality labeled samples has become a major bottleneck limiting the broader application of these techniques. Weakly supervised learning has attracted increasing attention because it reduces annotation costs, yet existing weakly supervised methods often suffer from limited accuracy. Effectively exploiting complementary information from multi-source remote sensing data using only a small number of labeled points remains a significant challenge. In this paper, we propose a novel architecture, the Point Annotation-Driven Multi-source Segmentation Network (PAMSNet), which leverages point annotations to effectively capture and integrate complementary features from multi-source remote sensing data. PAMSNet consists of a Multi-source Feature Encoder and a Cross-Resolution Feature Integration (CRFI) module. The Multi-source Feature Encoder captures complementary global and local features using lightweight convolutional Global-Local Multi-source (GLMS) modules, while Spectral-Edge Enhancement (SEE) modules improve robustness to boundaries and spectral details, effectively mitigating the impact of noise on segmentation accuracy. The CRFI module replaces conventional decoding structures by combining convolutional and Transformer mechanisms, enabling efficient cross-scale feature integration and improving the identification of multi-scale objects at reduced computational cost. Extensive experiments on the Vaihingen, WHU-IS, and WHU-OPT-SAR datasets validate the effectiveness of PAMSNet for weakly supervised multi-source segmentation, as well as the contribution of each proposed module. PAMSNet achieves state-of-the-art performance, with MIoU improvements of 2.4%, 2.1%, and 3.16% on the three datasets while using only 0.01% point annotations. Moreover, compared with existing methods, PAMSNet effectively balances segmentation performance and computational efficiency, which further promotes the application of deep learning to remote sensing image mapping.
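The abstract describes the architecture only at a high level. As a concrete but purely illustrative reference point, the sketch below shows, in PyTorch, the general kind of design it outlines: per-source encoders that combine local convolution with lightweight global context, and a cross-resolution fusion step that mixes convolution with a small Transformer layer applied at the coarser resolution. The module names (GlobalLocalBlock, CrossResolutionFusion, TwoSourceSegmenter), channel widths, and layer choices are assumptions for illustration and do not reproduce the paper's GLMS, SEE, or CRFI designs.

```python
# Minimal sketch (not the authors' implementation) of a two-source segmenter in the
# spirit of the abstract: global-local per-source encoding plus cross-resolution
# fusion of convolutional and Transformer features. All design details are assumed.
import torch
import torch.nn as nn


class GlobalLocalBlock(nn.Module):
    """Hypothetical global-local block: a 3x3 conv branch for local detail,
    reweighted by a pooled 1x1 conv branch providing coarse global context."""
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.local = nn.Sequential(
            nn.Conv2d(in_ch, out_ch, 3, padding=1),
            nn.BatchNorm2d(out_ch),
            nn.ReLU(inplace=True),
        )
        self.global_ctx = nn.Sequential(
            nn.AdaptiveAvgPool2d(1), nn.Conv2d(in_ch, out_ch, 1), nn.Sigmoid()
        )

    def forward(self, x):
        # Local features modulated channel-wise by global context.
        return self.local(x) * self.global_ctx(x)


class CrossResolutionFusion(nn.Module):
    """Hypothetical stand-in for a CRFI-style step: refine the coarse map with one
    small Transformer encoder layer (cheap at low resolution), upsample it, and
    merge it with the fine map through a 1x1 conv."""
    def __init__(self, fine_ch, coarse_ch, out_ch, num_heads=4):
        super().__init__()
        self.attn = nn.TransformerEncoderLayer(
            d_model=coarse_ch, nhead=num_heads,
            dim_feedforward=2 * coarse_ch, batch_first=True,
        )
        self.mix = nn.Conv2d(fine_ch + coarse_ch, out_ch, 1)

    def forward(self, fine, coarse):
        b, c, h, w = coarse.shape
        # Global token mixing at the coarse resolution keeps attention affordable.
        tokens = self.attn(coarse.flatten(2).transpose(1, 2))      # (B, H*W, C)
        coarse = tokens.transpose(1, 2).reshape(b, c, h, w)
        coarse = nn.functional.interpolate(
            coarse, size=fine.shape[-2:], mode="bilinear", align_corners=False
        )
        return self.mix(torch.cat([fine, coarse], dim=1))


class TwoSourceSegmenter(nn.Module):
    """Toy two-source segmenter: encode each modality, fuse across resolutions,
    and predict per-pixel class logits."""
    def __init__(self, ch_a, ch_b, num_classes):
        super().__init__()
        self.enc_a = GlobalLocalBlock(ch_a, 64)
        self.enc_b = GlobalLocalBlock(ch_b, 64)
        self.down = nn.Sequential(nn.MaxPool2d(2), GlobalLocalBlock(128, 128))
        self.fuse = CrossResolutionFusion(fine_ch=128, coarse_ch=128, out_ch=64)
        self.head = nn.Conv2d(64, num_classes, 1)

    def forward(self, x_a, x_b):
        fine = torch.cat([self.enc_a(x_a), self.enc_b(x_b)], dim=1)  # fused fine features
        coarse = self.down(fine)                                     # half-resolution context
        return self.head(self.fuse(fine, coarse))


if __name__ == "__main__":
    optical = torch.randn(1, 3, 64, 64)   # e.g. optical patch
    sar = torch.randn(1, 1, 64, 64)       # e.g. SAR patch
    logits = TwoSourceSegmenter(ch_a=3, ch_b=1, num_classes=6)(optical, sar)
    print(logits.shape)  # torch.Size([1, 6, 64, 64])
```

Under point annotation, supervision of such a model would typically be applied by evaluating the per-pixel cross-entropy loss only at the sparsely labeled pixels, with unlabeled locations masked out; the abstract does not specify PAMSNet's exact loss formulation.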
Journal Introduction:
The ISPRS Journal of Photogrammetry and Remote Sensing (P&RS) is the official journal of the International Society for Photogrammetry and Remote Sensing (ISPRS). It serves as a platform for scientists and professionals worldwide working in the disciplines that use photogrammetry, remote sensing, spatial information systems, computer vision, and related fields. The journal aims to facilitate communication and dissemination of advances in these disciplines, while also serving as a comprehensive source of reference and an archive.
P&RS endeavors to publish high-quality, peer-reviewed research papers that are preferably original and have not been published before. These papers can cover scientific/research, technological development, or application/practical aspects. Additionally, the journal welcomes papers that are based on presentations from ISPRS meetings, as long as they are considered significant contributions to the aforementioned fields.
In particular, P&RS encourages the submission of papers that are of broad scientific interest, showcase innovative applications (especially in emerging fields), have an interdisciplinary focus, discuss topics that have received limited attention in P&RS or related journals, or explore new directions in scientific or professional realms. It is preferred that theoretical papers include practical applications, while papers focusing on systems and applications should include a theoretical background.