Ze Zhang;Yue Zhou;Linqing Huang;Xue Jiang;Guozheng Xu;Xingzhao Liu
{"title":"基于CNN和Mamba的局部和全局特征集成半监督变化检测","authors":"Ze Zhang;Yue Zhou;Linqing Huang;Xue Jiang;Guozheng Xu;Xingzhao Liu","doi":"10.1109/TIM.2025.3573775","DOIUrl":null,"url":null,"abstract":"Semi-supervised change detection (SSCD) has become increasingly important in remote sensing image (RSI) analysis due to the scarcity of labeled data. While state-of-the-art SSCD methods have achieved notable results through pseudo-labeling and weak-to-strong consistency regularization with various perturbations, they face several inherent limitations: 1) learners (models or architectures with the same design) often focus exclusively on either local or global features, which fails to capture the intricate bi-temporal difference feature; 2) low-quality pseudo-labels, often resulting from inadequate learner diversity or suboptimal perturbations, struggle to reflect accurate changes, exacerbating confirmation bias; and 3) a predominant emphasis on pixel-level consistency overlooks broader image context, limiting the ability to capture complex, large-scale spatiotemporal changes. To address these challenges, we propose a novel SSCD framework, cross-supervision with convolutional neural network and Mamba (CSCM), which adopts the convolutional neural network (CNN) and Mamba as two independent learners within a cross-supervision mechanism, enabling the integration of local and global feature representations. The CNN excels at capturing fine-grained local details, while Mamba efficiently models long-range dependencies with linear complexity, making it particularly well-suited for processing large-scale RSIs. To enhance the collaboration between these architectures, we introduce the cross-architecture fusion module (CAFM), which fuses difference features extracted by the CNN and Mamba, combining local sensitivity with global awareness to produce refined pseudo-labels. Additionally, we incorporate instance-level global consistency to capture broader image context, ensuring a more comprehensive understanding of spatiotemporal changes beyond pixel-level consistency. Extensive experiments on three public datasets demonstrate that our approach significantly enhances CD accuracy and training efficiency, outperforming several state-of-the-art methods.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-15"},"PeriodicalIF":5.6000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrating Local and Global Features via CNN and Mamba for Semi-Supervised Change Detection\",\"authors\":\"Ze Zhang;Yue Zhou;Linqing Huang;Xue Jiang;Guozheng Xu;Xingzhao Liu\",\"doi\":\"10.1109/TIM.2025.3573775\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Semi-supervised change detection (SSCD) has become increasingly important in remote sensing image (RSI) analysis due to the scarcity of labeled data. While state-of-the-art SSCD methods have achieved notable results through pseudo-labeling and weak-to-strong consistency regularization with various perturbations, they face several inherent limitations: 1) learners (models or architectures with the same design) often focus exclusively on either local or global features, which fails to capture the intricate bi-temporal difference feature; 2) low-quality pseudo-labels, often resulting from inadequate learner diversity or suboptimal perturbations, struggle to reflect accurate changes, exacerbating confirmation bias; and 3) a predominant emphasis on pixel-level consistency overlooks broader image context, limiting the ability to capture complex, large-scale spatiotemporal changes. To address these challenges, we propose a novel SSCD framework, cross-supervision with convolutional neural network and Mamba (CSCM), which adopts the convolutional neural network (CNN) and Mamba as two independent learners within a cross-supervision mechanism, enabling the integration of local and global feature representations. The CNN excels at capturing fine-grained local details, while Mamba efficiently models long-range dependencies with linear complexity, making it particularly well-suited for processing large-scale RSIs. To enhance the collaboration between these architectures, we introduce the cross-architecture fusion module (CAFM), which fuses difference features extracted by the CNN and Mamba, combining local sensitivity with global awareness to produce refined pseudo-labels. Additionally, we incorporate instance-level global consistency to capture broader image context, ensuring a more comprehensive understanding of spatiotemporal changes beyond pixel-level consistency. Extensive experiments on three public datasets demonstrate that our approach significantly enhances CD accuracy and training efficiency, outperforming several state-of-the-art methods.\",\"PeriodicalId\":13341,\"journal\":{\"name\":\"IEEE Transactions on Instrumentation and Measurement\",\"volume\":\"74 \",\"pages\":\"1-15\"},\"PeriodicalIF\":5.6000,\"publicationDate\":\"2025-06-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Instrumentation and Measurement\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11018868/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11018868/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Integrating Local and Global Features via CNN and Mamba for Semi-Supervised Change Detection
Semi-supervised change detection (SSCD) has become increasingly important in remote sensing image (RSI) analysis due to the scarcity of labeled data. While state-of-the-art SSCD methods have achieved notable results through pseudo-labeling and weak-to-strong consistency regularization with various perturbations, they face several inherent limitations: 1) learners (models or architectures with the same design) often focus exclusively on either local or global features, which fails to capture the intricate bi-temporal difference feature; 2) low-quality pseudo-labels, often resulting from inadequate learner diversity or suboptimal perturbations, struggle to reflect accurate changes, exacerbating confirmation bias; and 3) a predominant emphasis on pixel-level consistency overlooks broader image context, limiting the ability to capture complex, large-scale spatiotemporal changes. To address these challenges, we propose a novel SSCD framework, cross-supervision with convolutional neural network and Mamba (CSCM), which adopts the convolutional neural network (CNN) and Mamba as two independent learners within a cross-supervision mechanism, enabling the integration of local and global feature representations. The CNN excels at capturing fine-grained local details, while Mamba efficiently models long-range dependencies with linear complexity, making it particularly well-suited for processing large-scale RSIs. To enhance the collaboration between these architectures, we introduce the cross-architecture fusion module (CAFM), which fuses difference features extracted by the CNN and Mamba, combining local sensitivity with global awareness to produce refined pseudo-labels. Additionally, we incorporate instance-level global consistency to capture broader image context, ensuring a more comprehensive understanding of spatiotemporal changes beyond pixel-level consistency. Extensive experiments on three public datasets demonstrate that our approach significantly enhances CD accuracy and training efficiency, outperforming several state-of-the-art methods.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.