Jindou Zhang , Ruiqian Zhang , Xiao Huang , Zhizheng Zhang , Bowen Cai , Xianwei Lv , Zhenfeng Shao , Deren Li
{"title":"Joint content-aware and difference-transform lightweight network for remote sensing images semantic change detection","authors":"Jindou Zhang , Ruiqian Zhang , Xiao Huang , Zhizheng Zhang , Bowen Cai , Xianwei Lv , Zhenfeng Shao , Deren Li","doi":"10.1016/j.inffus.2025.103276","DOIUrl":null,"url":null,"abstract":"<div><div>Advancements in Earth observation technology have enabled effective monitoring of complex surface changes. Semantic change detection (SCD) using high-resolution remote sensing images is crucial for urban planning and environmental monitoring. However, existing deep learning-based SCD methods, which combine semantic segmentation (SS) and binary change detection (BCD), face challenges in lightweight design and consistency between semantic and change results, limiting their accuracy and applicability. To overcome these limitations, we propose the Joint Content-Aware and Difference-Transform Lightweight Network (CDLNet). CDLNet features a lightweight architecture, skip connections, and a multi-task decoding mechanism. The Temporal-Spatial Content-Aware Fusion module (TSAF) in the SS decoding branch incorporates change information to improve semantic classification accuracy within change regions. The Multi-Type Temporal Difference-Transform module (MTDT) in the BCD decoding branch enhances change localization for accurate SCD through efficient transformation of temporal difference features. Experiments on the SECOND, HiUCD mini, MSSCD, and Landsat-SCD datasets demonstrate that CDLNet outperforms thirteen state-of-the-art methods, achieving average improvements of 1.41%, 1.53% and 1.49% in the <span><math><mrow><mi>F</mi><msub><mrow><mn>1</mn></mrow><mrow><mi>s</mi><mi>c</mi><mi>d</mi></mrow></msub></mrow></math></span>, <span><math><mrow><mi>I</mi><mi>o</mi><mi>U</mi><mi>c</mi></mrow></math></span> and <span><math><mrow><mi>S</mi><mi>c</mi><mi>o</mi><mi>r</mi><mi>e</mi></mrow></math></span> metrics, respectively. Ablation studies confirm the effectiveness of the TSAF and MTDT modules and the rationality of multi-task loss weight configuration. Furthermore, CDLNet utilizes only 20% of the parameters (12.88M) and 7.5% of the FLOPs (30.11G) of the leading model, achieving an inference speed of 41 FPS, which underscores its superior lightweight characteristics. The results indicate that CDLNet offers excellent detection performance, generalization, and robustness within a lightweight framework. The code of our paper is accessible at: <span><span>https://github.com/zjd1836/CDLNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"123 ","pages":"Article 103276"},"PeriodicalIF":14.7000,"publicationDate":"2025-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525003495","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Advancements in Earth observation technology have enabled effective monitoring of complex surface changes. Semantic change detection (SCD) using high-resolution remote sensing images is crucial for urban planning and environmental monitoring. However, existing deep learning-based SCD methods, which combine semantic segmentation (SS) and binary change detection (BCD), face challenges in lightweight design and consistency between semantic and change results, limiting their accuracy and applicability. To overcome these limitations, we propose the Joint Content-Aware and Difference-Transform Lightweight Network (CDLNet). CDLNet features a lightweight architecture, skip connections, and a multi-task decoding mechanism. The Temporal-Spatial Content-Aware Fusion module (TSAF) in the SS decoding branch incorporates change information to improve semantic classification accuracy within change regions. The Multi-Type Temporal Difference-Transform module (MTDT) in the BCD decoding branch enhances change localization for accurate SCD through efficient transformation of temporal difference features. Experiments on the SECOND, HiUCD mini, MSSCD, and Landsat-SCD datasets demonstrate that CDLNet outperforms thirteen state-of-the-art methods, achieving average improvements of 1.41%, 1.53% and 1.49% in the , and metrics, respectively. Ablation studies confirm the effectiveness of the TSAF and MTDT modules and the rationality of multi-task loss weight configuration. Furthermore, CDLNet utilizes only 20% of the parameters (12.88M) and 7.5% of the FLOPs (30.11G) of the leading model, achieving an inference speed of 41 FPS, which underscores its superior lightweight characteristics. The results indicate that CDLNet offers excellent detection performance, generalization, and robustness within a lightweight framework. The code of our paper is accessible at: https://github.com/zjd1836/CDLNet.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.