Xiaoliang Tan , Guanzhou Chen , Xiaodong Zhang , Tong Wang , Jiaqi Wang , Kui Wang , Tingxuan Miao
{"title":"TripleS: Mitigating multi-task learning conflicts for semantic change detection in high-resolution remote sensing imagery","authors":"Xiaoliang Tan , Guanzhou Chen , Xiaodong Zhang , Tong Wang , Jiaqi Wang , Kui Wang , Tingxuan Miao","doi":"10.1016/j.isprsjprs.2025.09.019","DOIUrl":null,"url":null,"abstract":"<div><div>Periodical earth observation from multi-temporal high spatial resolution remote sensing imagery (RSI) offers valuable insights into the complex dynamics of land surface changes. Semantic change detection (SCD), cooperating with deep learning (DL) architectures, has evolved from binary change detection (BCD) into an effective technique capable of not only identifying change locations but also specifying land-cover and land-use (LCLU) categories. Recent advancements suggest that SCD can be modeled as a multi-task learning (MTL) framework, involving multiple branches for individual subtasks to process dual RSI inputs, and optimized through joint training. However, limitations remain in the inadequate interactions between bi-temporal branches and semantic-change branches, as well as the pervasive gradient conflicts among subtasks within MTL frameworks, which can lead to counterbalanced performances. To address the above limitations, we propose an MTL-oriented SCD model (MOSCD), which mutually enhances bi-temporal features, while ensuring that representations across the subtask branches are coherently correlated. Furthermore, the TripleS framework is designed to enhance the optimization of the MTL framework through counteracting the conflicting subtask objectives, which incorporates three novel schemes: Stepwise multi-task optimization, Selective parameter binding, and Scheduling for dynamically training MTL bindings. Extensive experiments conducted on three full-coverage land-cover SCD datasets, including one public dataset (HRSCD) and two self-constructed datasets (SC-SCD7 and CC-SCD5), demonstrate that the MOSCD enhanced with TripleS outperforms eleven existing SCD methods and three MTL methods by up to 21.17% on SeK metrics. The robust performances over diverse landscapes and transferability on other componentized benchmarks validate that the MOSCD trained with TripleS is a practicable tool for detecting subtle land-cover changes from high spatial resolution RSI data. Codes and the two constructed datasets will be available at <span><span>https://github.com/StephenApX/MTL-TripleS</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50269,"journal":{"name":"ISPRS Journal of Photogrammetry and Remote Sensing","volume":"230 ","pages":"Pages 374-401"},"PeriodicalIF":12.2000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ISPRS Journal of Photogrammetry and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0924271625003776","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOGRAPHY, PHYSICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Periodical earth observation from multi-temporal high spatial resolution remote sensing imagery (RSI) offers valuable insights into the complex dynamics of land surface changes. Semantic change detection (SCD), cooperating with deep learning (DL) architectures, has evolved from binary change detection (BCD) into an effective technique capable of not only identifying change locations but also specifying land-cover and land-use (LCLU) categories. Recent advancements suggest that SCD can be modeled as a multi-task learning (MTL) framework, involving multiple branches for individual subtasks to process dual RSI inputs, and optimized through joint training. However, limitations remain in the inadequate interactions between bi-temporal branches and semantic-change branches, as well as the pervasive gradient conflicts among subtasks within MTL frameworks, which can lead to counterbalanced performances. To address the above limitations, we propose an MTL-oriented SCD model (MOSCD), which mutually enhances bi-temporal features, while ensuring that representations across the subtask branches are coherently correlated. Furthermore, the TripleS framework is designed to enhance the optimization of the MTL framework through counteracting the conflicting subtask objectives, which incorporates three novel schemes: Stepwise multi-task optimization, Selective parameter binding, and Scheduling for dynamically training MTL bindings. Extensive experiments conducted on three full-coverage land-cover SCD datasets, including one public dataset (HRSCD) and two self-constructed datasets (SC-SCD7 and CC-SCD5), demonstrate that the MOSCD enhanced with TripleS outperforms eleven existing SCD methods and three MTL methods by up to 21.17% on SeK metrics. The robust performances over diverse landscapes and transferability on other componentized benchmarks validate that the MOSCD trained with TripleS is a practicable tool for detecting subtle land-cover changes from high spatial resolution RSI data. Codes and the two constructed datasets will be available at https://github.com/StephenApX/MTL-TripleS.
期刊介绍:
The ISPRS Journal of Photogrammetry and Remote Sensing (P&RS) serves as the official journal of the International Society for Photogrammetry and Remote Sensing (ISPRS). It acts as a platform for scientists and professionals worldwide who are involved in various disciplines that utilize photogrammetry, remote sensing, spatial information systems, computer vision, and related fields. The journal aims to facilitate communication and dissemination of advancements in these disciplines, while also acting as a comprehensive source of reference and archive.
P&RS endeavors to publish high-quality, peer-reviewed research papers that are preferably original and have not been published before. These papers can cover scientific/research, technological development, or application/practical aspects. Additionally, the journal welcomes papers that are based on presentations from ISPRS meetings, as long as they are considered significant contributions to the aforementioned fields.
In particular, P&RS encourages the submission of papers that are of broad scientific interest, showcase innovative applications (especially in emerging fields), have an interdisciplinary focus, discuss topics that have received limited attention in P&RS or related journals, or explore new directions in scientific or professional realms. It is preferred that theoretical papers include practical applications, while papers focusing on systems and applications should include a theoretical background.