{"title":"SDGANets: a semantically enhanced dual graph-aware network for affine and registration of remote sensing images","authors":"Xie Zhuli, Wan Gang, Liu Jia, Bu Dongdong","doi":"10.1007/s40747-025-01792-1","DOIUrl":null,"url":null,"abstract":"<p>Remote sensing image pairs of different time phases have complex and changeable semantic contents, and traditional convolutional registration methods are challenging in modeling subtle local changes and global large-scale deformation differences in detail. This results in poor registration performance and poor feature representation. To address these problems, a semantically enhanced dual-graph perception framework is proposed. This framework aims to gradually achieve semantic alignment and precise registration of remote sensing image pairs of different time phases via coarse to fine stages. On the one hand, a newly designed large-selection kernel convolution attention module is used to learn affine transformation parameters. Attention to global semantics perceives the large pixel displacement deviation caused by large-scale deformation, and the association relationship is established between remote sensing image pairs of different time phases. At the same time, dual-graph perception modules are embedded in multiple subspace structures, and the subtle local changes of remote sensing image pairs are modeled through the dynamic aggregation ability of graph perception nodes to achieve coarse registration of remote sensing images. On the other hand, a U-shaped module guided by global attention with deformable convolution is used to refine the local spatial structural features and global contextual semantic information of the rough registration, establish dependencies between channels, and correct the pixel displacement deviation of remote sensing image pairs of different phases through position encoding. It is worth noting that the newly designed weighted loss function supervises the learning of each module and the entire network structure from the perspective of inverse consistency, promoting the network’s optimal performance. Finally, the experimental results on the AerialData and GFRS datasets show that the proposed framework has good registration performance, with mean absolute error (MAE) of 3.64 and 3.81, respectively.</p>","PeriodicalId":10524,"journal":{"name":"Complex & Intelligent Systems","volume":"86 1","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Complex & Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s40747-025-01792-1","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Remote sensing image pairs of different time phases have complex and changeable semantic contents, and traditional convolutional registration methods are challenging in modeling subtle local changes and global large-scale deformation differences in detail. This results in poor registration performance and poor feature representation. To address these problems, a semantically enhanced dual-graph perception framework is proposed. This framework aims to gradually achieve semantic alignment and precise registration of remote sensing image pairs of different time phases via coarse to fine stages. On the one hand, a newly designed large-selection kernel convolution attention module is used to learn affine transformation parameters. Attention to global semantics perceives the large pixel displacement deviation caused by large-scale deformation, and the association relationship is established between remote sensing image pairs of different time phases. At the same time, dual-graph perception modules are embedded in multiple subspace structures, and the subtle local changes of remote sensing image pairs are modeled through the dynamic aggregation ability of graph perception nodes to achieve coarse registration of remote sensing images. On the other hand, a U-shaped module guided by global attention with deformable convolution is used to refine the local spatial structural features and global contextual semantic information of the rough registration, establish dependencies between channels, and correct the pixel displacement deviation of remote sensing image pairs of different phases through position encoding. It is worth noting that the newly designed weighted loss function supervises the learning of each module and the entire network structure from the perspective of inverse consistency, promoting the network’s optimal performance. Finally, the experimental results on the AerialData and GFRS datasets show that the proposed framework has good registration performance, with mean absolute error (MAE) of 3.64 and 3.81, respectively.
期刊介绍:
Complex & Intelligent Systems aims to provide a forum for presenting and discussing novel approaches, tools and techniques meant for attaining a cross-fertilization between the broad fields of complex systems, computational simulation, and intelligent analytics and visualization. The transdisciplinary research that the journal focuses on will expand the boundaries of our understanding by investigating the principles and processes that underlie many of the most profound problems facing society today.