{"title":"基于侧窗高斯空间的多模态遥感图像鲁棒匹配","authors":"Chongyue Zheng;Shanshan Li;Chengyou Wang;Bing Zhang","doi":"10.1109/TGRS.2025.3591504","DOIUrl":null,"url":null,"abstract":"Robust and accurate image matching and registration are foundational tasks for numerous applications. However, current methods often fail when dealing with multimodal remote sensing images (MRSIs) that exhibit significant spatial geometric differences (SGDs) and nonlinear radiometric differences (NRDs). To address these challenges, this article proposes a novel MRSI matching (MRSIM) method: matching using side window Gaussian space (MSG). MSG leverages an intuitive concept that human visual perception relies heavily on salient features at image edges for precise matching. Specifically, the proposed method: 1) constructs a multiscale side window Gaussian filter scale space (MSGSS) that preserves edge information at different scales while blurring the image; 2) enhances the repeatability of keypoints by performing corner detection on edge maps; 3) increases descriptor robustness by using second-order gradients combined with steerable filtering; and 4) further utilizes a two-stage matching strategy within a constrained search space and designs a new distance, making full use of densely distributed edge keypoints. Quantitative and qualitative experiments conducted on five datasets spanning 957 image pairs across nine multimodal types demonstrate that MSG outperforms nine advanced algorithms (six feature-based methods: SIFT, OS-SIFT, RIFT, CoFSM, HOWP, and POS-GIFT; and three deep learning-based methods: SuperPoint + SuperGlue, LoFTR, and ReDFeat). The results indicate that MSG achieved a number of correct matches (NCMs) much higher than the compared algorithms, with the highest success rate (SR), lowest RMSE, and good time efficiency while achieving both scale and rotation invariance. Codes are available at <uri>https://github.com/ZCYla/MSG</uri>","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-23"},"PeriodicalIF":8.6000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MSG: Robust Multimodal Remote Sensing Image Matching Using Side Window Gaussian Space\",\"authors\":\"Chongyue Zheng;Shanshan Li;Chengyou Wang;Bing Zhang\",\"doi\":\"10.1109/TGRS.2025.3591504\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Robust and accurate image matching and registration are foundational tasks for numerous applications. However, current methods often fail when dealing with multimodal remote sensing images (MRSIs) that exhibit significant spatial geometric differences (SGDs) and nonlinear radiometric differences (NRDs). To address these challenges, this article proposes a novel MRSI matching (MRSIM) method: matching using side window Gaussian space (MSG). MSG leverages an intuitive concept that human visual perception relies heavily on salient features at image edges for precise matching. Specifically, the proposed method: 1) constructs a multiscale side window Gaussian filter scale space (MSGSS) that preserves edge information at different scales while blurring the image; 2) enhances the repeatability of keypoints by performing corner detection on edge maps; 3) increases descriptor robustness by using second-order gradients combined with steerable filtering; and 4) further utilizes a two-stage matching strategy within a constrained search space and designs a new distance, making full use of densely distributed edge keypoints. Quantitative and qualitative experiments conducted on five datasets spanning 957 image pairs across nine multimodal types demonstrate that MSG outperforms nine advanced algorithms (six feature-based methods: SIFT, OS-SIFT, RIFT, CoFSM, HOWP, and POS-GIFT; and three deep learning-based methods: SuperPoint + SuperGlue, LoFTR, and ReDFeat). The results indicate that MSG achieved a number of correct matches (NCMs) much higher than the compared algorithms, with the highest success rate (SR), lowest RMSE, and good time efficiency while achieving both scale and rotation invariance. Codes are available at <uri>https://github.com/ZCYla/MSG</uri>\",\"PeriodicalId\":13213,\"journal\":{\"name\":\"IEEE Transactions on Geoscience and Remote Sensing\",\"volume\":\"63 \",\"pages\":\"1-23\"},\"PeriodicalIF\":8.6000,\"publicationDate\":\"2025-07-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Geoscience and Remote Sensing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11088253/\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/11088253/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
MSG: Robust Multimodal Remote Sensing Image Matching Using Side Window Gaussian Space
Robust and accurate image matching and registration are foundational tasks for numerous applications. However, current methods often fail when dealing with multimodal remote sensing images (MRSIs) that exhibit significant spatial geometric differences (SGDs) and nonlinear radiometric differences (NRDs). To address these challenges, this article proposes a novel MRSI matching (MRSIM) method: matching using side window Gaussian space (MSG). MSG leverages an intuitive concept that human visual perception relies heavily on salient features at image edges for precise matching. Specifically, the proposed method: 1) constructs a multiscale side window Gaussian filter scale space (MSGSS) that preserves edge information at different scales while blurring the image; 2) enhances the repeatability of keypoints by performing corner detection on edge maps; 3) increases descriptor robustness by using second-order gradients combined with steerable filtering; and 4) further utilizes a two-stage matching strategy within a constrained search space and designs a new distance, making full use of densely distributed edge keypoints. Quantitative and qualitative experiments conducted on five datasets spanning 957 image pairs across nine multimodal types demonstrate that MSG outperforms nine advanced algorithms (six feature-based methods: SIFT, OS-SIFT, RIFT, CoFSM, HOWP, and POS-GIFT; and three deep learning-based methods: SuperPoint + SuperGlue, LoFTR, and ReDFeat). The results indicate that MSG achieved a number of correct matches (NCMs) much higher than the compared algorithms, with the highest success rate (SR), lowest RMSE, and good time efficiency while achieving both scale and rotation invariance. Codes are available at https://github.com/ZCYla/MSG
期刊介绍:
IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.