{"title":"交叉视角地理定位的频率增强网络","authors":"Qiyuan Zeng, Jiayi Wu, Guorui Feng","doi":"10.1016/j.measurement.2025.117736","DOIUrl":null,"url":null,"abstract":"<div><div>Cross-view geolocalization is a crucial task that aims to match remote sensing images captured from different perspectives within the same scene, particularly important for applications involving drone and satellite imagery, where varying viewpoints can significantly impact analysis accuracy. To address these challenges, we propose FENet (Frequency-Enhanced Network), a novel multiple frequency scale framework that integrates frequency decomposition with adaptive feature fusion for hierarchical feature extraction. FENet uniquely decomposes images into different frequency components, enabling the extraction of fine-grained local details and global structural information at various scales, enhancing the network’s ability to capture complementary features. Leveraging the robust visual encoding capabilities of the CLIP model without modifying its structure, FENet ensures adaptability and efficiency across diverse geolocalization tasks. Its adaptive feature fusion module dynamically integrates features from multiple frequency components, while the InfoNCE loss function supervises cross-scale feature alignment, ensuring consistent and precise matching between views. Extensive experiments on SUES-200, University-1652, and DenseUAV datasets show that FENet outperforms the state-of-the-art methods in cross-view image matching.</div></div>","PeriodicalId":18349,"journal":{"name":"Measurement","volume":"254 ","pages":"Article 117736"},"PeriodicalIF":5.2000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Frequency-Enhanced Network for cross-view geolocalization\",\"authors\":\"Qiyuan Zeng, Jiayi Wu, Guorui Feng\",\"doi\":\"10.1016/j.measurement.2025.117736\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Cross-view geolocalization is a crucial task that aims to match remote sensing images captured from different perspectives within the same scene, particularly important for applications involving drone and satellite imagery, where varying viewpoints can significantly impact analysis accuracy. To address these challenges, we propose FENet (Frequency-Enhanced Network), a novel multiple frequency scale framework that integrates frequency decomposition with adaptive feature fusion for hierarchical feature extraction. FENet uniquely decomposes images into different frequency components, enabling the extraction of fine-grained local details and global structural information at various scales, enhancing the network’s ability to capture complementary features. Leveraging the robust visual encoding capabilities of the CLIP model without modifying its structure, FENet ensures adaptability and efficiency across diverse geolocalization tasks. Its adaptive feature fusion module dynamically integrates features from multiple frequency components, while the InfoNCE loss function supervises cross-scale feature alignment, ensuring consistent and precise matching between views. Extensive experiments on SUES-200, University-1652, and DenseUAV datasets show that FENet outperforms the state-of-the-art methods in cross-view image matching.</div></div>\",\"PeriodicalId\":18349,\"journal\":{\"name\":\"Measurement\",\"volume\":\"254 \",\"pages\":\"Article 117736\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2025-05-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Measurement\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0263224125010954\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Measurement","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0263224125010954","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
Frequency-Enhanced Network for cross-view geolocalization
Cross-view geolocalization is a crucial task that aims to match remote sensing images captured from different perspectives within the same scene, particularly important for applications involving drone and satellite imagery, where varying viewpoints can significantly impact analysis accuracy. To address these challenges, we propose FENet (Frequency-Enhanced Network), a novel multiple frequency scale framework that integrates frequency decomposition with adaptive feature fusion for hierarchical feature extraction. FENet uniquely decomposes images into different frequency components, enabling the extraction of fine-grained local details and global structural information at various scales, enhancing the network’s ability to capture complementary features. Leveraging the robust visual encoding capabilities of the CLIP model without modifying its structure, FENet ensures adaptability and efficiency across diverse geolocalization tasks. Its adaptive feature fusion module dynamically integrates features from multiple frequency components, while the InfoNCE loss function supervises cross-scale feature alignment, ensuring consistent and precise matching between views. Extensive experiments on SUES-200, University-1652, and DenseUAV datasets show that FENet outperforms the state-of-the-art methods in cross-view image matching.
期刊介绍:
Contributions are invited on novel achievements in all fields of measurement and instrumentation science and technology. Authors are encouraged to submit novel material, whose ultimate goal is an advancement in the state of the art of: measurement and metrology fundamentals, sensors, measurement instruments, measurement and estimation techniques, measurement data processing and fusion algorithms, evaluation procedures and methodologies for plants and industrial processes, performance analysis of systems, processes and algorithms, mathematical models for measurement-oriented purposes, distributed measurement systems in a connected world.