Qixian Zhang , Duoqian Miao , Qi Zhang , Cairong Zhao , Hongyun Zhang , Ye Sun , Ruizhi Wang
{"title":"Dynamic frequency selection and spatial interaction fusion for robust person search","authors":"Qixian Zhang , Duoqian Miao , Qi Zhang , Cairong Zhao , Hongyun Zhang , Ye Sun , Ruizhi Wang","doi":"10.1016/j.inffus.2025.103314","DOIUrl":null,"url":null,"abstract":"<div><div>Person search aims to locate target individuals in large image databases captured by multiple non-overlapping cameras. Existing models primarily rely on spatial feature extraction to capture fine-grained local details, which is vulnerable to background clutter and occlusions and leads to unstable feature representations. To address the issues, we propose a Dynamic Frequency Selection and Spatial Interaction Fusion Network (PS-DFSI), marking the first attempt to introduce frequency decoupling and selection into person search. By integrating frequency and spatial features, PS-DFSI enhances feature expressiveness and robustness. Specifically, it comprises two core modules: the Dynamic Frequency Selection Module (DFSM) and the Spatial Frequency Interaction Module (SFIM). DFSM decouples feature maps into low-frequency and high-frequency components using learnable low-pass and high-pass filters, and a frequency selection modulator emphasizes key frequency components via channel attention. SFIM refines local details by fusing frequency-enhanced features with high-level semantic representations, leveraging multi-scale receptive fields and cross-feature attention for efficient spatial-frequency integration. Extensive experiments on CUHK-SYSU and PRW demonstrate that PS-DFSI significantly improves person search performance, validating its effectiveness and robustness.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"124 ","pages":"Article 103314"},"PeriodicalIF":14.7000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525003872","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Person search aims to locate target individuals in large image databases captured by multiple non-overlapping cameras. Existing models primarily rely on spatial feature extraction to capture fine-grained local details, which is vulnerable to background clutter and occlusions and leads to unstable feature representations. To address the issues, we propose a Dynamic Frequency Selection and Spatial Interaction Fusion Network (PS-DFSI), marking the first attempt to introduce frequency decoupling and selection into person search. By integrating frequency and spatial features, PS-DFSI enhances feature expressiveness and robustness. Specifically, it comprises two core modules: the Dynamic Frequency Selection Module (DFSM) and the Spatial Frequency Interaction Module (SFIM). DFSM decouples feature maps into low-frequency and high-frequency components using learnable low-pass and high-pass filters, and a frequency selection modulator emphasizes key frequency components via channel attention. SFIM refines local details by fusing frequency-enhanced features with high-level semantic representations, leveraging multi-scale receptive fields and cross-feature attention for efficient spatial-frequency integration. Extensive experiments on CUHK-SYSU and PRW demonstrate that PS-DFSI significantly improves person search performance, validating its effectiveness and robustness.
期刊介绍:
Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.