Gaoxin Ma , Xingquan Zhu , Zhen Tian , Yangdong Ye , Zhenfeng Zhu
{"title":"Few-Shot Object Counting with frequency attention and multi-perception head","authors":"Gaoxin Ma , Xingquan Zhu , Zhen Tian , Yangdong Ye , Zhenfeng Zhu","doi":"10.1016/j.neucom.2025.130598","DOIUrl":null,"url":null,"abstract":"<div><div>Few-Shot Object Counting (FSC) is a critical technique in computer vision, which focuses on estimating the number of exemplar objects in target tasks. This technique is highly versatile and applicable in diverse domains, such as crowd monitoring, traffic management, and wildlife tracking. The primary challenge in FSC is achieving robust feature matching despite the gap between the diversity of targets and the scarcity of exemplars. In this research, we propose the Few-shot Object Counting Network with Frequency Attention and Multi-Perception Head (FFMP), which aims to enhance the limited examples by identifying additional instances within query images. The FFMP framework comprises three core components: Frequency Domain Feature Fusion (FDF), Self-Adaptive Feature Enhancement (SFE), and Multi-Perception Head (MP). The FDF component fuses features from both spatial and frequency domains to generate more precise similarity maps. The SFE component identifies and focuses on recurrent target features within query images, enriching the initial set of examples and providing a detailed understanding of the target category. Additionally, the MP component integrates counting and detection tasks, thereby improving overall performance. Extensive experiments on the FSC-147 dataset and various class-specific counting datasets demonstrate that FFMP achieves competitive counting performance compared to state-of-the-art methods. Code is available at <span><span>https://github.com/dsl161/FFMP</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"648 ","pages":"Article 130598"},"PeriodicalIF":5.5000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231225012706","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Few-Shot Object Counting (FSC) is a critical technique in computer vision, which focuses on estimating the number of exemplar objects in target tasks. This technique is highly versatile and applicable in diverse domains, such as crowd monitoring, traffic management, and wildlife tracking. The primary challenge in FSC is achieving robust feature matching despite the gap between the diversity of targets and the scarcity of exemplars. In this research, we propose the Few-shot Object Counting Network with Frequency Attention and Multi-Perception Head (FFMP), which aims to enhance the limited examples by identifying additional instances within query images. The FFMP framework comprises three core components: Frequency Domain Feature Fusion (FDF), Self-Adaptive Feature Enhancement (SFE), and Multi-Perception Head (MP). The FDF component fuses features from both spatial and frequency domains to generate more precise similarity maps. The SFE component identifies and focuses on recurrent target features within query images, enriching the initial set of examples and providing a detailed understanding of the target category. Additionally, the MP component integrates counting and detection tasks, thereby improving overall performance. Extensive experiments on the FSC-147 dataset and various class-specific counting datasets demonstrate that FFMP achieves competitive counting performance compared to state-of-the-art methods. Code is available at https://github.com/dsl161/FFMP.
期刊介绍:
Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.