Xinhui Lin , Mingzhu Shi , Yuhao Su , Wenxin Zhang , Yujiao Cai , Lei Liu , Zhaowei Liu
{"title":"SEACount: Semantic-driven Exemplar query Attention framework for image boosting class-agnostic Counting in Internet of Things","authors":"Xinhui Lin , Mingzhu Shi , Yuhao Su , Wenxin Zhang , Yujiao Cai , Lei Liu , Zhaowei Liu","doi":"10.1016/j.iot.2025.101741","DOIUrl":null,"url":null,"abstract":"<div><div>In the Internet of Things (IoT) environment, large amounts of visual data are continuously collected, providing a rich resource for intelligent surveillance and management. For the task of class-agnostic counting in images, this paper proposes the Semantic-driven Exemplar query Attention Counting (SEACount) framework, which aims to quickly adapt and count unseen classes of objects using a few-shot exemplars. This is critical for real-time monitoring and analyzing visual semantic information in IoT. Specifically, we introduce two new components to extend Object Detection with Transformers (DETR): the Exemplar Query Attention (EQA) and the Dynamic Reshaping Module (DRM). EQA injects exemplar queries with rich semantic information into the decoder, facilitating the global image response to exemplar targets and enhancing the exemplar-to-image similarity metrics. The DRM, instead of only utilizing decoder features, fuses them with image features to enhance local details, reduce noise interference, and reshape the feature maps required for predicting density maps. This approach efficiently captures exemplar-relevant targets in images and quickly adapts to new categories without fine-tuning. Experimental results demonstrate that our proposed SEACount framework significantly outperforms other state-of-the-art methods on the latest FSC-147 dataset. We release the code at <span><span>https://github.com/lxinhui1109/SEACount.git</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":29968,"journal":{"name":"Internet of Things","volume":"34 ","pages":"Article 101741"},"PeriodicalIF":7.6000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Internet of Things","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2542660525002550","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In the Internet of Things (IoT) environment, large amounts of visual data are continuously collected, providing a rich resource for intelligent surveillance and management. For the task of class-agnostic counting in images, this paper proposes the Semantic-driven Exemplar query Attention Counting (SEACount) framework, which aims to quickly adapt and count unseen classes of objects using a few-shot exemplars. This is critical for real-time monitoring and analyzing visual semantic information in IoT. Specifically, we introduce two new components to extend Object Detection with Transformers (DETR): the Exemplar Query Attention (EQA) and the Dynamic Reshaping Module (DRM). EQA injects exemplar queries with rich semantic information into the decoder, facilitating the global image response to exemplar targets and enhancing the exemplar-to-image similarity metrics. The DRM, instead of only utilizing decoder features, fuses them with image features to enhance local details, reduce noise interference, and reshape the feature maps required for predicting density maps. This approach efficiently captures exemplar-relevant targets in images and quickly adapts to new categories without fine-tuning. Experimental results demonstrate that our proposed SEACount framework significantly outperforms other state-of-the-art methods on the latest FSC-147 dataset. We release the code at https://github.com/lxinhui1109/SEACount.git.
期刊介绍:
Internet of Things; Engineering Cyber Physical Human Systems is a comprehensive journal encouraging cross collaboration between researchers, engineers and practitioners in the field of IoT & Cyber Physical Human Systems. The journal offers a unique platform to exchange scientific information on the entire breadth of technology, science, and societal applications of the IoT.
The journal will place a high priority on timely publication, and provide a home for high quality.
Furthermore, IOT is interested in publishing topical Special Issues on any aspect of IOT.