Full Perception Head: Bridging the Gap Between Local and Global Features

IF 19.2 1区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Jie Hua;Zhongyuan Wang;Xin Tian;Qin Zou;Jinsheng Xiao;Jiayi Ma
{"title":"Full Perception Head: Bridging the Gap Between Local and Global Features","authors":"Jie Hua;Zhongyuan Wang;Xin Tian;Qin Zou;Jinsheng Xiao;Jiayi Ma","doi":"10.1109/JAS.2025.125333","DOIUrl":null,"url":null,"abstract":"Object detection is a fundamental task in computer vision that involves identifying and localizing objects within an image. Local features extracted by convolutions, etc., capture fine-grained details such as edges and textures, while global features extracted by full connection layers, etc., represent the overall structure and long-range relationships within the image. These features are crucial for accurate object detection, yet most existing methods focus on aggregating local and global features, often overlooking the importance of medium-range dependencies. To address this gap, we propose a novel full perception module (FP-Module), a simple yet effective feature extraction module designed to simultaneously capture local details, medium-range dependencies, and long-range dependencies. Building on this, we construct a full perception head (FP-Head) by cascading multiple FP-Modules, enabling the prediction layer to leverage the most informative features. Experimental results in the MS COCO dataset demonstrate that our approach significantly enhances object recognition and localization, achieving 2.7-5.7 AP<inf>val</inf> gains when integrated into standard object detectors. Notably, the FP-Module is a universal solution that can be seamlessly incorporated into existing detectors to boost performance. The code will be released at https://github.com/Idcogroup/FP-Head.","PeriodicalId":54230,"journal":{"name":"Ieee-Caa Journal of Automatica Sinica","volume":"12 7","pages":"1391-1406"},"PeriodicalIF":19.2000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ieee-Caa Journal of Automatica Sinica","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11004449/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Object detection is a fundamental task in computer vision that involves identifying and localizing objects within an image. Local features extracted by convolutions, etc., capture fine-grained details such as edges and textures, while global features extracted by full connection layers, etc., represent the overall structure and long-range relationships within the image. These features are crucial for accurate object detection, yet most existing methods focus on aggregating local and global features, often overlooking the importance of medium-range dependencies. To address this gap, we propose a novel full perception module (FP-Module), a simple yet effective feature extraction module designed to simultaneously capture local details, medium-range dependencies, and long-range dependencies. Building on this, we construct a full perception head (FP-Head) by cascading multiple FP-Modules, enabling the prediction layer to leverage the most informative features. Experimental results in the MS COCO dataset demonstrate that our approach significantly enhances object recognition and localization, achieving 2.7-5.7 APval gains when integrated into standard object detectors. Notably, the FP-Module is a universal solution that can be seamlessly incorporated into existing detectors to boost performance. The code will be released at https://github.com/Idcogroup/FP-Head.
全感知头:弥合局部和全局特征之间的差距
物体检测是计算机视觉的一项基本任务,它涉及识别和定位图像中的物体。通过卷积等提取的局部特征捕获了边缘、纹理等细粒度细节,而通过全连接层等提取的全局特征则代表了图像内部的整体结构和长期关系。这些特征对于精确的目标检测是至关重要的,然而大多数现有的方法都集中在聚合局部和全局特征上,往往忽略了中等范围依赖关系的重要性。为了解决这一差距,我们提出了一种新颖的全感知模块(FP-Module),这是一种简单而有效的特征提取模块,旨在同时捕获局部细节、中期依赖关系和远程依赖关系。在此基础上,我们通过级联多个fp -模块构建了一个完整的感知头(FP-Head),使预测层能够利用最具信息量的特征。MS COCO数据集的实验结果表明,我们的方法显著增强了目标识别和定位,当集成到标准目标检测器中时,APval增益达到2.7-5.7。值得注意的是,FP-Module是一种通用的解决方案,可以无缝地集成到现有的探测器中,以提高性能。代码将在https://github.com/Idcogroup/FP-Head上发布。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Ieee-Caa Journal of Automatica Sinica
Ieee-Caa Journal of Automatica Sinica Engineering-Control and Systems Engineering
CiteScore
23.50
自引率
11.00%
发文量
880
期刊介绍: The IEEE/CAA Journal of Automatica Sinica is a reputable journal that publishes high-quality papers in English on original theoretical/experimental research and development in the field of automation. The journal covers a wide range of topics including automatic control, artificial intelligence and intelligent control, systems theory and engineering, pattern recognition and intelligent systems, automation engineering and applications, information processing and information systems, network-based automation, robotics, sensing and measurement, and navigation, guidance, and control. Additionally, the journal is abstracted/indexed in several prominent databases including SCIE (Science Citation Index Expanded), EI (Engineering Index), Inspec, Scopus, SCImago, DBLP, CNKI (China National Knowledge Infrastructure), CSCD (Chinese Science Citation Database), and IEEE Xplore.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信