Small object detection by Edge-aware Neural Network

IF 7.5 2区 计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS
Xianhong Zhang, Tao Lu, Jiaming Wang, Shichang Fu, Fangqun Gao
{"title":"Small object detection by Edge-aware Neural Network","authors":"Xianhong Zhang,&nbsp;Tao Lu,&nbsp;Jiaming Wang,&nbsp;Shichang Fu,&nbsp;Fangqun Gao","doi":"10.1016/j.engappai.2024.109406","DOIUrl":null,"url":null,"abstract":"<div><div>The object detection method is widely applied in industrial inspections. However, many detectors face challenges in accurately capturing the blurred edge details of small objects, resulting in inaccurate bounding box predictions. To address this, we propose an Edge-aware Neural Network (EANN) for small object detection. Firstly, we introduce a Channel and Spatial Attention Fusion Module (CSAFM) to enhance the edge features of small objects, enabling the network to extract more discriminative information. Next, we propose a Multiple Aggregation Feature Pyramid (MAFP) to integrate multi-scale deep features into shallow features. This fusion enriches the shallow features with abundant semantic information, thereby aiding in the detection of small objects. Additionally, we propose a Side and Center Point Aligned Intersection over Union loss (SCPAIoULoss) to enhance the bounding box regression when there is minimal overlap between predicted and ground truth boxes. SCPAIoULoss combines Side Ratio (SR) loss, Center Point Distance (CPD) loss, and Intersection over Union (IoU) loss. The utilization of SR Loss directly constrains the width and height regression of bounding boxes, while CPD loss introduces stricter constraints to facilitate bounding box regression. Furthermore, IoU loss promotes the overall regression of predicted boxes. We extensively experiment on Tiny CityPersons, WiderFace, and our proposed dataset of base station data centers to validate the effectiveness of our method. The results indicate that our method surpasses several State-of-The-Art (SOTA) approaches in small object detection and can be effectively applied to the task of inspecting base station data centers.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624015641","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

The object detection method is widely applied in industrial inspections. However, many detectors face challenges in accurately capturing the blurred edge details of small objects, resulting in inaccurate bounding box predictions. To address this, we propose an Edge-aware Neural Network (EANN) for small object detection. Firstly, we introduce a Channel and Spatial Attention Fusion Module (CSAFM) to enhance the edge features of small objects, enabling the network to extract more discriminative information. Next, we propose a Multiple Aggregation Feature Pyramid (MAFP) to integrate multi-scale deep features into shallow features. This fusion enriches the shallow features with abundant semantic information, thereby aiding in the detection of small objects. Additionally, we propose a Side and Center Point Aligned Intersection over Union loss (SCPAIoULoss) to enhance the bounding box regression when there is minimal overlap between predicted and ground truth boxes. SCPAIoULoss combines Side Ratio (SR) loss, Center Point Distance (CPD) loss, and Intersection over Union (IoU) loss. The utilization of SR Loss directly constrains the width and height regression of bounding boxes, while CPD loss introduces stricter constraints to facilitate bounding box regression. Furthermore, IoU loss promotes the overall regression of predicted boxes. We extensively experiment on Tiny CityPersons, WiderFace, and our proposed dataset of base station data centers to validate the effectiveness of our method. The results indicate that our method surpasses several State-of-The-Art (SOTA) approaches in small object detection and can be effectively applied to the task of inspecting base station data centers.
利用边缘感知神经网络检测小物体
物体检测方法被广泛应用于工业检测领域。然而,许多检测器在准确捕捉小物体模糊边缘细节方面面临挑战,导致边界框预测不准确。针对这一问题,我们提出了一种边缘感知神经网络(EANN)用于小物体检测。首先,我们引入了通道和空间注意力融合模块(CSAFM)来增强小物体的边缘特征,使网络能够提取更多的判别信息。接下来,我们提出了多重聚合特征金字塔(MAFP),将多尺度深度特征整合到浅层特征中。这种融合为浅层特征提供了丰富的语义信息,从而有助于小物体的检测。此外,我们还提出了边点和中心点对齐交集联合损失(SCPAIoULoss),用于在预测框和地面实况框重叠最小的情况下增强边界框回归。SCPAIoULoss 结合了边距损失 (SR)、中心点距离损失 (CPD) 和联合交叉损失 (IoU)。SR 损失的使用直接限制了边界框的宽度和高度回归,而 CPD 损失则引入了更严格的限制以促进边界框回归。此外,IoU 损失还能促进预测框的整体回归。我们在 Tiny CityPersons、WiderFace 和我们提出的基站数据中心数据集上进行了广泛实验,以验证我们方法的有效性。结果表明,我们的方法在小物体检测方面超越了几种最新方法(SOTA),可以有效地应用于基站数据中心的检测任务。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Engineering Applications of Artificial Intelligence
Engineering Applications of Artificial Intelligence 工程技术-工程:电子与电气
CiteScore
9.60
自引率
10.00%
发文量
505
审稿时长
68 days
期刊介绍: Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信