CFPA-Net: Cross-layer Feature Fusion And Parallel Attention Network For Detection And Classification of Prohibited Items in X-ray Baggage Images

Yifan Wei, Yizhuo Wang, Hong Song
{"title":"CFPA-Net: Cross-layer Feature Fusion And Parallel Attention Network For Detection And Classification of Prohibited Items in X-ray Baggage Images","authors":"Yifan Wei, Yizhuo Wang, Hong Song","doi":"10.1109/CCIS53392.2021.9754631","DOIUrl":null,"url":null,"abstract":"As objects in the baggage are often heavily overlapped and cluttered, the X-ray baggage inspection is an inherently challenging task. In this paper, we propose a cross-layer feature fusion and parallel attention network named CFPA-Net to detect and classify the prohibited items in X-ray baggage images. The CFPA-Net is based on RetinaNet with three modules: cross-layer feature extraction fusion module (CEF-Module), paralleled attention module (PA-Module) and FreeAnchor. In CEF-Module, an improved feature pyramid network is proposed by adding multi-directional lateral connections for cross-layer feature extraction and fusion. It can help detect objects of various scales and supplement deficient semantic and localization information for low layer and high layer features respectively. PA-Module is presented to learn the feature relationship and fully utilize the extracted features by introducing two paralleled attention subnets Squeeze-and-Excitation module and Non-local module. PA-Module can help improve the performance of detecting and classification by emphasizing useful features, suppressing useless features selectively and capturing long-range dependencies in images. FreeAnchor is adopted to deal with the restriction of hand-crafted anchor assignment according to Intersection-over-Unit. It can help find the best anchor for each object by learning, and improve the performance of detecting slender objects and the ones in crowded scenes. On the public dataset OPIXray, CFPA-Net achieves 85.82% detection mean Average Precision. Moreover, achieving 81.61% classification mean Average Precision on the SIXray10 dataset. The experimental results show that our proposed CFPA-Net is more accurate and robust for the X-ray baggage inspection with densely occluded objects and complicated backgrounds.","PeriodicalId":191226,"journal":{"name":"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCIS53392.2021.9754631","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

As objects in the baggage are often heavily overlapped and cluttered, the X-ray baggage inspection is an inherently challenging task. In this paper, we propose a cross-layer feature fusion and parallel attention network named CFPA-Net to detect and classify the prohibited items in X-ray baggage images. The CFPA-Net is based on RetinaNet with three modules: cross-layer feature extraction fusion module (CEF-Module), paralleled attention module (PA-Module) and FreeAnchor. In CEF-Module, an improved feature pyramid network is proposed by adding multi-directional lateral connections for cross-layer feature extraction and fusion. It can help detect objects of various scales and supplement deficient semantic and localization information for low layer and high layer features respectively. PA-Module is presented to learn the feature relationship and fully utilize the extracted features by introducing two paralleled attention subnets Squeeze-and-Excitation module and Non-local module. PA-Module can help improve the performance of detecting and classification by emphasizing useful features, suppressing useless features selectively and capturing long-range dependencies in images. FreeAnchor is adopted to deal with the restriction of hand-crafted anchor assignment according to Intersection-over-Unit. It can help find the best anchor for each object by learning, and improve the performance of detecting slender objects and the ones in crowded scenes. On the public dataset OPIXray, CFPA-Net achieves 85.82% detection mean Average Precision. Moreover, achieving 81.61% classification mean Average Precision on the SIXray10 dataset. The experimental results show that our proposed CFPA-Net is more accurate and robust for the X-ray baggage inspection with densely occluded objects and complicated backgrounds.
CFPA-Net: x射线行李图像中违禁物品检测与分类的跨层特征融合并行关注网络
由于行李中的物品经常严重重叠和混乱,x射线行李检查本身就是一项具有挑战性的任务。本文提出了一种名为CFPA-Net的跨层特征融合并行关注网络,对x射线行李图像中的违禁物品进行检测和分类。CFPA-Net基于retanet,包含三个模块:跨层特征提取融合模块(CEF-Module)、并行注意力模块(PA-Module)和FreeAnchor。在CEF-Module中,提出了一种改进的特征金字塔网络,通过增加多向横向连接进行跨层特征提取和融合。它可以帮助检测各种尺度的物体,并分别补充低层和高层特征的语义和定位信息不足。pa模块通过引入两个平行的注意子网挤压激励模块和非局部模块来学习特征关系并充分利用提取的特征。PA-Module可以通过强调有用的特征,选择性地抑制无用的特征和捕获图像中的远程依赖关系来提高检测和分类的性能。采用FreeAnchor解决了基于Intersection-over-Unit的手工锚点分配限制。它可以通过学习为每个物体找到最佳的锚点,提高对细长物体和拥挤场景中物体的检测性能。在公共数据集OPIXray上,CFPA-Net的平均检测精度达到85.82%。此外,在SIXray10数据集上实现了81.61%的分类平均精度。实验结果表明,本文提出的CFPA-Net对于具有密集遮挡物体和复杂背景的x射线行李检测具有更高的准确性和鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信