CFPA-Net: Cross-layer Feature Fusion And Parallel Attention Network For Detection And Classification of Prohibited Items in X-ray Baggage Images

2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS) Pub Date : 2021-11-07 DOI:10.1109/CCIS53392.2021.9754631

Yifan Wei, Yizhuo Wang, Hong Song

{"title":"CFPA-Net: Cross-layer Feature Fusion And Parallel Attention Network For Detection And Classification of Prohibited Items in X-ray Baggage Images","authors":"Yifan Wei, Yizhuo Wang, Hong Song","doi":"10.1109/CCIS53392.2021.9754631","DOIUrl":null,"url":null,"abstract":"As objects in the baggage are often heavily overlapped and cluttered, the X-ray baggage inspection is an inherently challenging task. In this paper, we propose a cross-layer feature fusion and parallel attention network named CFPA-Net to detect and classify the prohibited items in X-ray baggage images. The CFPA-Net is based on RetinaNet with three modules: cross-layer feature extraction fusion module (CEF-Module), paralleled attention module (PA-Module) and FreeAnchor. In CEF-Module, an improved feature pyramid network is proposed by adding multi-directional lateral connections for cross-layer feature extraction and fusion. It can help detect objects of various scales and supplement deficient semantic and localization information for low layer and high layer features respectively. PA-Module is presented to learn the feature relationship and fully utilize the extracted features by introducing two paralleled attention subnets Squeeze-and-Excitation module and Non-local module. PA-Module can help improve the performance of detecting and classification by emphasizing useful features, suppressing useless features selectively and capturing long-range dependencies in images. FreeAnchor is adopted to deal with the restriction of hand-crafted anchor assignment according to Intersection-over-Unit. It can help find the best anchor for each object by learning, and improve the performance of detecting slender objects and the ones in crowded scenes. On the public dataset OPIXray, CFPA-Net achieves 85.82% detection mean Average Precision. Moreover, achieving 81.61% classification mean Average Precision on the SIXray10 dataset. The experimental results show that our proposed CFPA-Net is more accurate and robust for the X-ray baggage inspection with densely occluded objects and complicated backgrounds.","PeriodicalId":191226,"journal":{"name":"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCIS53392.2021.9754631","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

As objects in the baggage are often heavily overlapped and cluttered, the X-ray baggage inspection is an inherently challenging task. In this paper, we propose a cross-layer feature fusion and parallel attention network named CFPA-Net to detect and classify the prohibited items in X-ray baggage images. The CFPA-Net is based on RetinaNet with three modules: cross-layer feature extraction fusion module (CEF-Module), paralleled attention module (PA-Module) and FreeAnchor. In CEF-Module, an improved feature pyramid network is proposed by adding multi-directional lateral connections for cross-layer feature extraction and fusion. It can help detect objects of various scales and supplement deficient semantic and localization information for low layer and high layer features respectively. PA-Module is presented to learn the feature relationship and fully utilize the extracted features by introducing two paralleled attention subnets Squeeze-and-Excitation module and Non-local module. PA-Module can help improve the performance of detecting and classification by emphasizing useful features, suppressing useless features selectively and capturing long-range dependencies in images. FreeAnchor is adopted to deal with the restriction of hand-crafted anchor assignment according to Intersection-over-Unit. It can help find the best anchor for each object by learning, and improve the performance of detecting slender objects and the ones in crowded scenes. On the public dataset OPIXray, CFPA-Net achieves 85.82% detection mean Average Precision. Moreover, achieving 81.61% classification mean Average Precision on the SIXray10 dataset. The experimental results show that our proposed CFPA-Net is more accurate and robust for the X-ray baggage inspection with densely occluded objects and complicated backgrounds.

查看原文本刊更多论文

CFPA-Net: x射线行李图像中违禁物品检测与分类的跨层特征融合并行关注网络

由于行李中的物品经常严重重叠和混乱，x射线行李检查本身就是一项具有挑战性的任务。本文提出了一种名为CFPA-Net的跨层特征融合并行关注网络，对x射线行李图像中的违禁物品进行检测和分类。CFPA-Net基于retanet，包含三个模块:跨层特征提取融合模块(CEF-Module)、并行注意力模块(PA-Module)和FreeAnchor。在CEF-Module中，提出了一种改进的特征金字塔网络，通过增加多向横向连接进行跨层特征提取和融合。它可以帮助检测各种尺度的物体，并分别补充低层和高层特征的语义和定位信息不足。pa模块通过引入两个平行的注意子网挤压激励模块和非局部模块来学习特征关系并充分利用提取的特征。PA-Module可以通过强调有用的特征，选择性地抑制无用的特征和捕获图像中的远程依赖关系来提高检测和分类的性能。采用FreeAnchor解决了基于Intersection-over-Unit的手工锚点分配限制。它可以通过学习为每个物体找到最佳的锚点，提高对细长物体和拥挤场景中物体的检测性能。在公共数据集OPIXray上，CFPA-Net的平均检测精度达到85.82%。此外，在SIXray10数据集上实现了81.61%的分类平均精度。实验结果表明，本文提出的CFPA-Net对于具有密集遮挡物体和复杂背景的x射线行李检测具有更高的准确性和鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE 7th International Conference on Cloud Computing and Intelligent Systems (CCIS)

自引率

0.00%

发文量