Hybrid network with difficult–easy learning for concealed object detection in imbalanced terahertz image dataset

IF 1.1 4区 物理与天体物理 Q4 OPTICS
Pengfei Yang, Shaojuan Luo, Meiyun Chen, Genping Zhao, Heng Wu, Chunhua He
{"title":"Hybrid network with difficult–easy learning for concealed object detection in imbalanced terahertz image dataset","authors":"Pengfei Yang, Shaojuan Luo, Meiyun Chen, Genping Zhao, Heng Wu, Chunhua He","doi":"10.1007/s10043-024-00927-y","DOIUrl":null,"url":null,"abstract":"<p>Terahertz imaging technology has been widely used in security inspections due to its ability to detect various concealed hazardous materials and the advantage of being harmless to the human body. However, limited by the terahertz imaging system, it is challenging to detect concealed objects due to hard samples and imbalanced categories caused by terahertz image quality. To solve these issues, we propose a hybrid network with difficult–easy learning (DEL) for concealed object detection in the imbalanced activated terahertz image dataset. Based on the one-stage framework YOLOv5m, a path aggregation hybrid structure (PAHS) is proposed to improve the performance of the proposed network while maintaining real-time detection. Specifically, PAHS with transformer block (TB) and a fine-tuned global context attention (GCA) are designed to fully exploit and fuse the multi-scale information by path aggregation, which improves the detection accuracy of low contrast and noise-interfered objects. To solve the problem of imbalanced categories in the activated terahertz dataset, a DELoss is developed to guide the network classification. Moreover, EIOU is adopted to boost the network training, and a modified B-Ocl loss is used to discriminate the positive and negative samples. Experiments are conducted on a public imbalanced activate terahertz image dataset. The experimental results illustrate that the proposed network achieves competitive performance compared with recently reported state-of-the-art detection methods. Moreover, the proposed method improves the balanced detection ability of different categories.</p>","PeriodicalId":722,"journal":{"name":"Optical Review","volume":"5 1","pages":""},"PeriodicalIF":1.1000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optical Review","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1007/s10043-024-00927-y","RegionNum":4,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"OPTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Terahertz imaging technology has been widely used in security inspections due to its ability to detect various concealed hazardous materials and the advantage of being harmless to the human body. However, limited by the terahertz imaging system, it is challenging to detect concealed objects due to hard samples and imbalanced categories caused by terahertz image quality. To solve these issues, we propose a hybrid network with difficult–easy learning (DEL) for concealed object detection in the imbalanced activated terahertz image dataset. Based on the one-stage framework YOLOv5m, a path aggregation hybrid structure (PAHS) is proposed to improve the performance of the proposed network while maintaining real-time detection. Specifically, PAHS with transformer block (TB) and a fine-tuned global context attention (GCA) are designed to fully exploit and fuse the multi-scale information by path aggregation, which improves the detection accuracy of low contrast and noise-interfered objects. To solve the problem of imbalanced categories in the activated terahertz dataset, a DELoss is developed to guide the network classification. Moreover, EIOU is adopted to boost the network training, and a modified B-Ocl loss is used to discriminate the positive and negative samples. Experiments are conducted on a public imbalanced activate terahertz image dataset. The experimental results illustrate that the proposed network achieves competitive performance compared with recently reported state-of-the-art detection methods. Moreover, the proposed method improves the balanced detection ability of different categories.

在不平衡太赫兹图像数据集中利用难易学习混合网络进行隐蔽物体检测
太赫兹成像技术能够探测各种隐蔽的危险品,而且对人体无害,因此在安检领域得到了广泛应用。然而,受限于太赫兹成像系统,由于太赫兹图像质量造成的硬样本和不平衡类别,要检测出隐藏的物体具有一定的难度。为了解决这些问题,我们提出了一种具有难易学习(DEL)的混合网络,用于在不平衡激活的太赫兹图像数据集中检测隐藏物体。在单级框架 YOLOv5m 的基础上,我们提出了一种路径聚合混合结构(PAHS),以提高所提网络的性能,同时保持检测的实时性。具体来说,PAHS 设计了变压器块(TB)和微调全局上下文注意力(GCA),通过路径聚合充分利用和融合多尺度信息,从而提高了低对比度和噪声干扰物体的检测精度。为解决激活太赫兹数据集中类别不平衡的问题,开发了一种 DELoss 来指导网络分类。此外,还采用了 EIOU 来增强网络训练,并使用改进的 B-Ocl 损失来区分正负样本。实验在公共不平衡激活太赫兹图像数据集上进行。实验结果表明,与最近报道的最先进的检测方法相比,所提出的网络取得了具有竞争力的性能。此外,所提出的方法还提高了不同类别的均衡检测能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Optical Review
Optical Review 物理-光学
CiteScore
2.30
自引率
0.00%
发文量
62
审稿时长
2 months
期刊介绍: Optical Review is an international journal published by the Optical Society of Japan. The scope of the journal is: General and physical optics; Quantum optics and spectroscopy; Information optics; Photonics and optoelectronics; Biomedical photonics and biological optics; Lasers; Nonlinear optics; Optical systems and technologies; Optical materials and manufacturing technologies; Vision; Infrared and short wavelength optics; Cross-disciplinary areas such as environmental, energy, food, agriculture and space technologies; Other optical methods and applications.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信