Research on multi-view collaborative detection system for UAV swarms based on Pix2Pix framework and BAM attention mechanism

IF 5 Q1 ENGINEERING, MULTIDISCIPLINARY

Defence Technology(防务技术) Pub Date : 2025-04-01 DOI:10.1016/j.dt.2024.11.002

Yan Ding, Qingxin Cao, Bozhi Zhang, Peilin Li, Zhongjiao Shi

{"title":"Research on multi-view collaborative detection system for UAV swarms based on Pix2Pix framework and BAM attention mechanism","authors":"Yan Ding, Qingxin Cao, Bozhi Zhang, Peilin Li, Zhongjiao Shi","doi":"10.1016/j.dt.2024.11.002","DOIUrl":null,"url":null,"abstract":"<div><div>Drone swarm systems, equipped with photoelectric imaging and intelligent target perception, are essential for reconnaissance and strike missions in complex and high-risk environments. They excel in information sharing, anti-jamming capabilities, and combat performance, making them critical for future warfare. However, varied perspectives in collaborative combat scenarios pose challenges to object detection, hindering traditional detection algorithms and reducing accuracy. Limited angle-prior data and sparse samples further complicate detection. This paper presents the Multi-View Collaborative Detection System, which tackles the challenges of multi-view object detection in collaborative combat scenarios. The system is designed to enhance multi-view image generation and detection algorithms, thereby improving the accuracy and efficiency of object detection across varying perspectives. First, an observation model for three-dimensional targets through line-of-sight angle transformation is constructed, and a multi-view image generation algorithm based on the Pix2Pix network is designed. For object detection, YOLOX is utilized, and a deep feature extraction network, BA-RepCSPDarknet, is developed to address challenges related to small target scale and feature extraction challenges. Additionally, a feature fusion network NS-PAFPN is developed to mitigate the issue of deep feature map information loss in UAV images. A visual attention module (BAM) is employed to manage appearance differences under varying angles, while a feature mapping module (DFM) prevents fine-grained feature loss. These advancements lead to the development of BA-YOLOX, a multi-view object detection network model suitable for drone platforms, enhancing accuracy and effectively targeting small objects.</div></div>","PeriodicalId":58209,"journal":{"name":"Defence Technology(防务技术)","volume":"46 ","pages":"Pages 213-226"},"PeriodicalIF":5.0000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Defence Technology(防务技术)","FirstCategoryId":"1087","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214914724002575","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Drone swarm systems, equipped with photoelectric imaging and intelligent target perception, are essential for reconnaissance and strike missions in complex and high-risk environments. They excel in information sharing, anti-jamming capabilities, and combat performance, making them critical for future warfare. However, varied perspectives in collaborative combat scenarios pose challenges to object detection, hindering traditional detection algorithms and reducing accuracy. Limited angle-prior data and sparse samples further complicate detection. This paper presents the Multi-View Collaborative Detection System, which tackles the challenges of multi-view object detection in collaborative combat scenarios. The system is designed to enhance multi-view image generation and detection algorithms, thereby improving the accuracy and efficiency of object detection across varying perspectives. First, an observation model for three-dimensional targets through line-of-sight angle transformation is constructed, and a multi-view image generation algorithm based on the Pix2Pix network is designed. For object detection, YOLOX is utilized, and a deep feature extraction network, BA-RepCSPDarknet, is developed to address challenges related to small target scale and feature extraction challenges. Additionally, a feature fusion network NS-PAFPN is developed to mitigate the issue of deep feature map information loss in UAV images. A visual attention module (BAM) is employed to manage appearance differences under varying angles, while a feature mapping module (DFM) prevents fine-grained feature loss. These advancements lead to the development of BA-YOLOX, a multi-view object detection network model suitable for drone platforms, enhancing accuracy and effectively targeting small objects.

查看原文本刊更多论文

求助全文

约1分钟内获得全文求助全文

来源期刊

Defence Technology(防务技术) Mechanical Engineering, Control and Systems Engineering, Industrial and Manufacturing Engineering

CiteScore

8.70

自引率

0.00%

发文量

728

审稿时长

25 days

期刊介绍： Defence Technology, a peer reviewed journal, is published monthly and aims to become the best international academic exchange platform for the research related to defence technology. It publishes original research papers having direct bearing on defence, with a balanced coverage on analytical, experimental, numerical simulation and applied investigations. It covers various disciplines of science, technology and engineering.