红外微小目标边缘实时跟踪及fpga硬件实现

IF 3 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC
Yulian Li, Zhengwen Shen, Xu Wang, Xiao Yang, Jun Wang
{"title":"红外微小目标边缘实时跟踪及fpga硬件实现","authors":"Yulian Li,&nbsp;Zhengwen Shen,&nbsp;Xu Wang,&nbsp;Xiao Yang,&nbsp;Jun Wang","doi":"10.1016/j.dsp.2025.105412","DOIUrl":null,"url":null,"abstract":"<div><div>Infrared tiny object tracking holds significant application value in video surveillance and air defense systems. Deep convolutional neural networks (DCNNs) have demonstrated impressive performance in object tracking over the past few years. However, the high complexity and computing power requirements of the DCNN model make it difficult to deploy on power-sensitive and resource-constrained edge devices. To address this issue, we designed and implemented an FPGA-based infrared tiny object tracker by a hardware-software co-optimization approach, meeting the requirements for accuracy, latency, and power consumption under limited resources. First, we propose a lightweight and hardware-friendly object tracking network, SiamITO-Tiny, effectively improving the tracking accuracy for infrared tiny objects. Second, we design a full-mapping hardware acceleration architecture, mainly comprising a layer-fusion-based convolutional accelerator, a parallel pipelined adder tree, and an efficient data caching scheme. This architecture increases computational parallelism and data access bandwidth through layer fusion, loop optimization, pipelining, and array partitioning, while simultaneously balancing resource consumption and latency, thereby effectively improving computational performance and energy efficiency. Finally, the SiamITO-Tiny network is deployed on the Xilinx All Programmable SoC (ZYNQ) platform ZCU104. Experiments show that our method achieves 45.45 FPS and has a tracking score of 0.9. The computational performance reaches 307.2 GOP/s. The energy efficiency is 30.03 GOP/s/W, which is 39 and 6.81 times higher than it is on the CPU and GPU platforms, respectively. Compared to other acceleration methods, the energy efficiency is improved by 1.4 to 9.3 times, confirming the superiority of the proposed approach in edge real-time tracking.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"167 ","pages":"Article 105412"},"PeriodicalIF":3.0000,"publicationDate":"2025-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Edge real-time tracking and FPGA-based hardware implementation for infrared tiny object\",\"authors\":\"Yulian Li,&nbsp;Zhengwen Shen,&nbsp;Xu Wang,&nbsp;Xiao Yang,&nbsp;Jun Wang\",\"doi\":\"10.1016/j.dsp.2025.105412\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Infrared tiny object tracking holds significant application value in video surveillance and air defense systems. Deep convolutional neural networks (DCNNs) have demonstrated impressive performance in object tracking over the past few years. However, the high complexity and computing power requirements of the DCNN model make it difficult to deploy on power-sensitive and resource-constrained edge devices. To address this issue, we designed and implemented an FPGA-based infrared tiny object tracker by a hardware-software co-optimization approach, meeting the requirements for accuracy, latency, and power consumption under limited resources. First, we propose a lightweight and hardware-friendly object tracking network, SiamITO-Tiny, effectively improving the tracking accuracy for infrared tiny objects. Second, we design a full-mapping hardware acceleration architecture, mainly comprising a layer-fusion-based convolutional accelerator, a parallel pipelined adder tree, and an efficient data caching scheme. This architecture increases computational parallelism and data access bandwidth through layer fusion, loop optimization, pipelining, and array partitioning, while simultaneously balancing resource consumption and latency, thereby effectively improving computational performance and energy efficiency. Finally, the SiamITO-Tiny network is deployed on the Xilinx All Programmable SoC (ZYNQ) platform ZCU104. Experiments show that our method achieves 45.45 FPS and has a tracking score of 0.9. The computational performance reaches 307.2 GOP/s. The energy efficiency is 30.03 GOP/s/W, which is 39 and 6.81 times higher than it is on the CPU and GPU platforms, respectively. Compared to other acceleration methods, the energy efficiency is improved by 1.4 to 9.3 times, confirming the superiority of the proposed approach in edge real-time tracking.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"167 \",\"pages\":\"Article 105412\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425004348\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425004348","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

红外微小目标跟踪在视频监控和防空系统中具有重要的应用价值。在过去的几年中,深度卷积神经网络(DCNNs)在目标跟踪方面表现出了令人印象深刻的性能。然而,DCNN模型的高复杂性和计算能力要求使其难以部署在功率敏感和资源受限的边缘设备上。为了解决这一问题,我们设计并实现了一种基于fpga的红外微小目标跟踪器,采用软硬件协同优化的方法,在有限资源下满足精度、延迟和功耗的要求。首先,我们提出了一种轻量级且硬件友好的目标跟踪网络SiamITO-Tiny,有效提高了红外微小目标的跟踪精度。其次,设计了全映射硬件加速架构,主要包括基于层融合的卷积加速器、并行流水线加法器树和高效数据缓存方案。该架构通过层融合、环路优化、流水线化、阵列分区等方式提高计算并行度和数据访问带宽,同时平衡资源消耗和时延,有效提升计算性能和能效。最后,SiamITO-Tiny网络部署在赛灵思全可编程SoC (ZYNQ)平台ZCU104上。实验表明,该方法达到了45.45 FPS,跟踪分数为0.9。计算性能达到307.2 GOP/s。其能效为30.03 GOP/s/W,分别是CPU和GPU平台的39倍和6.81倍。与其他加速方法相比,能量效率提高了1.4 ~ 9.3倍,证实了该方法在边缘实时跟踪方面的优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Edge real-time tracking and FPGA-based hardware implementation for infrared tiny object
Infrared tiny object tracking holds significant application value in video surveillance and air defense systems. Deep convolutional neural networks (DCNNs) have demonstrated impressive performance in object tracking over the past few years. However, the high complexity and computing power requirements of the DCNN model make it difficult to deploy on power-sensitive and resource-constrained edge devices. To address this issue, we designed and implemented an FPGA-based infrared tiny object tracker by a hardware-software co-optimization approach, meeting the requirements for accuracy, latency, and power consumption under limited resources. First, we propose a lightweight and hardware-friendly object tracking network, SiamITO-Tiny, effectively improving the tracking accuracy for infrared tiny objects. Second, we design a full-mapping hardware acceleration architecture, mainly comprising a layer-fusion-based convolutional accelerator, a parallel pipelined adder tree, and an efficient data caching scheme. This architecture increases computational parallelism and data access bandwidth through layer fusion, loop optimization, pipelining, and array partitioning, while simultaneously balancing resource consumption and latency, thereby effectively improving computational performance and energy efficiency. Finally, the SiamITO-Tiny network is deployed on the Xilinx All Programmable SoC (ZYNQ) platform ZCU104. Experiments show that our method achieves 45.45 FPS and has a tracking score of 0.9. The computational performance reaches 307.2 GOP/s. The energy efficiency is 30.03 GOP/s/W, which is 39 and 6.81 times higher than it is on the CPU and GPU platforms, respectively. Compared to other acceleration methods, the energy efficiency is improved by 1.4 to 9.3 times, confirming the superiority of the proposed approach in edge real-time tracking.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Digital Signal Processing
Digital Signal Processing 工程技术-工程:电子与电气
CiteScore
5.30
自引率
17.20%
发文量
435
审稿时长
66 days
期刊介绍: Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal. The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as: • big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信