Accurate, Low-latency, Efficient SAR Automatic Target Recognition on FPGA

2022 32nd International Conference on Field-Programmable Logic and Applications (FPL) Pub Date : 2022-08-01 DOI:10.1109/FPL57034.2022.00013

Bingyi Zhang, R. Kannan, V. Prasanna, Carl E. Busart

{"title":"Accurate, Low-latency, Efficient SAR Automatic Target Recognition on FPGA","authors":"Bingyi Zhang, R. Kannan, V. Prasanna, Carl E. Busart","doi":"10.1109/FPL57034.2022.00013","DOIUrl":null,"url":null,"abstract":"Synthetic aperture radar (SAR) automatic target recognition (ATR) is the key technique for remote-sensing image recognition. The state-of-the-art convolutional neural networks (CNNs) for SAR ATR suffer from high computation cost and large memory footprint, making them unsuitable to be deployed on resource-limited platforms, such as small/micro satellites. In this paper, we propose a comprehensive GNN-based model-architecture co-design on FPGA to address the above issues. Model design: we design a novel graph neural network (GNN) for SAR ATR. The proposed GNN model incorporates GraphSAGE layer operators and attention mechanism, achieving comparable accuracy as the state-of-the-art work with near 1/100 computation cost. Then, we propose a pruning approach including weight pruning and input pruning. While weight pruning through lasso regression reduces most parameters without accuracy drop, input pruning eliminates most input pixels with negligible accuracy drop. Architecture design: to fully unleash the computation parallelism within the proposed model, we develop a novel unified hardware architecture that can execute various computation kernels (feature aggregation, feature transformation, graph pooling). The proposed hardware design adopts the Scatter-Gather paradigm to efficiently handle the irregular computation patterns of various computation kernels. We deploy the proposed design on an embedded FPGA (AMD Xilinx ZCU104) and evaluate the performance using MSTAR dataset. Compared with the state-of-the-art CNNs, the proposed GNN achieves comparable accuracy with 1/3258 computation cost and 1/83 model size. Compared with the state-of-the-art CPU/GPU, our FPGA accelerator achieves 14.8×/2.5× speedup (latency) and is 62×/39× more energy efficient.","PeriodicalId":380116,"journal":{"name":"2022 32nd International Conference on Field-Programmable Logic and Applications (FPL)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 32nd International Conference on Field-Programmable Logic and Applications (FPL)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FPL57034.2022.00013","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Synthetic aperture radar (SAR) automatic target recognition (ATR) is the key technique for remote-sensing image recognition. The state-of-the-art convolutional neural networks (CNNs) for SAR ATR suffer from high computation cost and large memory footprint, making them unsuitable to be deployed on resource-limited platforms, such as small/micro satellites. In this paper, we propose a comprehensive GNN-based model-architecture co-design on FPGA to address the above issues. Model design: we design a novel graph neural network (GNN) for SAR ATR. The proposed GNN model incorporates GraphSAGE layer operators and attention mechanism, achieving comparable accuracy as the state-of-the-art work with near 1/100 computation cost. Then, we propose a pruning approach including weight pruning and input pruning. While weight pruning through lasso regression reduces most parameters without accuracy drop, input pruning eliminates most input pixels with negligible accuracy drop. Architecture design: to fully unleash the computation parallelism within the proposed model, we develop a novel unified hardware architecture that can execute various computation kernels (feature aggregation, feature transformation, graph pooling). The proposed hardware design adopts the Scatter-Gather paradigm to efficiently handle the irregular computation patterns of various computation kernels. We deploy the proposed design on an embedded FPGA (AMD Xilinx ZCU104) and evaluate the performance using MSTAR dataset. Compared with the state-of-the-art CNNs, the proposed GNN achieves comparable accuracy with 1/3258 computation cost and 1/83 model size. Compared with the state-of-the-art CPU/GPU, our FPGA accelerator achieves 14.8×/2.5× speedup (latency) and is 62×/39× more energy efficient.

查看原文本刊更多论文

基于FPGA的精确、低延迟、高效的SAR目标自动识别

合成孔径雷达(SAR)自动目标识别是遥感图像识别的关键技术。目前用于SAR ATR的卷积神经网络(cnn)存在计算成本高、内存占用大的问题，不适合部署在小/微卫星等资源有限的平台上。在本文中，我们在FPGA上提出了一种全面的基于gnn的模型架构协同设计来解决上述问题。模型设计:设计了一种新的SAR ATR图神经网络(GNN)。提出的GNN模型结合GraphSAGE层算子和注意力机制，以接近1/100的计算成本实现了与最先进的工作相当的精度。然后，我们提出了一种包括权值剪枝和输入剪枝的剪枝方法。套索回归的权值剪枝减少了大部分参数而精度没有下降，而输入剪枝消除了大部分输入像素，精度下降可以忽略不计。架构设计:为了充分释放模型内的计算并行性，我们开发了一种新的统一硬件架构，可以执行各种计算内核(特征聚合、特征转换、图池化)。所提出的硬件设计采用了Scatter-Gather范式来有效地处理各种计算内核的不规则计算模式。我们将提出的设计部署在嵌入式FPGA (AMD Xilinx ZCU104)上，并使用MSTAR数据集评估性能。与最先进的cnn相比，本文提出的GNN以1/3258的计算成本和1/83的模型尺寸达到了相当的精度。与最先进的CPU/GPU相比，我们的FPGA加速器实现了14.8×/2.5×的加速(延迟)，能效提高了62×/39×。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 32nd International Conference on Field-Programmable Logic and Applications (FPL)

自引率

0.00%

发文量