FPGA-based Medical Image Processing Using Hardware-Software Co-design Approach.

IF 4.9
Abhishek Yadav, Vyom Kumar Gupta, Binod Kumar
{"title":"FPGA-based Medical Image Processing Using Hardware-Software Co-design Approach.","authors":"Abhishek Yadav, Vyom Kumar Gupta, Binod Kumar","doi":"10.1109/TBCAS.2025.3594840","DOIUrl":null,"url":null,"abstract":"<p><p>This paper presents a field-programmable gate array (FPGA) based medical image processing framework using a hardware-software co-design approach for biomedical tasks such as Malaria and Pneumonia detection. The design is implemented on the AMD-Xilinx UltraScale+ MPSoC (ZCU104) FPGA, focusing on optimizing data movement between the Processing System (PS) and Programmable Logic (PL) through a customized high-level synthesis (HLS) process. Depth-wise convolution is employed to reduce computational complexity, while layer fusion is applied to optimize layer-wise execution, and custom cache is integrated to improve memory access efficiency. The accelerated architecture is integrated with AXI interconnects and tested using the PYNQ overlay process. The experimental results demonstrate that the proposed accelerator achieves a throughput of 298.22 FPS and 205.87 FPS for the detection of malaria and pneumonia, respectively. The proposed design significantly improves energy efficiency, consuming 14.62 mJ/img for the detection of malaria and 23.89 mJ/img for the detection of pneumonia. Compared to alternative hardware platforms like Raspberry Pi with Coral TPU, the FPGA-based implementation offers superior performance, achieving 8.3× higher throughput and 4.3× better energy efficiency, making it well-suited for real-time medical image processing applications.</p>","PeriodicalId":94031,"journal":{"name":"IEEE transactions on biomedical circuits and systems","volume":"PP ","pages":""},"PeriodicalIF":4.9000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on biomedical circuits and systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TBCAS.2025.3594840","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This paper presents a field-programmable gate array (FPGA) based medical image processing framework using a hardware-software co-design approach for biomedical tasks such as Malaria and Pneumonia detection. The design is implemented on the AMD-Xilinx UltraScale+ MPSoC (ZCU104) FPGA, focusing on optimizing data movement between the Processing System (PS) and Programmable Logic (PL) through a customized high-level synthesis (HLS) process. Depth-wise convolution is employed to reduce computational complexity, while layer fusion is applied to optimize layer-wise execution, and custom cache is integrated to improve memory access efficiency. The accelerated architecture is integrated with AXI interconnects and tested using the PYNQ overlay process. The experimental results demonstrate that the proposed accelerator achieves a throughput of 298.22 FPS and 205.87 FPS for the detection of malaria and pneumonia, respectively. The proposed design significantly improves energy efficiency, consuming 14.62 mJ/img for the detection of malaria and 23.89 mJ/img for the detection of pneumonia. Compared to alternative hardware platforms like Raspberry Pi with Coral TPU, the FPGA-based implementation offers superior performance, achieving 8.3× higher throughput and 4.3× better energy efficiency, making it well-suited for real-time medical image processing applications.

基于fpga的医学图像处理软硬件协同设计方法。
本文提出了一种基于现场可编程门阵列(FPGA)的医学图像处理框架,该框架采用软硬件协同设计方法用于疟疾和肺炎检测等生物医学任务。该设计是在AMD-Xilinx UltraScale+ MPSoC (ZCU104) FPGA上实现的,重点是通过定制的高级合成(HLS)工艺优化处理系统(PS)和可编程逻辑(PL)之间的数据移动。采用深度卷积降低计算复杂度,采用层融合优化分层执行,集成自定义缓存提高内存访问效率。加速架构与AXI互连集成,并使用PYNQ覆盖过程进行测试。实验结果表明,该加速器检测疟疾和肺炎的吞吐量分别为298.22 FPS和205.87 FPS。该设计显著提高了能效,疟疾检测能耗为14.62 mJ/img,肺炎检测能耗为23.89 mJ/img。与其他硬件平台(如带有Coral TPU的树莓派)相比,基于fpga的实现提供了卓越的性能,实现了8.3倍的吞吐量和4.3倍的能效,使其非常适合实时医疗图像处理应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信