{"title":"基于fpga的医学图像处理软硬件协同设计方法。","authors":"Abhishek Yadav, Vyom Kumar Gupta, Binod Kumar","doi":"10.1109/TBCAS.2025.3594840","DOIUrl":null,"url":null,"abstract":"<p><p>This paper presents a field-programmable gate array (FPGA) based medical image processing framework using a hardware-software co-design approach for biomedical tasks such as Malaria and Pneumonia detection. The design is implemented on the AMD-Xilinx UltraScale+ MPSoC (ZCU104) FPGA, focusing on optimizing data movement between the Processing System (PS) and Programmable Logic (PL) through a customized high-level synthesis (HLS) process. Depth-wise convolution is employed to reduce computational complexity, while layer fusion is applied to optimize layer-wise execution, and custom cache is integrated to improve memory access efficiency. The accelerated architecture is integrated with AXI interconnects and tested using the PYNQ overlay process. The experimental results demonstrate that the proposed accelerator achieves a throughput of 298.22 FPS and 205.87 FPS for the detection of malaria and pneumonia, respectively. The proposed design significantly improves energy efficiency, consuming 14.62 mJ/img for the detection of malaria and 23.89 mJ/img for the detection of pneumonia. Compared to alternative hardware platforms like Raspberry Pi with Coral TPU, the FPGA-based implementation offers superior performance, achieving 8.3× higher throughput and 4.3× better energy efficiency, making it well-suited for real-time medical image processing applications.</p>","PeriodicalId":94031,"journal":{"name":"IEEE transactions on biomedical circuits and systems","volume":"PP ","pages":""},"PeriodicalIF":4.9000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FPGA-based Medical Image Processing Using Hardware-Software Co-design Approach.\",\"authors\":\"Abhishek Yadav, Vyom Kumar Gupta, Binod Kumar\",\"doi\":\"10.1109/TBCAS.2025.3594840\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>This paper presents a field-programmable gate array (FPGA) based medical image processing framework using a hardware-software co-design approach for biomedical tasks such as Malaria and Pneumonia detection. The design is implemented on the AMD-Xilinx UltraScale+ MPSoC (ZCU104) FPGA, focusing on optimizing data movement between the Processing System (PS) and Programmable Logic (PL) through a customized high-level synthesis (HLS) process. Depth-wise convolution is employed to reduce computational complexity, while layer fusion is applied to optimize layer-wise execution, and custom cache is integrated to improve memory access efficiency. The accelerated architecture is integrated with AXI interconnects and tested using the PYNQ overlay process. The experimental results demonstrate that the proposed accelerator achieves a throughput of 298.22 FPS and 205.87 FPS for the detection of malaria and pneumonia, respectively. The proposed design significantly improves energy efficiency, consuming 14.62 mJ/img for the detection of malaria and 23.89 mJ/img for the detection of pneumonia. Compared to alternative hardware platforms like Raspberry Pi with Coral TPU, the FPGA-based implementation offers superior performance, achieving 8.3× higher throughput and 4.3× better energy efficiency, making it well-suited for real-time medical image processing applications.</p>\",\"PeriodicalId\":94031,\"journal\":{\"name\":\"IEEE transactions on biomedical circuits and systems\",\"volume\":\"PP \",\"pages\":\"\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on biomedical circuits and systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TBCAS.2025.3594840\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on biomedical circuits and systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TBCAS.2025.3594840","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
FPGA-based Medical Image Processing Using Hardware-Software Co-design Approach.
This paper presents a field-programmable gate array (FPGA) based medical image processing framework using a hardware-software co-design approach for biomedical tasks such as Malaria and Pneumonia detection. The design is implemented on the AMD-Xilinx UltraScale+ MPSoC (ZCU104) FPGA, focusing on optimizing data movement between the Processing System (PS) and Programmable Logic (PL) through a customized high-level synthesis (HLS) process. Depth-wise convolution is employed to reduce computational complexity, while layer fusion is applied to optimize layer-wise execution, and custom cache is integrated to improve memory access efficiency. The accelerated architecture is integrated with AXI interconnects and tested using the PYNQ overlay process. The experimental results demonstrate that the proposed accelerator achieves a throughput of 298.22 FPS and 205.87 FPS for the detection of malaria and pneumonia, respectively. The proposed design significantly improves energy efficiency, consuming 14.62 mJ/img for the detection of malaria and 23.89 mJ/img for the detection of pneumonia. Compared to alternative hardware platforms like Raspberry Pi with Coral TPU, the FPGA-based implementation offers superior performance, achieving 8.3× higher throughput and 4.3× better energy efficiency, making it well-suited for real-time medical image processing applications.