Adaptation of FPGA architecture for accelerated image preprocessing

Q3 Computer Science

Radioelectronic and Computer Systems Pub Date : 2023-05-25 DOI:10.32620/reks.2023.2.08

Olesia Barkovska, I. Filippenko, Ivan Semenenko, Valentyn Korniienko, P. Sedlaček

{"title":"Adaptation of FPGA architecture for accelerated image preprocessing","authors":"Olesia Barkovska, I. Filippenko, Ivan Semenenko, Valentyn Korniienko, P. Sedlaček","doi":"10.32620/reks.2023.2.08","DOIUrl":null,"url":null,"abstract":"The work is devoted to the topical problem at the intersection of communications theory, digital electronics and numerical analysis, namely the study of image processing methods implementation time on different architectures of computational devices, which are used for software and hardware acceleration. The subject of this article is the investigation of reconfigurable FPGA processing systems in the image processing area. The goal of this work is to create a reconfigurable FPGA-based image processing system and compare it with existing processing architectures. Task. To fulfill the requirements of this work, it is necessary to prepare a practical experiment as well as theoretical research of the proposed architecture; to investigate the process of creating a ZYNQ SoC-based image processing system; and to develop and benchmark the speed of execution for the given set of algorithms with the specific range of the picture resolution. Methods used: FPGA simulation, C++ parallel programming with OpenMP, NVIDIA CUDA, performance analysis tools. The result of this work is the development of a resilient SoC Zynq7000–based computing system with programmable logic and the possibility to load images to FPGA RAM using the resources of ARM core for further processing and output via HDMI video interface, which enables the change of PL configuration at any time during the processing process. Conclusions. The efficiency of the FPGA approach was compared with a parallel image processing method implementation with OpenMP and CUDA. An overview of the ZYNQ platform with specific details related to media processing is presented. The analysis of algorithm speed testing findings based on various outputs proved the advantage (of over 60 times) of hardware acceleration of image processing over software analogs. The obtained results may be used in the development of embedded SoC-based solutions that require acceleration of big data processing. Also, the achieved findings can be used during the process of finding a suitable embedded platform for a certain image-processing task, where high data throughput is one of the most desired requirements.","PeriodicalId":36122,"journal":{"name":"Radioelectronic and Computer Systems","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radioelectronic and Computer Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32620/reks.2023.2.08","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

Abstract

The work is devoted to the topical problem at the intersection of communications theory, digital electronics and numerical analysis, namely the study of image processing methods implementation time on different architectures of computational devices, which are used for software and hardware acceleration. The subject of this article is the investigation of reconfigurable FPGA processing systems in the image processing area. The goal of this work is to create a reconfigurable FPGA-based image processing system and compare it with existing processing architectures. Task. To fulfill the requirements of this work, it is necessary to prepare a practical experiment as well as theoretical research of the proposed architecture; to investigate the process of creating a ZYNQ SoC-based image processing system; and to develop and benchmark the speed of execution for the given set of algorithms with the specific range of the picture resolution. Methods used: FPGA simulation, C++ parallel programming with OpenMP, NVIDIA CUDA, performance analysis tools. The result of this work is the development of a resilient SoC Zynq7000–based computing system with programmable logic and the possibility to load images to FPGA RAM using the resources of ARM core for further processing and output via HDMI video interface, which enables the change of PL configuration at any time during the processing process. Conclusions. The efficiency of the FPGA approach was compared with a parallel image processing method implementation with OpenMP and CUDA. An overview of the ZYNQ platform with specific details related to media processing is presented. The analysis of algorithm speed testing findings based on various outputs proved the advantage (of over 60 times) of hardware acceleration of image processing over software analogs. The obtained results may be used in the development of embedded SoC-based solutions that require acceleration of big data processing. Also, the achieved findings can be used during the process of finding a suitable embedded platform for a certain image-processing task, where high data throughput is one of the most desired requirements.

查看原文本刊更多论文

FPGA结构在加速图像预处理中的应用

这项工作致力于通信理论、数字电子学和数值分析交叉点的主题问题，即研究用于软件和硬件加速的不同计算设备架构上的图像处理方法实现时间。本文的主题是研究图像处理领域中的可重构FPGA处理系统。本工作的目标是创建一个基于FPGA的可重构图像处理系统，并将其与现有的处理架构进行比较。任务为了满足这项工作的要求，有必要对所提出的架构进行实践实验和理论研究；研究了创建基于ZYNQ SoC的图像处理系统的过程；以及开发给定算法集的执行速度并将其与图片分辨率的特定范围进行比较。使用的方法：FPGA仿真，C++与OpenMP并行编程，NVIDIA CUDA，性能分析工具。这项工作的结果是开发了一个具有可编程逻辑的基于SoC Zynq7000的弹性计算系统，并有可能使用ARM内核的资源将图像加载到FPGA RAM中，以便通过HDMI视频接口进行进一步处理和输出，从而在处理过程中随时更改PL配置。结论。将FPGA方法的效率与使用OpenMP和CUDA实现的并行图像处理方法进行了比较。介绍了ZYNQ平台的概况以及与媒体处理相关的具体细节。基于各种输出的算法速度测试结果的分析证明了硬件加速图像处理相对于软件模拟的优势（超过60倍）。所获得的结果可用于开发需要加速大数据处理的基于嵌入式SoC的解决方案。此外，所获得的发现可以在为某个图像处理任务寻找合适的嵌入式平台的过程中使用，其中高数据吞吐量是最期望的要求之一。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊