A study of the use of SIMD instructions for two image processing algorithms

2012 Western New York Image Processing Workshop Pub Date : 2012-11-01 DOI:10.1109/WNYIPW.2012.6466650

E. Welch, D. Patru, E. Saber, K. Bengtson

引用次数: 12

Abstract

Most image processing algorithms are parallelizable, i.e. the calculation of one pixel does not affect another one. SIMD architectures, including Intel's WMMX and SSE and ARM's NEON, can exploit this fact by processing multiple pixels at a time, which can result in significant speedups. This study investigates the use of NEON SIMD instructions for two image processing algorithms. The latter are altered to process four pixels at a time, for which a theoretical speedup factor of four can be achieved. In addition, parts of the original implementation have been replaced with inline functions or modified at assembly code level. Experimental benchmark data shows the actual execution speed to be between two to three times higher than the original reference. These results prove that SIMD instructions can significantly speedup image processing algorithms through proper code manipulations.

查看原文本刊更多论文

研究了使用SIMD指令的两种图像处理算法

大多数图像处理算法都是可并行的，即一个像素的计算不会影响另一个像素。SIMD架构，包括英特尔的WMMX和SSE以及ARM的NEON，可以通过一次处理多个像素来利用这一事实，这可以带来显着的速度提升。本研究探讨了NEON SIMD指令在两种图像处理算法中的使用。后者被改变为一次处理四个像素，因此理论上可以实现四倍的加速因子。此外，原始实现的部分已被内联函数取代或在汇编代码级别进行了修改。实验基准数据表明，实际执行速度比原始参考高出两到三倍。这些结果证明，通过适当的代码操作，SIMD指令可以显著提高图像处理算法的速度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 Western New York Image Processing Workshop

自引率

0.00%

发文量