Real-time smart stereo camera based on FPGA-SoC

2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids) Pub Date : 2017-11-01 DOI:10.1109/HUMANOIDS.2017.8246891

Soenke Michalik, S. Michalik, J. Naghmouchi, Mladen Berekovic

{"title":"Real-time smart stereo camera based on FPGA-SoC","authors":"Soenke Michalik, S. Michalik, J. Naghmouchi, Mladen Berekovic","doi":"10.1109/HUMANOIDS.2017.8246891","DOIUrl":null,"url":null,"abstract":"Stereo image processing is one of the most demanding tasks in the field of 3D computer vision and robot vision requiring high-performance computing capabilities within embedded systems. Real-time constraints for autonomous vehicles such as humanoid robots, lead to hardware acceleration approaches for high resolution stereo imaging in human-like vision systems, where commonly FPGA device are employed to handle very high sensor data rates. This work presents a realtime smart stereo camera system implementation resembling the full stereo processing pipeline in a single FPGA device. We introduce the novel memory optimized stereo processing algorithm ”Sparse Retina Census Correlation” (SRCC) that embodies a combination of two well established window based stereo matching approaches. We have leveraged a Sum of Absolute Difference (SAD) of Sobel-filtered images and a Sum of Hamming Distance (SHD) using a modified Retina based Census Transform for increased robustness to lighting variations and for high accuracy. A color rectification module has been implemented to cope with the high frame rate of the stereo pipelining calculating image transformations and rectified pixel coordinates in real-time using parameters for camera intrinsic, image rotation, image distortion and image projection. In addition multiple post-processing algorithms like texture filtering, uniqueness filtering, speckle removal and disparity to depth conversion have been implemented to further enhance the output results. The presented smart camera solution has demonstrated real-time stereo processing of 1280×720 pixel depth images with 256 disparities on a Zynq XC7Z030 FPGA device at 60fps. Due to the universal USB3.0 UVC interface and the onboard depth calculation it is a replacement for RGBD 3D-Sensors with improved image quality and outdoor performance. The camera can easily be used in conjunction with ROS-enabled robots and in automotive or industrial applications.","PeriodicalId":143992,"journal":{"name":"2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HUMANOIDS.2017.8246891","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 11

Abstract

Stereo image processing is one of the most demanding tasks in the field of 3D computer vision and robot vision requiring high-performance computing capabilities within embedded systems. Real-time constraints for autonomous vehicles such as humanoid robots, lead to hardware acceleration approaches for high resolution stereo imaging in human-like vision systems, where commonly FPGA device are employed to handle very high sensor data rates. This work presents a realtime smart stereo camera system implementation resembling the full stereo processing pipeline in a single FPGA device. We introduce the novel memory optimized stereo processing algorithm ”Sparse Retina Census Correlation” (SRCC) that embodies a combination of two well established window based stereo matching approaches. We have leveraged a Sum of Absolute Difference (SAD) of Sobel-filtered images and a Sum of Hamming Distance (SHD) using a modified Retina based Census Transform for increased robustness to lighting variations and for high accuracy. A color rectification module has been implemented to cope with the high frame rate of the stereo pipelining calculating image transformations and rectified pixel coordinates in real-time using parameters for camera intrinsic, image rotation, image distortion and image projection. In addition multiple post-processing algorithms like texture filtering, uniqueness filtering, speckle removal and disparity to depth conversion have been implemented to further enhance the output results. The presented smart camera solution has demonstrated real-time stereo processing of 1280×720 pixel depth images with 256 disparities on a Zynq XC7Z030 FPGA device at 60fps. Due to the universal USB3.0 UVC interface and the onboard depth calculation it is a replacement for RGBD 3D-Sensors with improved image quality and outdoor performance. The camera can easily be used in conjunction with ROS-enabled robots and in automotive or industrial applications.

查看原文本刊更多论文

基于FPGA-SoC的实时智能立体摄像头

立体图像处理是三维计算机视觉和机器人视觉领域中要求最高的任务之一，需要嵌入式系统中的高性能计算能力。类人机器人等自动驾驶车辆的实时性限制导致了类人视觉系统中高分辨率立体成像的硬件加速方法，其中通常采用FPGA器件来处理非常高的传感器数据速率。本工作提出了一个实时智能立体相机系统的实现，类似于单个FPGA器件中的全立体处理管道。我们介绍了一种新的内存优化立体处理算法“稀疏视网膜普查相关”(SRCC)，它结合了两种基于窗口的立体匹配方法。我们利用sobel滤波图像的绝对差之和(SAD)和汉明距离之和(SHD)，使用改进的基于视网膜的人口普查变换来增加对照明变化的鲁棒性和高精度。为了解决立体流水线的高帧率问题，实现了色彩校正模块，利用摄像机内禀参数、图像旋转参数、图像畸变参数和图像投影参数，实时计算图像变换并校正像素坐标。此外，还实现了纹理滤波、唯一性滤波、斑点去除、视差深度转换等多种后处理算法，进一步增强了输出结果。所提出的智能相机解决方案在Zynq XC7Z030 FPGA器件上以60fps的速度演示了1280×720像素深度图像的实时立体处理，具有256个差异。由于通用USB3.0 UVC接口和板载深度计算，它是RGBD 3d传感器的替代品，具有改进的图像质量和户外性能。该相机可以很容易地与ros机器人结合使用，并用于汽车或工业应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids)

自引率

0.00%

发文量