Predicting Polarization Beyond Semantics for Wearable Robotics
Kailun Yang, L. Bergasa, Eduardo Romera, Xiao Huang, Kaiwei Wang
2018 IEEE-RAS 18th International Conference on Humanoid Robots (Humanoids), November 2018
DOI: https://doi.org/10.1109/HUMANOIDS.2018.8625005
Citations: 14
Abstract
Semantic perception is a key enabler in robotics, offering a resource-efficient way to exploit visual information for high-level navigation and manipulation tasks. Specular scene elements such as water hazards, transparent glass and metallic surfaces challenge RGB-based pixel-wise semantic segmentation, so polarization imaging has been explored as a complement because it reflects surface characteristics and provides additional attributes. However, polarimetric measurement generally entails prohibitively expensive cameras and highly accurate calibration. Inspired by the representational power of Convolutional Neural Networks (CNNs), we propose to predict polarization information, specifically the per-pixel polarization difference, from monocular RGB images. The core of our approach is a cluster of efficient deep architectures built on factorized convolutions, hierarchical dilations and pyramid representations, designed to produce both semantic and polarimetric estimates in real time. Comprehensive experiments demonstrate qualified accuracy on a wearable exoskeleton humanoid robot.
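The factorized convolutions mentioned in the abstract can be illustrated with a minimal NumPy sketch (a hypothetical single-channel example, not the authors' implementation): a separable 3×3 dilated kernel applied as a 3×1 pass followed by a 1×3 pass yields the same output as the full 2D kernel while using fewer multiply-accumulates per pixel.

```python
import numpy as np

def conv2d(x, k, dilation=1):
    """'Same'-padded single-channel 2D convolution with dilation."""
    kh, kw = k.shape
    ph = (kh - 1) * dilation // 2  # zero padding for rows
    pw = (kw - 1) * dilation // 2  # zero padding for columns
    xp = np.pad(x, ((ph, ph), (pw, pw)))
    out = np.zeros_like(x, dtype=float)
    for i in range(kh):
        for j in range(kw):
            # Accumulate each dilated kernel tap over the whole image.
            out += k[i, j] * xp[i * dilation : i * dilation + x.shape[0],
                                j * dilation : j * dilation + x.shape[1]]
    return out

# Demo: a separable 3x3 kernel vs its factorized 3x1 + 1x3 passes.
rng = np.random.default_rng(0)
x = rng.standard_normal((8, 8))
u = np.array([1.0, 2.0, 1.0])   # vertical smoothing component
v = np.array([1.0, 0.0, -1.0])  # horizontal gradient component
full = conv2d(x, np.outer(u, v), dilation=2)
factored = conv2d(conv2d(x, u[:, None], dilation=2), v[None, :], dilation=2)
```

Here `factored` matches `full` to numerical precision, which is why stacking such 1D passes (with increasing dilation rates for larger receptive fields) is a common way to keep dense-prediction networks real-time capable.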