利用深度学习从距离图像中恢复超二次曲线的初步研究

2019 IEEE International Work Conference on Bioinspired Intelligence (IWOBI) Pub Date : 2019-04-13 DOI:10.1109/IWOBI47054.2019.9114452

T. Oblak, Klemen Grm, A. Jaklič, P. Peer, V. Štruc, F. Solina

{"title":"利用深度学习从距离图像中恢复超二次曲线的初步研究","authors":"T. Oblak, Klemen Grm, A. Jaklič, P. Peer, V. Štruc, F. Solina","doi":"10.1109/IWOBI47054.2019.9114452","DOIUrl":null,"url":null,"abstract":"It has been a longstanding goal in computer vision to describe the 3D physical space in terms of parameterized volumetric models that would allow autonomous machines to understand and interact with their surroundings. Such models are typically motivated by human visual perception and aim to represents all elements of the physical word ranging from individual objects to complex scenes using a small set of parameters. One of the de facto stadards to approach this problem are superquadrics - volumetric models that define various 3D shape primitives and can be fitted to actual 3D data (either in the form of point clouds or range images). However, existing solutions to superquadric recovery involve costly iterative fitting procedures, which limit the applicability of such techniques in practice. To alleviate this problem, we explore in this paper the possibility to recover superquadrics from range images without time consuming iterative parameter estimation techniques by using contemporary deep-learning models, more specifically, convolutional neural networks (CNNs). We pose the superquadric recovery problem as a regression task and develop a CNN regressor that is able to estimate the parameters of a superquadric model from a given range image. We train the regressor on a large set of synthetic range images, each containing a single (unrotated) superquadric shape and evaluate the learned model in comparaitve experiments with the current state-of-the-art. Additionally, we also present a qualitative analysis involving a dataset of real-world objects. The results of our experiments show that the proposed regressor not only outperforms the existing state-of-the-art, but also ensures a $270\\times$ faster execution time.","PeriodicalId":427695,"journal":{"name":"2019 IEEE International Work Conference on Bioinspired Intelligence (IWOBI)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Recovery of Superquadrics from Range Images using Deep Learning: A Preliminary Study\",\"authors\":\"T. Oblak, Klemen Grm, A. Jaklič, P. Peer, V. Štruc, F. Solina\",\"doi\":\"10.1109/IWOBI47054.2019.9114452\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"It has been a longstanding goal in computer vision to describe the 3D physical space in terms of parameterized volumetric models that would allow autonomous machines to understand and interact with their surroundings. Such models are typically motivated by human visual perception and aim to represents all elements of the physical word ranging from individual objects to complex scenes using a small set of parameters. One of the de facto stadards to approach this problem are superquadrics - volumetric models that define various 3D shape primitives and can be fitted to actual 3D data (either in the form of point clouds or range images). However, existing solutions to superquadric recovery involve costly iterative fitting procedures, which limit the applicability of such techniques in practice. To alleviate this problem, we explore in this paper the possibility to recover superquadrics from range images without time consuming iterative parameter estimation techniques by using contemporary deep-learning models, more specifically, convolutional neural networks (CNNs). We pose the superquadric recovery problem as a regression task and develop a CNN regressor that is able to estimate the parameters of a superquadric model from a given range image. We train the regressor on a large set of synthetic range images, each containing a single (unrotated) superquadric shape and evaluate the learned model in comparaitve experiments with the current state-of-the-art. Additionally, we also present a qualitative analysis involving a dataset of real-world objects. The results of our experiments show that the proposed regressor not only outperforms the existing state-of-the-art, but also ensures a $270\\\\times$ faster execution time.\",\"PeriodicalId\":427695,\"journal\":{\"name\":\"2019 IEEE International Work Conference on Bioinspired Intelligence (IWOBI)\",\"volume\":\"55 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 IEEE International Work Conference on Bioinspired Intelligence (IWOBI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IWOBI47054.2019.9114452\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE International Work Conference on Bioinspired Intelligence (IWOBI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IWOBI47054.2019.9114452","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

在计算机视觉领域，用参数化的体积模型描述三维物理空间一直是一个长期的目标，这将使自动机器能够理解周围环境并与之互动。这样的模型通常是由人类视觉感知驱动的，目的是使用一小组参数来表示物理世界的所有元素，从单个对象到复杂场景。解决这个问题的一个事实上的标准是超二次曲面——体积模型，它定义了各种3D形状原语，可以拟合到实际的3D数据中(以点云或距离图像的形式)。然而，现有的超二次采收率解决方案涉及昂贵的迭代拟合程序，这限制了此类技术在实践中的适用性。为了缓解这一问题，我们在本文中探索了使用现代深度学习模型，更具体地说是卷积神经网络(cnn)，从距离图像中恢复超二次曲线的可能性，而无需耗时的迭代参数估计技术。我们将超二次恢复问题作为一个回归任务，并开发了一个能够从给定范围图像估计超二次模型参数的CNN回归器。我们在一组大的合成范围图像上训练回归器，每个图像都包含一个(未旋转的)超二次形，并在与当前最先进的比较实验中评估学习到的模型。此外，我们还提出了一个涉及现实世界对象数据集的定性分析。我们的实验结果表明，所提出的回归器不仅优于现有的最先进的回归器，而且确保了270倍的执行时间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Recovery of Superquadrics from Range Images using Deep Learning: A Preliminary Study

It has been a longstanding goal in computer vision to describe the 3D physical space in terms of parameterized volumetric models that would allow autonomous machines to understand and interact with their surroundings. Such models are typically motivated by human visual perception and aim to represents all elements of the physical word ranging from individual objects to complex scenes using a small set of parameters. One of the de facto stadards to approach this problem are superquadrics - volumetric models that define various 3D shape primitives and can be fitted to actual 3D data (either in the form of point clouds or range images). However, existing solutions to superquadric recovery involve costly iterative fitting procedures, which limit the applicability of such techniques in practice. To alleviate this problem, we explore in this paper the possibility to recover superquadrics from range images without time consuming iterative parameter estimation techniques by using contemporary deep-learning models, more specifically, convolutional neural networks (CNNs). We pose the superquadric recovery problem as a regression task and develop a CNN regressor that is able to estimate the parameters of a superquadric model from a given range image. We train the regressor on a large set of synthetic range images, each containing a single (unrotated) superquadric shape and evaluate the learned model in comparaitve experiments with the current state-of-the-art. Additionally, we also present a qualitative analysis involving a dataset of real-world objects. The results of our experiments show that the proposed regressor not only outperforms the existing state-of-the-art, but also ensures a $270\times$ faster execution time.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 IEEE International Work Conference on Bioinspired Intelligence (IWOBI)

自引率

0.00%

发文量