A functional approach to rotation equivariant non-linearities for Tensor Field Networks

2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2021-06-01 DOI:10.1109/CVPR46437.2021.01297

A. Poulenard, L. Guibas

{"title":"A functional approach to rotation equivariant non-linearities for Tensor Field Networks","authors":"A. Poulenard, L. Guibas","doi":"10.1109/CVPR46437.2021.01297","DOIUrl":null,"url":null,"abstract":"Learning pose invariant representation is a fundamental problem in shape analysis. Most existing deep learning algorithms for 3D shape analysis are not robust to rotations and are often trained on synthetic datasets consisting of pre-aligned shapes, yielding poor generalization to unseen poses. This observation motivates a growing interest in rotation invariant and equivariant methods. The field of rotation equivariant deep learning is developing in recent years thanks to a well established theory of Lie group representations and convolutions. A fundamental problem in equivariant deep learning is to design activation functions which are both informative and preserve equivariance. The recently introduced Tensor Field Network (TFN) framework provides a rotation equivariant network design for point cloud analysis. TFN features undergo a rotation in feature space given a rotation of the input pointcloud. TFN and similar designs consider nonlinearities which operate only over rotation invariant features such as the norm of equivariant features to preserve equivariance, making them unable to capture the directional information. In a recent work entitled \"Gauge Equivariant Mesh CNNs: Anisotropic Convolutions on Geometric Graphs\" Hann et al. interpret 2D rotation equivariant features as Fourier coefficients of functions on the circle. In this work we transpose the idea of Hann et al. to 3D by interpreting TFN features as spherical harmonics coefficients of functions on the sphere. We introduce a new equivariant nonlinearity and pooling for TFN. We show improvments over the original TFN design and other equivariant nonlinearities in classification and segmentation tasks. Furthermore our method is competitive with state of the art rotation invariant methods in some instances.","PeriodicalId":339646,"journal":{"name":"2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","volume":"103 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"27","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPR46437.2021.01297","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 27

Abstract

Learning pose invariant representation is a fundamental problem in shape analysis. Most existing deep learning algorithms for 3D shape analysis are not robust to rotations and are often trained on synthetic datasets consisting of pre-aligned shapes, yielding poor generalization to unseen poses. This observation motivates a growing interest in rotation invariant and equivariant methods. The field of rotation equivariant deep learning is developing in recent years thanks to a well established theory of Lie group representations and convolutions. A fundamental problem in equivariant deep learning is to design activation functions which are both informative and preserve equivariance. The recently introduced Tensor Field Network (TFN) framework provides a rotation equivariant network design for point cloud analysis. TFN features undergo a rotation in feature space given a rotation of the input pointcloud. TFN and similar designs consider nonlinearities which operate only over rotation invariant features such as the norm of equivariant features to preserve equivariance, making them unable to capture the directional information. In a recent work entitled "Gauge Equivariant Mesh CNNs: Anisotropic Convolutions on Geometric Graphs" Hann et al. interpret 2D rotation equivariant features as Fourier coefficients of functions on the circle. In this work we transpose the idea of Hann et al. to 3D by interpreting TFN features as spherical harmonics coefficients of functions on the sphere. We introduce a new equivariant nonlinearity and pooling for TFN. We show improvments over the original TFN design and other equivariant nonlinearities in classification and segmentation tasks. Furthermore our method is competitive with state of the art rotation invariant methods in some instances.

查看原文本刊更多论文

张量场网络旋转等变非线性的泛函方法

姿态不变表示的学习是形状分析中的一个基本问题。大多数用于3D形状分析的现有深度学习算法对旋转不具有鲁棒性，并且通常是在由预对齐形状组成的合成数据集上进行训练的，因此对看不见的姿势的泛化效果很差。这一观察结果激发了人们对旋转不变和等变方法的兴趣。近年来，旋转等变深度学习领域的发展得益于李群表示和卷积理论的完善。等变深度学习的一个基本问题是如何设计既能提供信息又能保持等变的激活函数。最近引入的张量场网络(TFN)框架为点云分析提供了一种旋转等变网络设计。给定输入点云的旋转，TFN特征在特征空间中进行旋转。TFN和类似的设计考虑仅在旋转不变特征(如等变特征的范数)上运行的非线性，以保持等变，使它们无法捕获方向信息。在最近的一篇题为“规范等变网格cnn:几何图上的各向异性卷积”的文章中，Hann等人将二维旋转等变特征解释为圆上函数的傅里叶系数。在这项工作中，我们通过将TFN特征解释为球体上函数的球面谐波系数，将Hann等人的想法转到3D中。我们引入了一种新的等变非线性和TFN池化。我们展示了在分类和分割任务中对原始TFN设计和其他等变非线性的改进。此外，在某些情况下，我们的方法与最先进的旋转不变量方法相竞争。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

自引率

0.00%

发文量