Rotation & Viewpoint Angle Prediction in Capsule Network

Proceedings of the 2019 3rd International Conference on Advances in Image Processing Pub Date : 2019-11-08 DOI:10.1145/3373419.3373463

Husein Sulianto

引用次数: 3

Abstract

Convolutional neural network (CNN) is effective in detecting features and classifying object but less effective in exploring spatial relationships among the features. Capsule network introduces stacking layers called capsules and routing algorithm. Such a capsule structure is proved to handle spatial relationships better than CNN architecture. This paper aims at exploring capsule network ability to adapt with the rotation and viewpoint change in image recognition for MNIST, SmallNORB and CAS-PEAL datasets compared to CNN architecture (VGG-based network). The experimental results show that capsule network performs better than CNN for rotation estimation, whereas CNN architecture performs slightly better than capsule network for viewpoint change. The experiments also show that capsule network may have capability to generalize better in some untrained data for rotation and viewpoint change. Capsule network is quite promising architecture in classification and spatial context.

查看原文本刊更多论文

胶囊网络中的旋转与视角预测

卷积神经网络(Convolutional neural network, CNN)在识别特征和分类目标方面是有效的，但在探索特征之间的空间关系方面效果较差。胶囊网络引入了称为胶囊的堆叠层和路由算法。这种胶囊结构被证明比CNN结构更能处理空间关系。对比CNN架构(基于vgg的网络)，探讨胶囊网络在MNIST、SmallNORB和CAS-PEAL数据集图像识别中适应旋转和视点变化的能力。实验结果表明，胶囊网络在旋转估计方面优于CNN，而CNN架构在视点变化方面略优于胶囊网络。实验还表明，胶囊网络对一些未经训练的旋转和视点变化数据有较好的泛化能力。胶囊网络在分类和空间语境上都是一种很有前途的结构。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2019 3rd International Conference on Advances in Image Processing

自引率

0.00%

发文量