{"title":"Rotation & Viewpoint Angle Prediction in Capsule Network","authors":"Husein Sulianto","doi":"10.1145/3373419.3373463","DOIUrl":null,"url":null,"abstract":"Convolutional neural network (CNN) is effective in detecting features and classifying object but less effective in exploring spatial relationships among the features. Capsule network introduces stacking layers called capsules and routing algorithm. Such a capsule structure is proved to handle spatial relationships better than CNN architecture. This paper aims at exploring capsule network ability to adapt with the rotation and viewpoint change in image recognition for MNIST, SmallNORB and CAS-PEAL datasets compared to CNN architecture (VGG-based network). The experimental results show that capsule network performs better than CNN for rotation estimation, whereas CNN architecture performs slightly better than capsule network for viewpoint change. The experiments also show that capsule network may have capability to generalize better in some untrained data for rotation and viewpoint change. Capsule network is quite promising architecture in classification and spatial context.","PeriodicalId":352528,"journal":{"name":"Proceedings of the 2019 3rd International Conference on Advances in Image Processing","volume":"83 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 3rd International Conference on Advances in Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3373419.3373463","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Convolutional neural network (CNN) is effective in detecting features and classifying object but less effective in exploring spatial relationships among the features. Capsule network introduces stacking layers called capsules and routing algorithm. Such a capsule structure is proved to handle spatial relationships better than CNN architecture. This paper aims at exploring capsule network ability to adapt with the rotation and viewpoint change in image recognition for MNIST, SmallNORB and CAS-PEAL datasets compared to CNN architecture (VGG-based network). The experimental results show that capsule network performs better than CNN for rotation estimation, whereas CNN architecture performs slightly better than capsule network for viewpoint change. The experiments also show that capsule network may have capability to generalize better in some untrained data for rotation and viewpoint change. Capsule network is quite promising architecture in classification and spatial context.