{"title":"基于初始结构的胶囊网络图像分类","authors":"Chen Zhao","doi":"10.1145/3573834.3574507","DOIUrl":null,"url":null,"abstract":"Convolutional Neural Networks (CNNs) are popular in deep learning, especially in computer vision. But CNNs are poor at extracting spatial information, such as position and direction of entities in an image. A recently proposed model called Capsule network could effectively learn the spatial relations among entities, but it has weak feature extraction ability. The inception structure could extract multi-scale features. So this paper modifies the inception-v1 structure and adds it into Capsule network to strengthen the feature extraction ability. In the modified inception-v1, the max pooling branch is removed to reduce the loss of feature information. And Batch Normalization (BN) layer is added after each convolution to accelerate convergence and reduce over fitting. To organize categories information, a fully connected layer is added after digit Capsules. This paper conducted experiment on two public datasets, and the results show that the proposed model outperforms the original model in accuracy.","PeriodicalId":345434,"journal":{"name":"Proceedings of the 4th International Conference on Advanced Information Science and System","volume":"363 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Capsule Network with Inception Structure for Image Classification\",\"authors\":\"Chen Zhao\",\"doi\":\"10.1145/3573834.3574507\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional Neural Networks (CNNs) are popular in deep learning, especially in computer vision. But CNNs are poor at extracting spatial information, such as position and direction of entities in an image. A recently proposed model called Capsule network could effectively learn the spatial relations among entities, but it has weak feature extraction ability. The inception structure could extract multi-scale features. So this paper modifies the inception-v1 structure and adds it into Capsule network to strengthen the feature extraction ability. In the modified inception-v1, the max pooling branch is removed to reduce the loss of feature information. And Batch Normalization (BN) layer is added after each convolution to accelerate convergence and reduce over fitting. To organize categories information, a fully connected layer is added after digit Capsules. This paper conducted experiment on two public datasets, and the results show that the proposed model outperforms the original model in accuracy.\",\"PeriodicalId\":345434,\"journal\":{\"name\":\"Proceedings of the 4th International Conference on Advanced Information Science and System\",\"volume\":\"363 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th International Conference on Advanced Information Science and System\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3573834.3574507\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Advanced Information Science and System","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3573834.3574507","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Capsule Network with Inception Structure for Image Classification
Convolutional Neural Networks (CNNs) are popular in deep learning, especially in computer vision. But CNNs are poor at extracting spatial information, such as position and direction of entities in an image. A recently proposed model called Capsule network could effectively learn the spatial relations among entities, but it has weak feature extraction ability. The inception structure could extract multi-scale features. So this paper modifies the inception-v1 structure and adds it into Capsule network to strengthen the feature extraction ability. In the modified inception-v1, the max pooling branch is removed to reduce the loss of feature information. And Batch Normalization (BN) layer is added after each convolution to accelerate convergence and reduce over fitting. To organize categories information, a fully connected layer is added after digit Capsules. This paper conducted experiment on two public datasets, and the results show that the proposed model outperforms the original model in accuracy.