胶囊网络中期望最大化算法的一种有效方法

M. Hasani, Amin Nasim Saravi, Hassan Khotanlou
{"title":"胶囊网络中期望最大化算法的一种有效方法","authors":"M. Hasani, Amin Nasim Saravi, Hassan Khotanlou","doi":"10.1109/MVIP49855.2020.9116870","DOIUrl":null,"url":null,"abstract":"Capsule Networks (CapsNets) are brand-new architectures that have shown ground-breaking results in certain areas of Computer Vision (CV). In 2017, Hinton and his team introduced CapsNets with routing-by-agreement in \"Sabour et al\" and in a more recent paper \"Matrix Capsules with EM Routing\" they proposed a more complete architecture with Expectation-Maximization (EM) algorithm. Unlike the traditional convolutional neural networks (CNNs), this architecture is able to preserve the pose of the objects in the picture. Due to this characteristic, it has been able to beat the previous state-of-the-art results on the smallNORB dataset, which includes images with various view points. Also, this new architecture is more robust to white box adversarial attacks. However, CapsNets have two major drawbacks. They can’t perform as well as CNNs on complex datasets and, they need a huge amount of time for training. We try to mitigate these shortcomings by finding optimum settings of EM routing iterations for training CapsNets. Unlike the past studies, we use un-equal numbers of EM routing iterations for different stages of the CapsNet. We manage to achieve higher accuracies than the original CapsNet while training the network up to three times faster. For our research, we use three datasets: Yale face dataset, Belgium Traffic Sign dataset, and Fashion-MNIST dataset.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"352 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Efficient Approach for Using Expectation Maximization Algorithm in Capsule Networks\",\"authors\":\"M. Hasani, Amin Nasim Saravi, Hassan Khotanlou\",\"doi\":\"10.1109/MVIP49855.2020.9116870\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Capsule Networks (CapsNets) are brand-new architectures that have shown ground-breaking results in certain areas of Computer Vision (CV). In 2017, Hinton and his team introduced CapsNets with routing-by-agreement in \\\"Sabour et al\\\" and in a more recent paper \\\"Matrix Capsules with EM Routing\\\" they proposed a more complete architecture with Expectation-Maximization (EM) algorithm. Unlike the traditional convolutional neural networks (CNNs), this architecture is able to preserve the pose of the objects in the picture. Due to this characteristic, it has been able to beat the previous state-of-the-art results on the smallNORB dataset, which includes images with various view points. Also, this new architecture is more robust to white box adversarial attacks. However, CapsNets have two major drawbacks. They can’t perform as well as CNNs on complex datasets and, they need a huge amount of time for training. We try to mitigate these shortcomings by finding optimum settings of EM routing iterations for training CapsNets. Unlike the past studies, we use un-equal numbers of EM routing iterations for different stages of the CapsNet. We manage to achieve higher accuracies than the original CapsNet while training the network up to three times faster. For our research, we use three datasets: Yale face dataset, Belgium Traffic Sign dataset, and Fashion-MNIST dataset.\",\"PeriodicalId\":255375,\"journal\":{\"name\":\"2020 International Conference on Machine Vision and Image Processing (MVIP)\",\"volume\":\"352 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Machine Vision and Image Processing (MVIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MVIP49855.2020.9116870\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Machine Vision and Image Processing (MVIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MVIP49855.2020.9116870","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

胶囊网络(CapsNets)是一种全新的架构,在计算机视觉(CV)的某些领域取得了突破性的成果。2017年,Hinton和他的团队在“Sabour等人”中介绍了基于协议路由的capnet,在最近的一篇论文“EM路由的矩阵胶囊”中,他们提出了一个更完整的期望最大化(EM)算法架构。与传统的卷积神经网络(cnn)不同,这种结构能够保持图像中物体的姿态。由于这个特点,它已经能够击败以前在smallNORB数据集上的最先进的结果,该数据集包括具有不同视点的图像。此外,这种新架构对白盒对抗性攻击更加健壮。然而,capnet有两个主要缺点。它们在复杂数据集上的表现不如cnn,而且它们需要大量的训练时间。我们试图通过寻找训练capnet的EM路由迭代的最佳设置来缓解这些缺点。与过去的研究不同,我们对CapsNet的不同阶段使用了不等数量的EM路由迭代。我们设法达到比原始CapsNet更高的精度,同时训练网络的速度提高了三倍。在我们的研究中,我们使用了三个数据集:耶鲁大学人脸数据集、比利时交通标志数据集和Fashion-MNIST数据集。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Efficient Approach for Using Expectation Maximization Algorithm in Capsule Networks
Capsule Networks (CapsNets) are brand-new architectures that have shown ground-breaking results in certain areas of Computer Vision (CV). In 2017, Hinton and his team introduced CapsNets with routing-by-agreement in "Sabour et al" and in a more recent paper "Matrix Capsules with EM Routing" they proposed a more complete architecture with Expectation-Maximization (EM) algorithm. Unlike the traditional convolutional neural networks (CNNs), this architecture is able to preserve the pose of the objects in the picture. Due to this characteristic, it has been able to beat the previous state-of-the-art results on the smallNORB dataset, which includes images with various view points. Also, this new architecture is more robust to white box adversarial attacks. However, CapsNets have two major drawbacks. They can’t perform as well as CNNs on complex datasets and, they need a huge amount of time for training. We try to mitigate these shortcomings by finding optimum settings of EM routing iterations for training CapsNets. Unlike the past studies, we use un-equal numbers of EM routing iterations for different stages of the CapsNet. We manage to achieve higher accuracies than the original CapsNet while training the network up to three times faster. For our research, we use three datasets: Yale face dataset, Belgium Traffic Sign dataset, and Fashion-MNIST dataset.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信