{"title":"胶囊网络中期望最大化算法的一种有效方法","authors":"M. Hasani, Amin Nasim Saravi, Hassan Khotanlou","doi":"10.1109/MVIP49855.2020.9116870","DOIUrl":null,"url":null,"abstract":"Capsule Networks (CapsNets) are brand-new architectures that have shown ground-breaking results in certain areas of Computer Vision (CV). In 2017, Hinton and his team introduced CapsNets with routing-by-agreement in \"Sabour et al\" and in a more recent paper \"Matrix Capsules with EM Routing\" they proposed a more complete architecture with Expectation-Maximization (EM) algorithm. Unlike the traditional convolutional neural networks (CNNs), this architecture is able to preserve the pose of the objects in the picture. Due to this characteristic, it has been able to beat the previous state-of-the-art results on the smallNORB dataset, which includes images with various view points. Also, this new architecture is more robust to white box adversarial attacks. However, CapsNets have two major drawbacks. They can’t perform as well as CNNs on complex datasets and, they need a huge amount of time for training. We try to mitigate these shortcomings by finding optimum settings of EM routing iterations for training CapsNets. Unlike the past studies, we use un-equal numbers of EM routing iterations for different stages of the CapsNet. We manage to achieve higher accuracies than the original CapsNet while training the network up to three times faster. For our research, we use three datasets: Yale face dataset, Belgium Traffic Sign dataset, and Fashion-MNIST dataset.","PeriodicalId":255375,"journal":{"name":"2020 International Conference on Machine Vision and Image Processing (MVIP)","volume":"352 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Efficient Approach for Using Expectation Maximization Algorithm in Capsule Networks\",\"authors\":\"M. Hasani, Amin Nasim Saravi, Hassan Khotanlou\",\"doi\":\"10.1109/MVIP49855.2020.9116870\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Capsule Networks (CapsNets) are brand-new architectures that have shown ground-breaking results in certain areas of Computer Vision (CV). In 2017, Hinton and his team introduced CapsNets with routing-by-agreement in \\\"Sabour et al\\\" and in a more recent paper \\\"Matrix Capsules with EM Routing\\\" they proposed a more complete architecture with Expectation-Maximization (EM) algorithm. Unlike the traditional convolutional neural networks (CNNs), this architecture is able to preserve the pose of the objects in the picture. Due to this characteristic, it has been able to beat the previous state-of-the-art results on the smallNORB dataset, which includes images with various view points. Also, this new architecture is more robust to white box adversarial attacks. However, CapsNets have two major drawbacks. They can’t perform as well as CNNs on complex datasets and, they need a huge amount of time for training. We try to mitigate these shortcomings by finding optimum settings of EM routing iterations for training CapsNets. Unlike the past studies, we use un-equal numbers of EM routing iterations for different stages of the CapsNet. We manage to achieve higher accuracies than the original CapsNet while training the network up to three times faster. 
For our research, we use three datasets: Yale face dataset, Belgium Traffic Sign dataset, and Fashion-MNIST dataset.\",\"PeriodicalId\":255375,\"journal\":{\"name\":\"2020 International Conference on Machine Vision and Image Processing (MVIP)\",\"volume\":\"352 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 International Conference on Machine Vision and Image Processing (MVIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MVIP49855.2020.9116870\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 International Conference on Machine Vision and Image Processing (MVIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MVIP49855.2020.9116870","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An Efficient Approach for Using Expectation Maximization Algorithm in Capsule Networks
Capsule Networks (CapsNets) are a recently introduced class of architectures that have shown ground-breaking results in certain areas of Computer Vision (CV). In 2017, Hinton and his team introduced CapsNets with routing-by-agreement (Sabour et al., "Dynamic Routing Between Capsules"), and in a more recent paper, "Matrix Capsules with EM Routing", they proposed a more complete architecture based on the Expectation-Maximization (EM) algorithm. Unlike traditional convolutional neural networks (CNNs), this architecture preserves the pose of objects in an image. Thanks to this property, it has beaten the previous state-of-the-art results on the smallNORB dataset, which contains images captured from a variety of viewpoints, and it is also more robust to white-box adversarial attacks. However, CapsNets have two major drawbacks: they do not perform as well as CNNs on complex datasets, and they require a large amount of training time. We try to mitigate these shortcomings by finding optimal settings for the number of EM routing iterations used to train CapsNets. Unlike past studies, we use unequal numbers of EM routing iterations at different stages of the CapsNet. We achieve higher accuracy than the original CapsNet while training the network up to three times faster. For our experiments, we use three datasets: the Yale Face dataset, the Belgium Traffic Sign dataset, and Fashion-MNIST.
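To make the idea of per-stage routing concrete, below is a minimal NumPy sketch of EM routing between two capsule layers, following the general form of the M-step/E-step loop from "Matrix Capsules with EM Routing". It is not the authors' implementation: beta_u, beta_a, and lam are learnable in the real network but are fixed scalars here, the votes are random stand-ins for the transformed poses a trained layer would produce, and the per-stage iteration counts [1, 2, 3] are purely illustrative rather than the paper's tuned optimum. The only point the example makes is that each routing stage can take its own iteration count instead of the fixed 3 used in the original CapsNet.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def em_routing(votes, a_in, n_iters, beta_u=0.0, beta_a=0.0, lam=0.01, eps=1e-8):
    """Simplified EM routing between two capsule layers.

    votes   : (n_in, n_out, d) votes cast by input capsules for output capsules
    a_in    : (n_in,)          activations of the input capsules
    n_iters : number of EM iterations to run at this routing stage
    Returns (poses, activations) of the output capsules.
    """
    n_in, n_out, d = votes.shape
    R = np.full((n_in, n_out), 1.0 / n_out)   # start with uniform assignments

    for it in range(n_iters):
        # ---- M-step: fit each output capsule's Gaussian and activation ----
        Rw = R * a_in[:, None]                          # weight assignments by input activation
        Rw_sum = Rw.sum(axis=0) + eps                   # (n_out,)
        mu = (Rw[:, :, None] * votes).sum(axis=0) / Rw_sum[:, None]
        var = (Rw[:, :, None] * (votes - mu[None]) ** 2).sum(axis=0) / Rw_sum[:, None] + eps
        cost = (beta_u + 0.5 * np.log(var)) * Rw_sum[:, None]
        a_out = sigmoid(lam * (beta_a - cost.sum(axis=1)))

        if it == n_iters - 1:
            break
        # ---- E-step: recompute soft assignments from the updated Gaussians ----
        log_p = -0.5 * (np.log(2 * np.pi * var)[None]
                        + (votes - mu[None]) ** 2 / var[None]).sum(axis=2)
        logits = np.log(a_out + eps)[None] + log_p
        logits -= logits.max(axis=1, keepdims=True)     # numerical stability
        R = np.exp(logits)
        R /= R.sum(axis=1, keepdims=True)

    return mu, a_out

# Unequal routing iterations per stage (illustrative values only).
rng = np.random.default_rng(0)
votes = rng.normal(size=(32, 8, 16))    # 32 input capsules, 8 output capsules, 4x4 poses
a_in = rng.random(32)
for stage, n_iters in enumerate([1, 2, 3], start=1):
    poses, a_in = em_routing(votes, a_in, n_iters=n_iters)
    # A trained CapsNet would produce the next stage's votes by transforming these
    # poses with learned matrices; random votes keep the sketch self-contained.
    votes = rng.normal(size=(poses.shape[0], 8, 16))
    print(f"stage {stage}: {n_iters} EM iteration(s), {a_in.shape[0]} output capsules")
```

Because the E-step is skipped after the final M-step, a stage run with a single iteration reduces to one weighted Gaussian fit, which is what makes cutting iterations in some stages a direct saving in training time.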