{"title":"An Automatic Scheme for Optimizing the Size of Deep Networks","authors":"Wenting Ma, Zhipeng Zhang, Qingqing Xu, Wai Chen","doi":"10.1145/3432291.3432293","DOIUrl":null,"url":null,"abstract":"Large-scale datasets and complex architectures promote the development of CNN models. Although with stronger representation power, larger CNNs are more resource-hungry, which makes it difficult to deploy on resource-constrained Internet of Things (IoT) devices. Another serious challenge occurring with larger CNNs is their susceptibility to overfit with a small training dataset. In this paper, we ask the question: can we find an optimized compact model for a particular data set? We propose a novel scheme to optimize the model size so as to obtain a compact model instead of a larger model. The optimized model achieves higher accuracy than the widely used deeper model. In addition, it decreases the run-time memory and reduces the number of computing operations. This is achieved by applying Minimum Description Length (MDL) to find the optimal size of the model for a particular data set mathematically. MDL--the information-theoretic model selection principle assumes that the simplest, most compact representation model is the best model and most probable explanation of the data. We call our approach OptSize, model size is automatically identified, yielding compact models with comparable accuracy. We empirically demonstrate the effectiveness of our approach with several state-of-the-art CNN models, including VGGNet, ResNet on various image classification datasets. The result shows that compact nets obtained by our proposed method perform better to complex, well-engineered, deeper convolutional architectures.","PeriodicalId":126684,"journal":{"name":"Proceedings of the 2020 3rd International Conference on Signal Processing and Machine Learning","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2020 3rd International Conference on Signal Processing and Machine Learning","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3432291.3432293","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
Large-scale datasets and complex architectures have driven the development of CNN models. Although larger CNNs have stronger representational power, they are also more resource-hungry, which makes them difficult to deploy on resource-constrained Internet of Things (IoT) devices. Another serious challenge with larger CNNs is their susceptibility to overfitting when the training dataset is small. In this paper, we ask: can we find an optimized compact model for a particular dataset? We propose a novel scheme that optimizes the model size so as to obtain a compact model instead of a larger one. The optimized model achieves higher accuracy than widely used deeper models; in addition, it decreases run-time memory and reduces the number of computing operations. This is achieved by applying the Minimum Description Length (MDL) principle to mathematically find the optimal model size for a particular dataset. MDL, an information-theoretic model selection principle, assumes that the simplest, most compact model is the best model and the most probable explanation of the data. We call our approach OptSize; it automatically identifies the model size, yielding compact models with comparable accuracy. We empirically demonstrate the effectiveness of our approach with several state-of-the-art CNN models, including VGGNet and ResNet, on various image classification datasets. The results show that the compact networks obtained by our proposed method outperform complex, well-engineered, deeper convolutional architectures.
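For reference, the standard two-part MDL criterion that this kind of approach builds on selects the model that minimizes the combined codelength of the model itself and of the data encoded with the model's help. This is a general statement of the principle, not the paper's exact objective, which may add further terms or approximations:

\[
M^{*} \;=\; \arg\min_{M \in \mathcal{M}} \; \underbrace{L(M)}_{\text{codelength of the model}} \;+\; \underbrace{L(D \mid M)}_{\text{codelength of the data given the model}}
\]

Here \(\mathcal{M}\) is the candidate model family (e.g., networks of varying width or depth), \(L(M)\) penalizes larger models, and \(L(D \mid M)\) measures how poorly the model fits the data; the minimizer \(M^{*}\) balances compactness against fit.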