Orientation and Scale Based Weights Initialization Scheme for Deep Convolutional Neural Networks

A. Abdullah, Wong En Ting
{"title":"Orientation and Scale Based Weights Initialization Scheme for Deep Convolutional Neural Networks","authors":"A. Abdullah, Wong En Ting","doi":"10.17576/apjitm-2020-0902-08","DOIUrl":null,"url":null,"abstract":"Image classification is generally about the understanding of information in the images concerned. The more the system able to understand the image contains, the more effective it will be in classifying desired images. Recent work has shown that the convolutional neural network (CNN) paradigm is useful for obtaining more accurate image classification results. A crucial component in the CNN is the convolution filters which consist of a series of predefined filter weight initialization values. The filter weights are then automatically learned by the neural network throughout the back-propagation training algorithm. However, most initialization schemes used in the deep convolutional neural networks are mainly to deal with vanishing gradient problems. Thus, selecting optimal weights are crucial to improve convergence and minimize the complexity which can enhance the generalization performance. One possible solution is to replace the standard weights with parameterized filters that proven to be efficient in extracting useful features such as Gabor filter bank. The Gabor filter bank is popular due to its ability in dealing with spatial transformation, especially on edges and texture information of different scales and directions. Thus, in this paper, we investigate the effect of utilizing Gabor and convolutional filters on small size kernels of deep VGG-16 architecture. The standard VGG-16 filter is replaced with the Gabor filter bank to obtain uniform distribution at all layers of the network. The result shows that the orientation and scale weights initialization scheme outperforms the standard filter weights on an image classification problem.","PeriodicalId":130300,"journal":{"name":"Asia-Pacific Journal of Information Technology & Multimedia","volume":"25 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Asia-Pacific Journal of Information Technology & Multimedia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17576/apjitm-2020-0902-08","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9

Abstract

Image classification is generally about the understanding of information in the images concerned. The more the system able to understand the image contains, the more effective it will be in classifying desired images. Recent work has shown that the convolutional neural network (CNN) paradigm is useful for obtaining more accurate image classification results. A crucial component in the CNN is the convolution filters which consist of a series of predefined filter weight initialization values. The filter weights are then automatically learned by the neural network throughout the back-propagation training algorithm. However, most initialization schemes used in the deep convolutional neural networks are mainly to deal with vanishing gradient problems. Thus, selecting optimal weights are crucial to improve convergence and minimize the complexity which can enhance the generalization performance. One possible solution is to replace the standard weights with parameterized filters that proven to be efficient in extracting useful features such as Gabor filter bank. The Gabor filter bank is popular due to its ability in dealing with spatial transformation, especially on edges and texture information of different scales and directions. Thus, in this paper, we investigate the effect of utilizing Gabor and convolutional filters on small size kernels of deep VGG-16 architecture. The standard VGG-16 filter is replaced with the Gabor filter bank to obtain uniform distribution at all layers of the network. The result shows that the orientation and scale weights initialization scheme outperforms the standard filter weights on an image classification problem.
基于方向和尺度的深度卷积神经网络权重初始化方案
图像分类一般是关于对相关图像中信息的理解。系统对图像中包含的信息理解得越多,对所需图像的分类就越有效。最近的研究表明,卷积神经网络(CNN)范式对于获得更准确的图像分类结果是有用的。卷积滤波器是CNN的一个重要组成部分,它由一系列预定义的滤波器权值初始化值组成。然后,神经网络通过反向传播训练算法自动学习滤波器的权重。然而,深度卷积神经网络中使用的大多数初始化方案主要是处理梯度消失问题。因此,选择最优权值对于提高算法的收敛性和降低算法复杂度至关重要,从而提高算法的泛化性能。一种可能的解决方案是用参数化滤波器代替标准权重,这种滤波器在提取有用的特征方面被证明是有效的,比如Gabor滤波器组。Gabor滤波器组因其处理空间变换的能力,特别是对不同尺度和方向的边缘和纹理信息的处理能力而受到广泛的应用。因此,在本文中,我们研究了使用Gabor和卷积滤波器对深度VGG-16架构的小尺寸核的影响。将标准的VGG-16滤波器替换为Gabor滤波器组,以获得网络各层的均匀分布。结果表明,方向和尺度权重初始化方案在图像分类问题上优于标准滤波器权重初始化方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信