A study of transformation-invariances of deep belief networks

Zheng Shou, Yuhao Zhang, H. Cai
{"title":"A study of transformation-invariances of deep belief networks","authors":"Zheng Shou, Yuhao Zhang, H. Cai","doi":"10.1109/IJCNN.2013.6706884","DOIUrl":null,"url":null,"abstract":"In order to learn transformation-invariant features, several effective deep architectures like hierarchical feature learning and variant Deep Belief Networks (DBN) have been proposed. Considering the complexity of those variants, people are interested in whether DBN itself has transformation-invariances. First of all, we use original DBN to test original data. Almost same error rates will be achieved, if we change weights in the bottom interlayer according to transformations occurred in testing data. It implies that weights in the bottom interlayer can store the knowledge to handle transformations such as rotation, shifting, and scaling. Along with the continuous learning ability and good storage of DBN, we present our Weight-Transformed Training Algorithm (WTTA) without augmenting other layers, units or filters to original DBN. Based upon original training method, WTTA is aiming at transforming weights and is still unsupervised. For MNIST handwritten digits recognizing experiments, we adopted 784-100-100-100 DBN to compare the differences of recognizing ability in weights-transformed ranges. Most error rates generated by WTTA were below 25% while most rates generated by original training algorithm exceeded 25%. Then we also did an experiment on part of MIT-CBCL face database, with varying illumination, and the best testing accuracy can be achieved is 87.5%. Besides, similar results can be achieved by datasets covering all kinds of transformations, but WTTA only needs original training data and transform weights after each training loop. Consequently, we can mine inherent transformation-invariances of DBN by WTTA, and DBN itself can recognize transformed data at satisfying error rates without inserting other components.","PeriodicalId":376975,"journal":{"name":"The 2013 International Joint Conference on Neural Networks (IJCNN)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 2013 International Joint Conference on Neural Networks (IJCNN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJCNN.2013.6706884","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

To learn transformation-invariant features, several effective deep architectures have been proposed, such as hierarchical feature learning and variants of the Deep Belief Network (DBN). Given the complexity of those variants, a natural question is whether the DBN itself possesses transformation invariances. First, we test an originally trained DBN on original data: nearly identical error rates are achieved if the weights in the bottom interlayer are changed according to the transformations present in the test data. This implies that the weights in the bottom interlayer can store the knowledge needed to handle transformations such as rotation, shifting, and scaling. Building on the DBN's continuous learning ability and weight storage, we present our Weight-Transformed Training Algorithm (WTTA), which adds no extra layers, units, or filters to the original DBN. WTTA builds on the original training method, aims at transforming the weights, and remains unsupervised. For MNIST handwritten-digit recognition experiments, we adopted a 784-100-100-100 DBN to compare recognition ability across weight-transformation ranges: most error rates produced by WTTA were below 25%, while most produced by the original training algorithm exceeded 25%. We also ran an experiment on part of the MIT-CBCL face database, with varying illumination, where the best test accuracy achieved was 87.5%. Similar results can be obtained with datasets covering all kinds of transformations, but WTTA needs only the original training data, transforming the weights after each training loop. Consequently, WTTA lets us mine the inherent transformation invariances of a DBN, and the DBN itself can recognize transformed data at satisfactory error rates without inserting other components.
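The paper provides no code, but the mechanism the abstract describes, transforming the bottom-interlayer weights as if each hidden unit's incoming weights were a 28x28 image-like filter, can be sketched briefly. The following is a minimal sketch under that reading; the function name, the use of scipy.ndimage for the transformations, the center crop/pad handling of scaling, and the placeholder training loop are all assumptions, not the authors' implementation.

```python
import numpy as np
from scipy.ndimage import rotate, shift, zoom

def transform_bottom_weights(W, angle_deg=0.0, shift_px=(0.0, 0.0), scale=1.0):
    """Apply an image transformation to each bottom-interlayer filter.

    W has shape (784, n_hidden) for a 784-100-100-100 DBN on MNIST;
    each column is treated as a 28x28 image-like filter. This mirrors
    the idea behind WTTA, but it is an illustrative sketch, not the
    authors' implementation.
    """
    n_visible, n_hidden = W.shape
    side = int(np.sqrt(n_visible))  # 28 for MNIST
    W_t = np.empty_like(W)
    for j in range(n_hidden):
        f = W[:, j].reshape(side, side)
        f = rotate(f, angle_deg, reshape=False, order=1)  # rotation
        f = shift(f, shift_px, order=1)                   # shifting
        if scale != 1.0:                                  # scaling
            z = zoom(f, scale, order=1)
            out = np.zeros((side, side))
            off = abs(z.shape[0] - side) // 2
            if z.shape[0] >= side:        # center-crop when enlarged
                out = z[off:off + side, off:off + side]
            else:                         # center-pad when shrunk
                out[off:off + z.shape[0], off:off + z.shape[1]] = z
            f = out
        W_t[:, j] = f.ravel()
    return W_t

# Hypothetical outer loop: after each ordinary unsupervised training
# pass (e.g., contrastive divergence), transform the bottom weights.
# `train_one_epoch` and `rbm` are placeholders, not the paper's API.
#
# for epoch in range(n_epochs):
#     train_one_epoch(rbm, train_data)
#     rbm.W = transform_bottom_weights(rbm.W, angle_deg=10.0)
```

The key design point this illustrates is that WTTA leaves the network architecture untouched: only the bottom-interlayer weight matrix is transformed between training loops, which is why the method needs no extra layers, units, or filters.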