{"title":"A novel efficient method for training sparse auto-encoders","authors":"Yuxi Luo, Y. Wan","doi":"10.1109/CISP.2013.6745205","DOIUrl":null,"url":null,"abstract":"The success of machine learning algorithms generally depends on data representation. So far there has been a great deal of literature on unsupervised feature learning and joint training of deep learning. There is little specific guidance, however, on combining hand-designed features or the operations on them with features which are learned from unsupervised learning. In this paper, using MNIST (“Modified National Institute of Standards and Technology”) handwritten digit database as an example, we propose a novel method for training sparse auto-encoders. In this method, we first get some small-scale features through training, then generate more features through operations such as rotation and translation. Finally, we use the whole dataset to fine-tune the network. This approach avoids optimizing cost function for all nodes in the traditional sparse auto-encoder training process, which is very time-consuming. Simulation results show that the proposed method can speed up the training process by over 50%, while keeping the recognition accuracy at the same level or even better. The present findings also contribute to the field's understanding of sparse representation that large-scale sparse features can be generated by small-scale sparse features.","PeriodicalId":442320,"journal":{"name":"2013 6th International Congress on Image and Signal Processing (CISP)","volume":"2007 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 6th International Congress on Image and Signal Processing (CISP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISP.2013.6745205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12
Abstract
The success of machine learning algorithms generally depends on data representation. So far there has been a great deal of literature on unsupervised feature learning and joint training of deep learning. There is little specific guidance, however, on combining hand-designed features or the operations on them with features which are learned from unsupervised learning. In this paper, using MNIST (“Modified National Institute of Standards and Technology”) handwritten digit database as an example, we propose a novel method for training sparse auto-encoders. In this method, we first get some small-scale features through training, then generate more features through operations such as rotation and translation. Finally, we use the whole dataset to fine-tune the network. This approach avoids optimizing cost function for all nodes in the traditional sparse auto-encoder training process, which is very time-consuming. Simulation results show that the proposed method can speed up the training process by over 50%, while keeping the recognition accuracy at the same level or even better. The present findings also contribute to the field's understanding of sparse representation that large-scale sparse features can be generated by small-scale sparse features.
机器学习算法的成功通常取决于数据表示。到目前为止,关于深度学习的无监督特征学习和联合训练已经有了大量的文献。然而,关于如何将手工设计的特征或对其进行的操作与从无监督学习中学习到的特征相结合,却很少有具体的指导。本文以MNIST (Modified National Institute of Standards and Technology)手写数字数据库为例,提出了一种训练稀疏自编码器的新方法。该方法首先通过训练得到一些小尺度的特征,然后通过旋转、平移等操作生成更多的特征。最后,我们使用整个数据集对网络进行微调。该方法避免了传统稀疏自编码器训练过程中对所有节点的代价函数进行优化的耗时问题。仿真结果表明,该方法可将训练速度提高50%以上,同时保持相同甚至更高的识别精度。本研究结果也有助于该领域对稀疏表示的理解,即大规模稀疏特征可以由小规模稀疏特征生成。