Residual Recurrent Neural Network with Sparse Training for Offline Arabic Handwriting Recognition
Ruijie Yan, Liangrui Peng, GuangXiang Bin, Shengjin Wang, Yao Cheng
2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), November 2017
DOI: 10.1109/ICDAR.2017.171
Citations: 9
Abstract
Deep Recurrent Neural Networks (RNNs) suffer from overfitting due to the redundancy of their network structures. We propose a novel temporal and spatial residual learning method for RNNs, followed by sparse training via weight pruning to obtain sparsity in the network parameters. For a Long Short-Term Memory (LSTM) network, we explore combination schemes and parameter settings for temporal and spatial residual learning with sparse training. Experiments are carried out on the IFN/ENIT database. For the character error rate on test set e when training with sets a, b, c, and d, the previously reported best result is 13.42%; the proposed configuration of temporal residual learning followed by sparse training achieves a new state-of-the-art result of 12.06%.
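To make the two ideas in the abstract concrete, here is a minimal sketch of temporal residual learning, interpreted as a shortcut that adds the previous hidden state to the current LSTM output at each time step. The PyTorch LSTMCell-based structure and the exact placement of the residual connection are assumptions for illustration, not the paper's verified formulation.

```python
import torch
import torch.nn as nn

class TemporalResidualLSTM(nn.Module):
    """LSTM layer with an assumed residual shortcut across time steps."""

    def __init__(self, input_size, hidden_size):
        super().__init__()
        self.cell = nn.LSTMCell(input_size, hidden_size)
        self.hidden_size = hidden_size

    def forward(self, x):                      # x: (batch, time, input_size)
        batch, time, _ = x.shape
        h = x.new_zeros(batch, self.hidden_size)
        c = x.new_zeros(batch, self.hidden_size)
        outputs = []
        for t in range(time):
            h_new, c = self.cell(x[:, t], (h, c))
            h = h_new + h                      # temporal residual: add previous hidden state
            outputs.append(h)
        return torch.stack(outputs, dim=1)     # (batch, time, hidden_size)
```

Sparse training by weight pruning can likewise be sketched as magnitude-based pruning with a fixed sparsity ratio. The smallest-magnitude criterion, the 50% default ratio, and the helper names below are common choices assumed here; the paper's actual pruning schedule and thresholds may differ.

```python
import torch

def prune_by_magnitude(module, sparsity=0.5):
    """Zero out the smallest-magnitude weights; return binary masks per parameter."""
    masks = {}
    for name, param in module.named_parameters():
        if "weight" not in name:
            continue                            # leave biases dense (assumption)
        flat = param.detach().abs().flatten()
        k = int(sparsity * flat.numel())
        if k == 0:
            continue
        threshold = flat.kthvalue(k).values     # k-th smallest magnitude
        mask = (param.detach().abs() > threshold).float()
        param.data.mul_(mask)                   # apply mask in place
        masks[name] = mask
    return masks

def apply_masks(module, masks):
    """Re-apply masks after each optimizer step so pruned weights stay at zero."""
    for name, param in module.named_parameters():
        if name in masks:
            param.data.mul_(masks[name])
```

In a typical sparse-training loop, prune_by_magnitude would be called once after an initial dense training phase, and apply_masks after every subsequent optimizer step, so that the surviving weights continue to train while the pruned ones remain zero.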