Recognizing handwritten single digits and digit strings using deep architecture of neural networks

Raid Saabni
{"title":"Recognizing handwritten single digits and digit strings using deep architecture of neural networks","authors":"Raid Saabni","doi":"10.1109/ICAIPR.2016.7585206","DOIUrl":null,"url":null,"abstract":"Automatic handwriting recognition of digits and digit strings, are of real interest commercially and as an academic research topic. Recent advances using neural networks and especially deep learning algorithms such as convolutional neural nets present impressive results for single digit recognition. Such results enable developing efficient tools for automatic mail sorting and reading amounts and dates on personal checks. Artificial- Neural-Networks is a powerful technology for classification of visual inputs in many fields due to their ability to approximate complex nonlinear mappings directly from input samples. In this paper we present an approach compromising between the full connectivity of traditional Multi Layer Neural Network trained by Back Propagation and deep architecture. This enables, reasonable training time using a four hidden layers Neural Network and keeps high recognition rates. Pre-trained layers using sparse auto encoders with predefined sequences of training process and rounds, are used to train the net to attain high recognition rates. We have extended the training set to include CVL, MNIST and manually crafted images of single digits from the ORAND-CAR and a private collection of bank checks. Sliding windows technique is used to handle digit strings recognition and obtain encouraging results on CVL and ORAND-CAR benchmarks and our private collection of local bank checks.","PeriodicalId":127231,"journal":{"name":"2016 Third International Conference on Artificial Intelligence and Pattern Recognition (AIPR)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"24","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Third International Conference on Artificial Intelligence and Pattern Recognition (AIPR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIPR.2016.7585206","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 24

Abstract

Automatic handwriting recognition of digits and digit strings, are of real interest commercially and as an academic research topic. Recent advances using neural networks and especially deep learning algorithms such as convolutional neural nets present impressive results for single digit recognition. Such results enable developing efficient tools for automatic mail sorting and reading amounts and dates on personal checks. Artificial- Neural-Networks is a powerful technology for classification of visual inputs in many fields due to their ability to approximate complex nonlinear mappings directly from input samples. In this paper we present an approach compromising between the full connectivity of traditional Multi Layer Neural Network trained by Back Propagation and deep architecture. This enables, reasonable training time using a four hidden layers Neural Network and keeps high recognition rates. Pre-trained layers using sparse auto encoders with predefined sequences of training process and rounds, are used to train the net to attain high recognition rates. We have extended the training set to include CVL, MNIST and manually crafted images of single digits from the ORAND-CAR and a private collection of bank checks. Sliding windows technique is used to handle digit strings recognition and obtain encouraging results on CVL and ORAND-CAR benchmarks and our private collection of local bank checks.
使用神经网络的深层结构识别手写单位数和数字字符串
数字和数字串的自动手写识别,在商业上和学术上都是一个非常有趣的研究课题。最近使用神经网络,特别是卷积神经网络等深度学习算法的进展,在个位数识别方面取得了令人印象深刻的结果。这样的结果使开发有效的工具能够自动邮件分类和读取个人支票上的数量和日期。人工神经网络是一种强大的视觉输入分类技术,因为它能够直接从输入样本中近似复杂的非线性映射。本文提出了一种折衷的方法,在传统的反向传播训练的多层神经网络的全连通性和深度结构之间进行折衷。这使得使用四隐藏层神经网络的训练时间合理,并保持较高的识别率。预训练层采用稀疏自编码器和预定义的训练过程和轮数序列,用于训练网络以获得高识别率。我们已经扩展了训练集,包括CVL, MNIST和手动制作的来自ORAND-CAR的个位数图像和私人收集的银行支票。滑动窗口技术用于处理数字字符串识别,并在CVL和ORAND-CAR基准测试以及我们私人收集的本地银行支票上获得令人鼓舞的结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信