基于深度学习4j的手写数字识别

Zareen Sakhawat, Saqib Ali, Li Hongzhi
{"title":"基于深度学习4j的手写数字识别","authors":"Zareen Sakhawat, Saqib Ali, Li Hongzhi","doi":"10.1145/3268866.3268888","DOIUrl":null,"url":null,"abstract":"Over the past few decades, Optical Character Recognition (OCR), particularly handwriting recognition, has received much attention. Handwritten Digits Recognition (HDR) means, receive and comprehend handwriting inputs from different sources for example pictures, touch screens, paper documents, and other devices. The field of HDR has witnessed rapid progress owing to the concurrent availability of cheap and well-assembled computers, advancements in learning algorithms, and availability of large databases. In recent years, HDR has received much attention due to ambiguity in learning methods. The aim of the current study was to explore the potential of Deeplearnig4j (DL4J) framework for HDR. DL4J offers the most appropriate framework for the identification of handwritten digits. To execute the task of HDR, Convolutional Neural Network (CNN) is implemented. This study measures the strength and productivity of DL4J for the aforementioned tasks of recognition and attempts to upgrade the procedure. Results obtained shows significant improvement in the recognition rates of hand-typed digits. Though the accuracy and error rates obtained through our proposed system (CNN-DL4J) show variations, on average the accuracy rate remained at 97 %. The aim of the proposed endeavor was to make the path towards digitalization clearer. Though the purpose was only to identify the digits, we can extend it to deal with digits having different sizes, different languages (Urdu, Arabic, Persian), letters, and the task of detecting multi-digit person's handwriting. Hence, it could reduce the typing need to an extent that people will be able to convert their handwritten materials into digital form in one click on its picture. Altogether, this investigation combines CNN with the DL4J framework and took MNIST as a standard dataset to accomplish the task of digit recognition. In addition, the test framework can be assessed in the future for the prospects of image classification and such other pattern recognition tasks.","PeriodicalId":285628,"journal":{"name":"Proceedings of the 2018 International Conference on Artificial Intelligence and Pattern Recognition","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Handwritten Digits Recognition Based on Deep Learning4j\",\"authors\":\"Zareen Sakhawat, Saqib Ali, Li Hongzhi\",\"doi\":\"10.1145/3268866.3268888\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Over the past few decades, Optical Character Recognition (OCR), particularly handwriting recognition, has received much attention. Handwritten Digits Recognition (HDR) means, receive and comprehend handwriting inputs from different sources for example pictures, touch screens, paper documents, and other devices. The field of HDR has witnessed rapid progress owing to the concurrent availability of cheap and well-assembled computers, advancements in learning algorithms, and availability of large databases. In recent years, HDR has received much attention due to ambiguity in learning methods. The aim of the current study was to explore the potential of Deeplearnig4j (DL4J) framework for HDR. DL4J offers the most appropriate framework for the identification of handwritten digits. To execute the task of HDR, Convolutional Neural Network (CNN) is implemented. This study measures the strength and productivity of DL4J for the aforementioned tasks of recognition and attempts to upgrade the procedure. Results obtained shows significant improvement in the recognition rates of hand-typed digits. Though the accuracy and error rates obtained through our proposed system (CNN-DL4J) show variations, on average the accuracy rate remained at 97 %. The aim of the proposed endeavor was to make the path towards digitalization clearer. Though the purpose was only to identify the digits, we can extend it to deal with digits having different sizes, different languages (Urdu, Arabic, Persian), letters, and the task of detecting multi-digit person's handwriting. Hence, it could reduce the typing need to an extent that people will be able to convert their handwritten materials into digital form in one click on its picture. Altogether, this investigation combines CNN with the DL4J framework and took MNIST as a standard dataset to accomplish the task of digit recognition. In addition, the test framework can be assessed in the future for the prospects of image classification and such other pattern recognition tasks.\",\"PeriodicalId\":285628,\"journal\":{\"name\":\"Proceedings of the 2018 International Conference on Artificial Intelligence and Pattern Recognition\",\"volume\":\"9 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2018 International Conference on Artificial Intelligence and Pattern Recognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3268866.3268888\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2018 International Conference on Artificial Intelligence and Pattern Recognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3268866.3268888","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

在过去的几十年里,光学字符识别(OCR),特别是手写识别,受到了广泛的关注。手写数字识别(HDR)意味着接收和理解来自不同来源的手写输入,例如图片、触摸屏、纸质文档和其他设备。由于廉价且组装良好的计算机的同时可用性、学习算法的进步以及大型数据库的可用性,HDR领域取得了迅速的进展。近年来,由于学习方法的模糊性,HDR受到了广泛关注。当前研究的目的是探索Deeplearnig4j (DL4J)框架在HDR中的潜力。DL4J为识别手写数字提供了最合适的框架。为了完成HDR任务,采用了卷积神经网络(CNN)。本研究测量了DL4J在上述识别任务中的强度和效率,并尝试对该过程进行升级。实验结果表明,人工输入数字的识别率有显著提高。虽然通过我们提出的系统(CNN-DL4J)获得的准确率和错误率有所不同,但平均准确率保持在97%。这项提议的目的是使通往数字化的道路更加清晰。虽然目的只是为了识别数字,但我们可以将其扩展到处理具有不同大小、不同语言(乌尔都语、阿拉伯语、波斯语)、字母的数字,以及检测多位数人的笔迹的任务。因此,它可以在一定程度上减少打字需求,人们可以通过点击图片将手写材料转换为数字形式。总的来说,本研究将CNN与DL4J框架结合起来,并以MNIST作为标准数据集来完成数字识别的任务。此外,测试框架可以在未来评估图像分类和其他模式识别任务的前景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Handwritten Digits Recognition Based on Deep Learning4j
Over the past few decades, Optical Character Recognition (OCR), particularly handwriting recognition, has received much attention. Handwritten Digits Recognition (HDR) means, receive and comprehend handwriting inputs from different sources for example pictures, touch screens, paper documents, and other devices. The field of HDR has witnessed rapid progress owing to the concurrent availability of cheap and well-assembled computers, advancements in learning algorithms, and availability of large databases. In recent years, HDR has received much attention due to ambiguity in learning methods. The aim of the current study was to explore the potential of Deeplearnig4j (DL4J) framework for HDR. DL4J offers the most appropriate framework for the identification of handwritten digits. To execute the task of HDR, Convolutional Neural Network (CNN) is implemented. This study measures the strength and productivity of DL4J for the aforementioned tasks of recognition and attempts to upgrade the procedure. Results obtained shows significant improvement in the recognition rates of hand-typed digits. Though the accuracy and error rates obtained through our proposed system (CNN-DL4J) show variations, on average the accuracy rate remained at 97 %. The aim of the proposed endeavor was to make the path towards digitalization clearer. Though the purpose was only to identify the digits, we can extend it to deal with digits having different sizes, different languages (Urdu, Arabic, Persian), letters, and the task of detecting multi-digit person's handwriting. Hence, it could reduce the typing need to an extent that people will be able to convert their handwritten materials into digital form in one click on its picture. Altogether, this investigation combines CNN with the DL4J framework and took MNIST as a standard dataset to accomplish the task of digit recognition. In addition, the test framework can be assessed in the future for the prospects of image classification and such other pattern recognition tasks.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信