Handwritten Bengali Number Detection using Region Proposal Network

Shaharat Tajrean, Mohammad Abu Yousuf
{"title":"Handwritten Bengali Number Detection using Region Proposal Network","authors":"Shaharat Tajrean, Mohammad Abu Yousuf","doi":"10.1109/ICBSLP47725.2019.202049","DOIUrl":null,"url":null,"abstract":"As a seventh most spoken native language, Bengali needs a robust and accurate optical character recognition (OCR) especially for number detection. As there is no publicly available well-organized dataset for Bangla number OCR, a synthesized dataset was generated to fill the lack of available data. The recent advancement in artificial intelligence using deep neural networks easily outperforms prior hand selected feature-based machine learning approaches. As the region proposal networks (RPN) in deep neural networks perform very well in detecting objects, it can be used for digit detection in an image. So, in this work a very robust Bengali handwritten number detection system is presented where with the help of deep neural networks and a very well-organized, unbiased generated dataset we achieved state of the art result in handwritten Bangla number detection. This system beats any related prior works by a large margin while considering a real world dataset for benchmarks. The overall detection accuracy was 97.8%. The processing can be done real-time with about 35 images per second using a GPU. Also, while implementing the solution is completely based on python, the framework used for deep learning is Google’s Tensorflow and the dependencies, all of which are publicly available.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBSLP47725.2019.202049","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

As a seventh most spoken native language, Bengali needs a robust and accurate optical character recognition (OCR) especially for number detection. As there is no publicly available well-organized dataset for Bangla number OCR, a synthesized dataset was generated to fill the lack of available data. The recent advancement in artificial intelligence using deep neural networks easily outperforms prior hand selected feature-based machine learning approaches. As the region proposal networks (RPN) in deep neural networks perform very well in detecting objects, it can be used for digit detection in an image. So, in this work a very robust Bengali handwritten number detection system is presented where with the help of deep neural networks and a very well-organized, unbiased generated dataset we achieved state of the art result in handwritten Bangla number detection. This system beats any related prior works by a large margin while considering a real world dataset for benchmarks. The overall detection accuracy was 97.8%. The processing can be done real-time with about 35 images per second using a GPU. Also, while implementing the solution is completely based on python, the framework used for deep learning is Google’s Tensorflow and the dependencies, all of which are publicly available.
基于区域建议网络的手写体孟加拉数字检测
孟加拉语作为世界第七大母语,需要一种鲁棒、准确的光学字符识别(OCR),尤其是数字检测。由于没有公开可用的组织良好的孟加拉数字OCR数据集,因此生成了一个合成数据集来填补可用数据的不足。使用深度神经网络的人工智能的最新进展轻松优于先前手工选择的基于特征的机器学习方法。由于深度神经网络中的区域建议网络(RPN)具有很好的目标检测性能,因此可以用于图像中的数字检测。因此,在这项工作中,我们提出了一个非常强大的孟加拉手写数字检测系统,在深度神经网络和一个非常有组织的、无偏的生成数据集的帮助下,我们实现了手写孟加拉数字检测的最先进的结果。在考虑真实世界的基准数据集时,该系统比任何相关的先前工作都要好得多。总体检测准确率为97.8%。使用GPU可以实时处理大约每秒35张图像。此外,虽然实现解决方案完全基于python,但用于深度学习的框架是谷歌的Tensorflow及其依赖项,所有这些都是公开可用的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信