{"title":"Handwritten Bengali Number Detection using Region Proposal Network","authors":"Shaharat Tajrean, Mohammad Abu Yousuf","doi":"10.1109/ICBSLP47725.2019.202049","DOIUrl":null,"url":null,"abstract":"As a seventh most spoken native language, Bengali needs a robust and accurate optical character recognition (OCR) especially for number detection. As there is no publicly available well-organized dataset for Bangla number OCR, a synthesized dataset was generated to fill the lack of available data. The recent advancement in artificial intelligence using deep neural networks easily outperforms prior hand selected feature-based machine learning approaches. As the region proposal networks (RPN) in deep neural networks perform very well in detecting objects, it can be used for digit detection in an image. So, in this work a very robust Bengali handwritten number detection system is presented where with the help of deep neural networks and a very well-organized, unbiased generated dataset we achieved state of the art result in handwritten Bangla number detection. This system beats any related prior works by a large margin while considering a real world dataset for benchmarks. The overall detection accuracy was 97.8%. The processing can be done real-time with about 35 images per second using a GPU. Also, while implementing the solution is completely based on python, the framework used for deep learning is Google’s Tensorflow and the dependencies, all of which are publicly available.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBSLP47725.2019.202049","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
As a seventh most spoken native language, Bengali needs a robust and accurate optical character recognition (OCR) especially for number detection. As there is no publicly available well-organized dataset for Bangla number OCR, a synthesized dataset was generated to fill the lack of available data. The recent advancement in artificial intelligence using deep neural networks easily outperforms prior hand selected feature-based machine learning approaches. As the region proposal networks (RPN) in deep neural networks perform very well in detecting objects, it can be used for digit detection in an image. So, in this work a very robust Bengali handwritten number detection system is presented where with the help of deep neural networks and a very well-organized, unbiased generated dataset we achieved state of the art result in handwritten Bangla number detection. This system beats any related prior works by a large margin while considering a real world dataset for benchmarks. The overall detection accuracy was 97.8%. The processing can be done real-time with about 35 images per second using a GPU. Also, while implementing the solution is completely based on python, the framework used for deep learning is Google’s Tensorflow and the dependencies, all of which are publicly available.