BHCDR: Real-Time Bangla Handwritten Characters and Digits Recognition using Adopted Convolutional Neural Network

Muhammad Aminur Rahaman, Md. Mahin, Md. Haider Ali, M. Hasanuzzaman
Published in: 2019 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT), pp. 1-6
Publication date: 2019-05-01
DOI: 10.1109/ICASERT.2019.8934476 (https://doi.org/10.1109/ICASERT.2019.8934476)
Citations: 4

Abstract

Machine learning algorithms struggle to recognize Bangla handwriting in images because of the complex design of the characters, variation among writers, and similarity between characters and digits. Deep learning has recently become popular among researchers for Bangla Handwriting Recognition (BHR) because of its efficiency in terms of memory and time complexity and its robust feature extraction. This research aims to improve on a baseline Convolutional Neural Network (CNN) by increasing recognition accuracy while minimizing computational overhead; the paper presents a real-time Bangla Handwritten Characters and Digits Recognition (BHCDR) system using an adopted CNN. The proposed preprocessing technique, data augmentation, and the incorporation of dropout in the baseline CNN architecture achieve this goal. The proposed eight-layer architecture uses two convolutional layers followed by two max-pooling layers, with 25% dropout from one layer to the next, and two fully connected layers with 50% dropout, followed by a softmax classifier. The model is trained and tested on 118,698 images from the BanglaLekha-Isolated dataset and 21,000 images from the CMATERdb dataset of Bangla handwritten characters and digits, each split 4:1 between training and testing. It achieves a mean classification accuracy of 97.43% at an average computational cost of 44.95 ms/f.
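To make the reported figures concrete, the short Python sketch below works out what a 4:1 split would mean in image counts and what throughput 44.95 ms/f would correspond to. It rests on assumptions that the abstract does not confirm: that "4:1" is a training-to-test ratio applied to each dataset as a whole, that the test share is rounded down, and that "ms/f" means milliseconds per frame. The function names are illustrative only and are not taken from the paper.

```python
def split_4_to_1(total_images: int) -> tuple:
    """Return (train, test) counts for a 4:1 split.

    Assumption: the test share is one fifth of the images, rounded down,
    and the remainder goes to training. The paper may round or stratify
    differently.
    """
    test = total_images // 5
    train = total_images - test
    return train, test


def frames_per_second(ms_per_frame: float) -> float:
    """Convert a per-frame cost in milliseconds to frames per second.

    Assumption: "ms/f" in the abstract means milliseconds per frame.
    """
    return 1000.0 / ms_per_frame


# Image counts as reported in the abstract.
banglalekha_train, banglalekha_test = split_4_to_1(118_698)
cmaterdb_train, cmaterdb_test = split_4_to_1(21_000)

# Reported average computational cost.
fps = frames_per_second(44.95)

print("BanglaLekha-Isolated train/test:", banglalekha_train, banglalekha_test)
print("CMATERdb train/test:", cmaterdb_train, cmaterdb_test)
print("Approximate frames per second:", round(fps, 1))
```

Under these assumptions the reported cost corresponds to roughly 22 frames per second, which gives a sense of what "real-time" means for this system.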