AIBangla: A Benchmark Dataset for Isolated Bangla Handwritten Basic and Compound Character Recognition

M. Hasan, Mahathir Mohammad Abir, Md. Ibrahim, M. Sayem, Sohaib Abdullah
{"title":"AIBangla: A Benchmark Dataset for Isolated Bangla Handwritten Basic and Compound Character Recognition","authors":"M. Hasan, Mahathir Mohammad Abir, Md. Ibrahim, M. Sayem, Sohaib Abdullah","doi":"10.1109/ICBSLP47725.2019.201481","DOIUrl":null,"url":null,"abstract":"Automatic handwritten Bangla character recognition (HBCR) is a challenging problem in computer vision due to numerous variations in writing styles of an individual Bangla character and the presence of similarities in shapes among different characters. Considering the complexity of the problem, we need to develop a modern convolutional neural network (CNN) for accurate recognition, but unfortunately, at present, very few Bangla handwritten dataset contain a large number of image samples for each character suitable for training deep learning-based methods. In this paper, we present AIBangla, a new benchmark image database for isolated handwritten Bangla characters with detailed usage and a performance baseline. Our dataset contains 80,403 hand-written images on 50 Bangla basic characters and 249,911 hand-written images on 171 Bangla compound characters which were written by more than 2,000 unique writers from various institutes across Bangladesh. In addition, we have applied three leading state-of-the-art deep CNN networks on our proposed AIBangla dataset to provide baseline performance. We have achieved a maximum accuracy of 98.13% and 81.83% for basic and compound character classes respectively on the test set of the AIBangla dataset.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBSLP47725.2019.201481","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Automatic handwritten Bangla character recognition (HBCR) is a challenging problem in computer vision due to numerous variations in writing styles of an individual Bangla character and the presence of similarities in shapes among different characters. Considering the complexity of the problem, we need to develop a modern convolutional neural network (CNN) for accurate recognition, but unfortunately, at present, very few Bangla handwritten dataset contain a large number of image samples for each character suitable for training deep learning-based methods. In this paper, we present AIBangla, a new benchmark image database for isolated handwritten Bangla characters with detailed usage and a performance baseline. Our dataset contains 80,403 hand-written images on 50 Bangla basic characters and 249,911 hand-written images on 171 Bangla compound characters which were written by more than 2,000 unique writers from various institutes across Bangladesh. In addition, we have applied three leading state-of-the-art deep CNN networks on our proposed AIBangla dataset to provide baseline performance. We have achieved a maximum accuracy of 98.13% and 81.83% for basic and compound character classes respectively on the test set of the AIBangla dataset.
AIBangla:孤立孟加拉语手写基本字和复合字识别的基准数据集
自动手写体孟加拉语字符识别(HBCR)是计算机视觉中的一个具有挑战性的问题,因为单个孟加拉语字符的书写风格存在许多变化,并且不同字符之间存在形状相似性。考虑到问题的复杂性,我们需要开发一种现代卷积神经网络(CNN)来进行准确的识别,但遗憾的是,目前很少有孟加拉语手写数据集包含适合训练基于深度学习的方法的每个字符的大量图像样本。在本文中,我们提出了AIBangla,一个新的基准图像数据库,用于孤立的手写孟加拉字符,具有详细的使用情况和性能基线。我们的数据集包含50个孟加拉语基本字符的80,403个手写图像和171个孟加拉语复合字符的249,911个手写图像,这些图像由来自孟加拉国不同机构的2,000多名独特的作者编写。此外,我们在我们提出的AIBangla数据集上应用了三个领先的最先进的深度CNN网络,以提供基线性能。在AIBangla数据集的测试集上,我们对基本字符类和复合字符类的最大准确率分别达到了98.13%和81.83%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信