从基础到复合:一种新的孟加拉手写字符识别迁移学习方法

Sakib Reza, Ohida Binte Amin, M. Hashem
{"title":"从基础到复合:一种新的孟加拉手写字符识别迁移学习方法","authors":"Sakib Reza, Ohida Binte Amin, M. Hashem","doi":"10.1109/ICBSLP47725.2019.201522","DOIUrl":null,"url":null,"abstract":"Transfer learning is widely used in various character recognition tasks. In this paper, we propose a transfer learning approach with convolutional neural network (CNN) for Bengali handwritten character recognition. When children learn the Bengali scripts, they first learn basic characters (vowels and consonants) and then go for compound characters (consonant conjuncts). Without prior knowledge of basic characters, it would be quite difficult for them to learn compound characters. In our approach, the machine mimics this human child learning process. Our study shows that CNN trained on basic characters is well capable of recognizing compound characters with minimal retraining. It performs better and also trains much faster than CNN fully trained on compound characters. Similarly, CNN trained on digits easily recognizes basic characters with a short period of training. Furthermore, pretrained CNN consistently outperforms the randomly initialized CNN while training only last few layers.","PeriodicalId":413077,"journal":{"name":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Basic to Compound: A Novel Transfer Learning Approach for Bengali Handwritten Character Recognition\",\"authors\":\"Sakib Reza, Ohida Binte Amin, M. Hashem\",\"doi\":\"10.1109/ICBSLP47725.2019.201522\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Transfer learning is widely used in various character recognition tasks. In this paper, we propose a transfer learning approach with convolutional neural network (CNN) for Bengali handwritten character recognition. When children learn the Bengali scripts, they first learn basic characters (vowels and consonants) and then go for compound characters (consonant conjuncts). Without prior knowledge of basic characters, it would be quite difficult for them to learn compound characters. In our approach, the machine mimics this human child learning process. Our study shows that CNN trained on basic characters is well capable of recognizing compound characters with minimal retraining. It performs better and also trains much faster than CNN fully trained on compound characters. Similarly, CNN trained on digits easily recognizes basic characters with a short period of training. Furthermore, pretrained CNN consistently outperforms the randomly initialized CNN while training only last few layers.\",\"PeriodicalId\":413077,\"journal\":{\"name\":\"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICBSLP47725.2019.201522\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Bangla Speech and Language Processing (ICBSLP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBSLP47725.2019.201522","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

迁移学习被广泛应用于各种字符识别任务中。在本文中,我们提出了一种卷积神经网络(CNN)的迁移学习方法用于孟加拉手写字符识别。当孩子们学习孟加拉语时,他们首先学习基本字符(元音和辅音),然后学习复合字符(辅音连词)。如果没有对基本汉字的先验知识,他们学习复合字是相当困难的。在我们的方法中,机器模仿人类儿童的学习过程。我们的研究表明,经过基本字符训练的CNN能够很好地识别复合字符,并且只需要很少的再训练。它的性能更好,训练速度也比CNN完全训练复合字快得多。同样,在数字上训练的CNN,训练时间短,很容易识别基本字符。此外,预训练的CNN在只训练最后几层的情况下,始终优于随机初始化的CNN。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Basic to Compound: A Novel Transfer Learning Approach for Bengali Handwritten Character Recognition
Transfer learning is widely used in various character recognition tasks. In this paper, we propose a transfer learning approach with convolutional neural network (CNN) for Bengali handwritten character recognition. When children learn the Bengali scripts, they first learn basic characters (vowels and consonants) and then go for compound characters (consonant conjuncts). Without prior knowledge of basic characters, it would be quite difficult for them to learn compound characters. In our approach, the machine mimics this human child learning process. Our study shows that CNN trained on basic characters is well capable of recognizing compound characters with minimal retraining. It performs better and also trains much faster than CNN fully trained on compound characters. Similarly, CNN trained on digits easily recognizes basic characters with a short period of training. Furthermore, pretrained CNN consistently outperforms the randomly initialized CNN while training only last few layers.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信