使用深度卷积网络识别波斯语手写文字

Rasool Sabzi, Zahra Fotoohinya, Abdullah Khalili, S. Golzari, Zeinab Salkhorde, Sajjad Behravesh, Shahin Akbarpour
{"title":"使用深度卷积网络识别波斯语手写文字","authors":"Rasool Sabzi, Zahra Fotoohinya, Abdullah Khalili, S. Golzari, Zeinab Salkhorde, Sajjad Behravesh, Shahin Akbarpour","doi":"10.1109/AISP.2017.8324114","DOIUrl":null,"url":null,"abstract":"Handwritten word recognition is an active research area due to numerous commercial applications in offline and online recognition systems. The diversity and complexity of Persian handwritten words makes them more difficult to recognize. In current methods, discriminative features are manually extracted from images by humans so their performance depends on human creativity. This process is called shallow learning. In this study, deep Convolutional Neural Networks (CNNs), a widely used type of deep learning, is employed to automatically extract the discriminative features. Deep learning is able to discover complex structure (discriminative feature here) in large datasets. First in the proposed method, a preprocessing algorithm converts the images to equal size while maintaining handwritten words structure. Then, the images are given to two different architectures of CNNs, AlexNet and GoogLeNet with and without batch normalization. Finally, the proposed method is evaluated on “IRANSHAHR” dataset which includes 15383 images of 503 different city names of Iran. Experimental results show that GoogLeNet with preprocessed data and batch normalization achieves higher accuracy (99.13%) and outperforms the current methods.","PeriodicalId":386952,"journal":{"name":"2017 Artificial Intelligence and Signal Processing Conference (AISP)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Recognizing Persian handwritten words using deep convolutional networks\",\"authors\":\"Rasool Sabzi, Zahra Fotoohinya, Abdullah Khalili, S. Golzari, Zeinab Salkhorde, Sajjad Behravesh, Shahin Akbarpour\",\"doi\":\"10.1109/AISP.2017.8324114\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Handwritten word recognition is an active research area due to numerous commercial applications in offline and online recognition systems. The diversity and complexity of Persian handwritten words makes them more difficult to recognize. In current methods, discriminative features are manually extracted from images by humans so their performance depends on human creativity. This process is called shallow learning. In this study, deep Convolutional Neural Networks (CNNs), a widely used type of deep learning, is employed to automatically extract the discriminative features. Deep learning is able to discover complex structure (discriminative feature here) in large datasets. First in the proposed method, a preprocessing algorithm converts the images to equal size while maintaining handwritten words structure. Then, the images are given to two different architectures of CNNs, AlexNet and GoogLeNet with and without batch normalization. Finally, the proposed method is evaluated on “IRANSHAHR” dataset which includes 15383 images of 503 different city names of Iran. Experimental results show that GoogLeNet with preprocessed data and batch normalization achieves higher accuracy (99.13%) and outperforms the current methods.\",\"PeriodicalId\":386952,\"journal\":{\"name\":\"2017 Artificial Intelligence and Signal Processing Conference (AISP)\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Artificial Intelligence and Signal Processing Conference (AISP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AISP.2017.8324114\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Artificial Intelligence and Signal Processing Conference (AISP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AISP.2017.8324114","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

由于在离线和在线识别系统中有大量的商业应用,手写单词识别是一个活跃的研究领域。波斯语手写文字的多样性和复杂性使它们更难以识别。在目前的方法中,判别特征是由人类手动从图像中提取的,因此它们的表现取决于人类的创造力。这个过程被称为浅层学习。在本研究中,深度卷积神经网络(cnn)是一种广泛使用的深度学习类型,用于自动提取判别特征。深度学习能够在大型数据集中发现复杂的结构(这里是判别特征)。该方法首先采用预处理算法,在保持手写文字结构的前提下,将图像转换为相等大小。然后,将图像分别给予两种不同的cnn架构,AlexNet和GoogLeNet进行批处理归一化和不进行批处理归一化。最后,在包含伊朗503个不同城市名称的15383幅图像的“IRANSHAHR”数据集上对该方法进行了评估。实验结果表明,采用预处理数据和批处理归一化方法的GoogLeNet取得了更高的准确率(99.13%),优于现有方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Recognizing Persian handwritten words using deep convolutional networks
Handwritten word recognition is an active research area due to numerous commercial applications in offline and online recognition systems. The diversity and complexity of Persian handwritten words makes them more difficult to recognize. In current methods, discriminative features are manually extracted from images by humans so their performance depends on human creativity. This process is called shallow learning. In this study, deep Convolutional Neural Networks (CNNs), a widely used type of deep learning, is employed to automatically extract the discriminative features. Deep learning is able to discover complex structure (discriminative feature here) in large datasets. First in the proposed method, a preprocessing algorithm converts the images to equal size while maintaining handwritten words structure. Then, the images are given to two different architectures of CNNs, AlexNet and GoogLeNet with and without batch normalization. Finally, the proposed method is evaluated on “IRANSHAHR” dataset which includes 15383 images of 503 different city names of Iran. Experimental results show that GoogLeNet with preprocessed data and batch normalization achieves higher accuracy (99.13%) and outperforms the current methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信