Recognizing Persian handwritten words using deep convolutional networks

2017 Artificial Intelligence and Signal Processing Conference (AISP). Pub Date: 2017-10-01. DOI: 10.1109/AISP.2017.8324114

Rasool Sabzi, Zahra Fotoohinya, Abdullah Khalili, S. Golzari, Zeinab Salkhorde, Sajjad Behravesh, Shahin Akbarpour


Citations: 6

Abstract


Recognizing Persian handwritten words using deep convolutional networks

Handwritten word recognition is an active research area because of its numerous commercial applications in offline and online recognition systems. The diversity and complexity of Persian handwritten words make them more difficult to recognize. In current methods, discriminative features are extracted from images manually, so performance depends on human creativity; this approach is called shallow learning. In this study, deep Convolutional Neural Networks (CNNs), a widely used type of deep learning model, are employed to extract the discriminative features automatically. Deep learning is able to discover complex structure (here, discriminative features) in large datasets. In the proposed method, a preprocessing algorithm first converts the images to equal size while maintaining the structure of the handwritten words. The images are then given to two CNN architectures, AlexNet and GoogLeNet, with and without batch normalization. Finally, the proposed method is evaluated on the “IRANSHAHR” dataset, which includes 15383 images of 503 different city names of Iran. Experimental results show that GoogLeNet with preprocessed data and batch normalization achieves higher accuracy (99.13%) and outperforms the current methods.
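The abstract does not describe the preprocessing algorithm itself, only that it brings images to equal size while keeping the word structure intact. One common way to meet that requirement is to scale each image by a single factor on both axes and pad the remainder with background pixels. The sketch below illustrates that idea in plain Python; it is an assumption for illustration and not the authors' method. The names `resize_nearest` and `fit_to_canvas`, the nearest-neighbor interpolation, and the white background value of 255 are all hypothetical choices.

```python
from typing import List

# A grayscale image as rows of pixel intensities (0 = black ink, 255 = white paper).
Image = List[List[int]]


def resize_nearest(img: Image, new_h: int, new_w: int) -> Image:
    """Resize with nearest-neighbor sampling (hypothetical helper)."""
    old_h, old_w = len(img), len(img[0])
    return [
        [img[r * old_h // new_h][c * old_w // new_w] for c in range(new_w)]
        for r in range(new_h)
    ]


def fit_to_canvas(img: Image, target_h: int, target_w: int,
                  background: int = 255) -> Image:
    """Scale img uniformly to fit inside target_h x target_w, then pad.

    Using the same scale factor on both axes keeps the aspect ratio,
    so the strokes of the word are not stretched or squashed.
    """
    old_h, old_w = len(img), len(img[0])
    scale = min(target_h / old_h, target_w / old_w)
    new_h = max(1, min(target_h, round(old_h * scale)))
    new_w = max(1, min(target_w, round(old_w * scale)))
    resized = resize_nearest(img, new_h, new_w)

    # Center the resized word on a background-colored canvas.
    top = (target_h - new_h) // 2
    left = (target_w - new_w) // 2
    canvas = [[background] * target_w for _ in range(target_h)]
    for r in range(new_h):
        canvas[top + r][left:left + new_w] = resized[r]
    return canvas


if __name__ == "__main__":
    # A wide 2x4 all-ink image placed on an 8x8 canvas.
    word = [[0, 0, 0, 0], [0, 0, 0, 0]]
    out = fit_to_canvas(word, 8, 8)
    for row in out:
        print(row)
```

Padding instead of stretching means a short city name and a long one end up with the same input dimensions for the network without distorting either, at the cost of some unused background area.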
