Rasool Sabzi, Zahra Fotoohinya, Abdullah Khalili, S. Golzari, Zeinab Salkhorde, Sajjad Behravesh, Shahin Akbarpour
{"title":"使用深度卷积网络识别波斯语手写文字","authors":"Rasool Sabzi, Zahra Fotoohinya, Abdullah Khalili, S. Golzari, Zeinab Salkhorde, Sajjad Behravesh, Shahin Akbarpour","doi":"10.1109/AISP.2017.8324114","DOIUrl":null,"url":null,"abstract":"Handwritten word recognition is an active research area due to numerous commercial applications in offline and online recognition systems. The diversity and complexity of Persian handwritten words makes them more difficult to recognize. In current methods, discriminative features are manually extracted from images by humans so their performance depends on human creativity. This process is called shallow learning. In this study, deep Convolutional Neural Networks (CNNs), a widely used type of deep learning, is employed to automatically extract the discriminative features. Deep learning is able to discover complex structure (discriminative feature here) in large datasets. First in the proposed method, a preprocessing algorithm converts the images to equal size while maintaining handwritten words structure. Then, the images are given to two different architectures of CNNs, AlexNet and GoogLeNet with and without batch normalization. Finally, the proposed method is evaluated on “IRANSHAHR” dataset which includes 15383 images of 503 different city names of Iran. Experimental results show that GoogLeNet with preprocessed data and batch normalization achieves higher accuracy (99.13%) and outperforms the current methods.","PeriodicalId":386952,"journal":{"name":"2017 Artificial Intelligence and Signal Processing Conference (AISP)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Recognizing Persian handwritten words using deep convolutional networks\",\"authors\":\"Rasool Sabzi, Zahra Fotoohinya, Abdullah Khalili, S. Golzari, Zeinab Salkhorde, Sajjad Behravesh, Shahin Akbarpour\",\"doi\":\"10.1109/AISP.2017.8324114\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Handwritten word recognition is an active research area due to numerous commercial applications in offline and online recognition systems. The diversity and complexity of Persian handwritten words makes them more difficult to recognize. In current methods, discriminative features are manually extracted from images by humans so their performance depends on human creativity. This process is called shallow learning. In this study, deep Convolutional Neural Networks (CNNs), a widely used type of deep learning, is employed to automatically extract the discriminative features. Deep learning is able to discover complex structure (discriminative feature here) in large datasets. First in the proposed method, a preprocessing algorithm converts the images to equal size while maintaining handwritten words structure. Then, the images are given to two different architectures of CNNs, AlexNet and GoogLeNet with and without batch normalization. Finally, the proposed method is evaluated on “IRANSHAHR” dataset which includes 15383 images of 503 different city names of Iran. Experimental results show that GoogLeNet with preprocessed data and batch normalization achieves higher accuracy (99.13%) and outperforms the current methods.\",\"PeriodicalId\":386952,\"journal\":{\"name\":\"2017 Artificial Intelligence and Signal Processing Conference (AISP)\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Artificial Intelligence and Signal Processing Conference (AISP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AISP.2017.8324114\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Artificial Intelligence and Signal Processing Conference (AISP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AISP.2017.8324114","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Recognizing Persian handwritten words using deep convolutional networks
Handwritten word recognition is an active research area due to numerous commercial applications in offline and online recognition systems. The diversity and complexity of Persian handwritten words makes them more difficult to recognize. In current methods, discriminative features are manually extracted from images by humans so their performance depends on human creativity. This process is called shallow learning. In this study, deep Convolutional Neural Networks (CNNs), a widely used type of deep learning, is employed to automatically extract the discriminative features. Deep learning is able to discover complex structure (discriminative feature here) in large datasets. First in the proposed method, a preprocessing algorithm converts the images to equal size while maintaining handwritten words structure. Then, the images are given to two different architectures of CNNs, AlexNet and GoogLeNet with and without batch normalization. Finally, the proposed method is evaluated on “IRANSHAHR” dataset which includes 15383 images of 503 different city names of Iran. Experimental results show that GoogLeNet with preprocessed data and batch normalization achieves higher accuracy (99.13%) and outperforms the current methods.