{"title":"使用胶囊网络的手写阿拉伯字符识别","authors":"Daldali Mehdi, A. Souhar","doi":"10.1109/WINCOM55661.2022.9966452","DOIUrl":null,"url":null,"abstract":"Handwritten text exhibits a diversity in styles, and unpredictable characteristics making of Handwriting Recognition (HWR) an interesting computer vision problem. In the case of Arabic script, the recognition task is especially more complex, due to its cursive nature, and emphasis on fluid and connected pen strokes, rarely seen in other scripts. Unfortunately current end-to-end approaches fail at modeling the structural aspect of visual information with high accuracy, which is detrimental for tasks such as Optical Character Recognition (OCR). Capsule networks are a neural network architecture, using a set of interesting concepts which can accurately model the structural aspect of image features, to solve some of the Arabic script recognition problems faced by other systems. Our proposed approach based on Capsules yielded interesting results using a data set of around 50 thousand Arabic characters with neither binarization nor noise reduction, while achieving more than 97% TOP-1 accuracy.","PeriodicalId":128342,"journal":{"name":"2022 9th International Conference on Wireless Networks and Mobile Communications (WINCOM)","volume":"186 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Handwritten Arabic characters recognition using Capsule Networks\",\"authors\":\"Daldali Mehdi, A. Souhar\",\"doi\":\"10.1109/WINCOM55661.2022.9966452\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Handwritten text exhibits a diversity in styles, and unpredictable characteristics making of Handwriting Recognition (HWR) an interesting computer vision problem. In the case of Arabic script, the recognition task is especially more complex, due to its cursive nature, and emphasis on fluid and connected pen strokes, rarely seen in other scripts. Unfortunately current end-to-end approaches fail at modeling the structural aspect of visual information with high accuracy, which is detrimental for tasks such as Optical Character Recognition (OCR). Capsule networks are a neural network architecture, using a set of interesting concepts which can accurately model the structural aspect of image features, to solve some of the Arabic script recognition problems faced by other systems. Our proposed approach based on Capsules yielded interesting results using a data set of around 50 thousand Arabic characters with neither binarization nor noise reduction, while achieving more than 97% TOP-1 accuracy.\",\"PeriodicalId\":128342,\"journal\":{\"name\":\"2022 9th International Conference on Wireless Networks and Mobile Communications (WINCOM)\",\"volume\":\"186 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 9th International Conference on Wireless Networks and Mobile Communications (WINCOM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WINCOM55661.2022.9966452\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 9th International Conference on Wireless Networks and Mobile Communications (WINCOM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WINCOM55661.2022.9966452","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Handwritten Arabic characters recognition using Capsule Networks
Handwritten text exhibits a diversity in styles, and unpredictable characteristics making of Handwriting Recognition (HWR) an interesting computer vision problem. In the case of Arabic script, the recognition task is especially more complex, due to its cursive nature, and emphasis on fluid and connected pen strokes, rarely seen in other scripts. Unfortunately current end-to-end approaches fail at modeling the structural aspect of visual information with high accuracy, which is detrimental for tasks such as Optical Character Recognition (OCR). Capsule networks are a neural network architecture, using a set of interesting concepts which can accurately model the structural aspect of image features, to solve some of the Arabic script recognition problems faced by other systems. Our proposed approach based on Capsules yielded interesting results using a data set of around 50 thousand Arabic characters with neither binarization nor noise reduction, while achieving more than 97% TOP-1 accuracy.