{"title":"基于卷积神经网络和BLSTM的离线阿拉伯手写识别","authors":"R. Maalej, M. Kherallah","doi":"10.1109/ACIT.2018.8672667","DOIUrl":null,"url":null,"abstract":"There have been an exciting advance in machine learning during the last decade. In fact, increasing computer processing power has supported the analytical capabilities of recognition systems. In this study, we focus on Offline Arabic handwritten recognition and for this task, we propose a new system based on the integration of two deep neural networks. First a Convolutional Neural Network (CNN) to automatically extract features from raw images, then the Bidirectional Long Short-Term Memory (BLSTM) followed by a Connectionist Temporal Classification layer (CTC) for sequence labelling. We validate this model on an extended IFN/ENIT database, created with data augmentation techniques. This hybrid architecture results in appealing performance. It outperforms both hand-crafted features-based approaches and models based on automatic features extraction. According to the experiments results, the recognition rate reaches 92.21%.","PeriodicalId":443170,"journal":{"name":"2018 International Arab Conference on Information Technology (ACIT)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":"{\"title\":\"Convolutional Neural Network and BLSTM for Offline Arabic Handwriting Recognition\",\"authors\":\"R. Maalej, M. Kherallah\",\"doi\":\"10.1109/ACIT.2018.8672667\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There have been an exciting advance in machine learning during the last decade. In fact, increasing computer processing power has supported the analytical capabilities of recognition systems. In this study, we focus on Offline Arabic handwritten recognition and for this task, we propose a new system based on the integration of two deep neural networks. First a Convolutional Neural Network (CNN) to automatically extract features from raw images, then the Bidirectional Long Short-Term Memory (BLSTM) followed by a Connectionist Temporal Classification layer (CTC) for sequence labelling. We validate this model on an extended IFN/ENIT database, created with data augmentation techniques. This hybrid architecture results in appealing performance. It outperforms both hand-crafted features-based approaches and models based on automatic features extraction. According to the experiments results, the recognition rate reaches 92.21%.\",\"PeriodicalId\":443170,\"journal\":{\"name\":\"2018 International Arab Conference on Information Technology (ACIT)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"22\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 International Arab Conference on Information Technology (ACIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACIT.2018.8672667\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Arab Conference on Information Technology (ACIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACIT.2018.8672667","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Convolutional Neural Network and BLSTM for Offline Arabic Handwriting Recognition
There have been an exciting advance in machine learning during the last decade. In fact, increasing computer processing power has supported the analytical capabilities of recognition systems. In this study, we focus on Offline Arabic handwritten recognition and for this task, we propose a new system based on the integration of two deep neural networks. First a Convolutional Neural Network (CNN) to automatically extract features from raw images, then the Bidirectional Long Short-Term Memory (BLSTM) followed by a Connectionist Temporal Classification layer (CTC) for sequence labelling. We validate this model on an extended IFN/ENIT database, created with data augmentation techniques. This hybrid architecture results in appealing performance. It outperforms both hand-crafted features-based approaches and models based on automatic features extraction. According to the experiments results, the recognition rate reaches 92.21%.