Latifa Aljiffry, Hassanin M. Al-Barhamtoshy, A. Jamal, Felwa A. Abukhodair
{"title":"阿拉伯语文档布局分析(ADLA)使用微调更快的RCN","authors":"Latifa Aljiffry, Hassanin M. Al-Barhamtoshy, A. Jamal, Felwa A. Abukhodair","doi":"10.1109/ESOLEC54569.2022.10009375","DOIUrl":null,"url":null,"abstract":"At present, there is a massive interest in document digitization, image searching, and natural language processing models, using different types of models. The first step in applying any type of processing like the image to text converting, is layout analysis, which is the paper's interest field. The problem in layout analysis comes in Arabic language, where there is a well-noticed gap for research in this field. The main limitations of the existed research are common, the dataset size, where in its return, gives a not very accurate result. In this paper, we are using two distinct types of Arabic language datasets. We propose a tuned model for layout analysis for Arabic printed and early printed documents using Faster RCNN (ADLA). The proposed model is based on tuning Faster Region-based Convolutional Neural Network (RCNN) model to match our two datasets, with different regions of interest (RoI). For evaluation, we compared the proposed model with two distinct existing models (LABA & FFRA). The F1 score result for our proposed model exceeds the LABA model with 99.4%, whereas the LABA model has 90.5%. Our model exceeds the FFRA model with 99.59% accuracy, whereas the FFRA model got 99.83% accuracy result.","PeriodicalId":179850,"journal":{"name":"2022 20th International Conference on Language Engineering (ESOLEC)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2022-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Arabic Documents Layout Analysis (ADLA) using Fine-tuned Faster RCN\",\"authors\":\"Latifa Aljiffry, Hassanin M. Al-Barhamtoshy, A. Jamal, Felwa A. Abukhodair\",\"doi\":\"10.1109/ESOLEC54569.2022.10009375\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"At present, there is a massive interest in document digitization, image searching, and natural language processing models, using different types of models. The first step in applying any type of processing like the image to text converting, is layout analysis, which is the paper's interest field. The problem in layout analysis comes in Arabic language, where there is a well-noticed gap for research in this field. The main limitations of the existed research are common, the dataset size, where in its return, gives a not very accurate result. In this paper, we are using two distinct types of Arabic language datasets. We propose a tuned model for layout analysis for Arabic printed and early printed documents using Faster RCNN (ADLA). The proposed model is based on tuning Faster Region-based Convolutional Neural Network (RCNN) model to match our two datasets, with different regions of interest (RoI). For evaluation, we compared the proposed model with two distinct existing models (LABA & FFRA). The F1 score result for our proposed model exceeds the LABA model with 99.4%, whereas the LABA model has 90.5%. Our model exceeds the FFRA model with 99.59% accuracy, whereas the FFRA model got 99.83% accuracy result.\",\"PeriodicalId\":179850,\"journal\":{\"name\":\"2022 20th International Conference on Language Engineering (ESOLEC)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 20th International Conference on Language Engineering (ESOLEC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ESOLEC54569.2022.10009375\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 20th International Conference on Language Engineering (ESOLEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ESOLEC54569.2022.10009375","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Arabic Documents Layout Analysis (ADLA) using Fine-tuned Faster RCN
At present, there is a massive interest in document digitization, image searching, and natural language processing models, using different types of models. The first step in applying any type of processing like the image to text converting, is layout analysis, which is the paper's interest field. The problem in layout analysis comes in Arabic language, where there is a well-noticed gap for research in this field. The main limitations of the existed research are common, the dataset size, where in its return, gives a not very accurate result. In this paper, we are using two distinct types of Arabic language datasets. We propose a tuned model for layout analysis for Arabic printed and early printed documents using Faster RCNN (ADLA). The proposed model is based on tuning Faster Region-based Convolutional Neural Network (RCNN) model to match our two datasets, with different regions of interest (RoI). For evaluation, we compared the proposed model with two distinct existing models (LABA & FFRA). The F1 score result for our proposed model exceeds the LABA model with 99.4%, whereas the LABA model has 90.5%. Our model exceeds the FFRA model with 99.59% accuracy, whereas the FFRA model got 99.83% accuracy result.