Sadek Mansouri, Salah Zrigui, M. Zrigui, Dhaou Berchech
{"title":"Text detection in Arabic news video based on MSER and RetinaNet","authors":"Sadek Mansouri, Salah Zrigui, M. Zrigui, Dhaou Berchech","doi":"10.1109/AICCSA53542.2021.9686930","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a novel approach for text detection in Arabic news videos. Firstly, we apply MSER method and morphological operators (open and close) to extract candidate regions of text in image. Then, we use a deep learning method called RatinaNet. It is based in two stages. The first one aims to extract features using residual network (ResNet) and a pyramidal feature network (FPN). In the second step, we use two fully convolutional networks (FCN), one is for the classification task and the other for the bounding box regression task. For training and testing stages, we have used the AcTiVD [18] dataset. Experiments results proves the efficiency and performance of the proposed method.","PeriodicalId":423896,"journal":{"name":"2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICCSA53542.2021.9686930","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, we propose a novel approach for text detection in Arabic news videos. Firstly, we apply MSER method and morphological operators (open and close) to extract candidate regions of text in image. Then, we use a deep learning method called RatinaNet. It is based in two stages. The first one aims to extract features using residual network (ResNet) and a pyramidal feature network (FPN). In the second step, we use two fully convolutional networks (FCN), one is for the classification task and the other for the bounding box regression task. For training and testing stages, we have used the AcTiVD [18] dataset. Experiments results proves the efficiency and performance of the proposed method.