Sadek Mansouri, Salah Zrigui, M. Zrigui, Dhaou Berchech
{"title":"基于MSER和retanet的阿拉伯语新闻视频文本检测","authors":"Sadek Mansouri, Salah Zrigui, M. Zrigui, Dhaou Berchech","doi":"10.1109/AICCSA53542.2021.9686930","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a novel approach for text detection in Arabic news videos. Firstly, we apply MSER method and morphological operators (open and close) to extract candidate regions of text in image. Then, we use a deep learning method called RatinaNet. It is based in two stages. The first one aims to extract features using residual network (ResNet) and a pyramidal feature network (FPN). In the second step, we use two fully convolutional networks (FCN), one is for the classification task and the other for the bounding box regression task. For training and testing stages, we have used the AcTiVD [18] dataset. Experiments results proves the efficiency and performance of the proposed method.","PeriodicalId":423896,"journal":{"name":"2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Text detection in Arabic news video based on MSER and RetinaNet\",\"authors\":\"Sadek Mansouri, Salah Zrigui, M. Zrigui, Dhaou Berchech\",\"doi\":\"10.1109/AICCSA53542.2021.9686930\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a novel approach for text detection in Arabic news videos. Firstly, we apply MSER method and morphological operators (open and close) to extract candidate regions of text in image. Then, we use a deep learning method called RatinaNet. It is based in two stages. The first one aims to extract features using residual network (ResNet) and a pyramidal feature network (FPN). In the second step, we use two fully convolutional networks (FCN), one is for the classification task and the other for the bounding box regression task. For training and testing stages, we have used the AcTiVD [18] dataset. Experiments results proves the efficiency and performance of the proposed method.\",\"PeriodicalId\":423896,\"journal\":{\"name\":\"2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)\",\"volume\":\"118 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AICCSA53542.2021.9686930\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICCSA53542.2021.9686930","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Text detection in Arabic news video based on MSER and RetinaNet
In this paper, we propose a novel approach for text detection in Arabic news videos. Firstly, we apply MSER method and morphological operators (open and close) to extract candidate regions of text in image. Then, we use a deep learning method called RatinaNet. It is based in two stages. The first one aims to extract features using residual network (ResNet) and a pyramidal feature network (FPN). In the second step, we use two fully convolutional networks (FCN), one is for the classification task and the other for the bounding box regression task. For training and testing stages, we have used the AcTiVD [18] dataset. Experiments results proves the efficiency and performance of the proposed method.