基于MSER和retanet的阿拉伯语新闻视频文本检测

2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA) Pub Date : 2021-11-01 DOI:10.1109/AICCSA53542.2021.9686930

Sadek Mansouri, Salah Zrigui, M. Zrigui, Dhaou Berchech

{"title":"基于MSER和retanet的阿拉伯语新闻视频文本检测","authors":"Sadek Mansouri, Salah Zrigui, M. Zrigui, Dhaou Berchech","doi":"10.1109/AICCSA53542.2021.9686930","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a novel approach for text detection in Arabic news videos. Firstly, we apply MSER method and morphological operators (open and close) to extract candidate regions of text in image. Then, we use a deep learning method called RatinaNet. It is based in two stages. The first one aims to extract features using residual network (ResNet) and a pyramidal feature network (FPN). In the second step, we use two fully convolutional networks (FCN), one is for the classification task and the other for the bounding box regression task. For training and testing stages, we have used the AcTiVD [18] dataset. Experiments results proves the efficiency and performance of the proposed method.","PeriodicalId":423896,"journal":{"name":"2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)","volume":"118 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Text detection in Arabic news video based on MSER and RetinaNet\",\"authors\":\"Sadek Mansouri, Salah Zrigui, M. Zrigui, Dhaou Berchech\",\"doi\":\"10.1109/AICCSA53542.2021.9686930\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we propose a novel approach for text detection in Arabic news videos. Firstly, we apply MSER method and morphological operators (open and close) to extract candidate regions of text in image. Then, we use a deep learning method called RatinaNet. It is based in two stages. The first one aims to extract features using residual network (ResNet) and a pyramidal feature network (FPN). In the second step, we use two fully convolutional networks (FCN), one is for the classification task and the other for the bounding box regression task. For training and testing stages, we have used the AcTiVD [18] dataset. Experiments results proves the efficiency and performance of the proposed method.\",\"PeriodicalId\":423896,\"journal\":{\"name\":\"2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)\",\"volume\":\"118 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AICCSA53542.2021.9686930\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICCSA53542.2021.9686930","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

本文提出了一种新的阿拉伯语新闻视频文本检测方法。首先，应用MSER方法和形态学算子(开闭)提取图像中文本的候选区域;然后，我们使用一种叫做RatinaNet的深度学习方法。它基于两个阶段。第一个目标是使用残差网络(ResNet)和金字塔特征网络(FPN)提取特征。在第二步中，我们使用两个全卷积网络(FCN)，一个用于分类任务，另一个用于边界盒回归任务。对于训练和测试阶段，我们使用了AcTiVD[18]数据集。实验结果证明了该方法的有效性和性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Text detection in Arabic news video based on MSER and RetinaNet

In this paper, we propose a novel approach for text detection in Arabic news videos. Firstly, we apply MSER method and morphological operators (open and close) to extract candidate regions of text in image. Then, we use a deep learning method called RatinaNet. It is based in two stages. The first one aims to extract features using residual network (ResNet) and a pyramidal feature network (FPN). In the second step, we use two fully convolutional networks (FCN), one is for the classification task and the other for the bounding box regression task. For training and testing stages, we have used the AcTiVD [18] dataset. Experiments results proves the efficiency and performance of the proposed method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)

自引率

0.00%

发文量