Text detection in Arabic news video based on MSER and RetinaNet

2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA) Pub Date : 2021-11-01 DOI:10.1109/AICCSA53542.2021.9686930

Sadek Mansouri, Salah Zrigui, M. Zrigui, Dhaou Berchech

引用次数: 1

Abstract

In this paper, we propose a novel approach for text detection in Arabic news videos. Firstly, we apply MSER method and morphological operators (open and close) to extract candidate regions of text in image. Then, we use a deep learning method called RatinaNet. It is based in two stages. The first one aims to extract features using residual network (ResNet) and a pyramidal feature network (FPN). In the second step, we use two fully convolutional networks (FCN), one is for the classification task and the other for the bounding box regression task. For training and testing stages, we have used the AcTiVD [18] dataset. Experiments results proves the efficiency and performance of the proposed method.

查看原文本刊更多论文

基于MSER和retanet的阿拉伯语新闻视频文本检测

本文提出了一种新的阿拉伯语新闻视频文本检测方法。首先，应用MSER方法和形态学算子(开闭)提取图像中文本的候选区域;然后，我们使用一种叫做RatinaNet的深度学习方法。它基于两个阶段。第一个目标是使用残差网络(ResNet)和金字塔特征网络(FPN)提取特征。在第二步中，我们使用两个全卷积网络(FCN)，一个用于分类任务，另一个用于边界盒回归任务。对于训练和测试阶段，我们使用了AcTiVD[18]数据集。实验结果证明了该方法的有效性和性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE/ACS 18th International Conference on Computer Systems and Applications (AICCSA)

自引率

0.00%

发文量