Detection and Recognition of Arabic Text in Video Frames

2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR) Pub Date : 2017-11-01 DOI:10.1109/ICDAR.2017.360

W. Ohyama, Seiya Iwata, T. Wakabayashi, F. Kimura

{"title":"Detection and Recognition of Arabic Text in Video Frames","authors":"W. Ohyama, Seiya Iwata, T. Wakabayashi, F. Kimura","doi":"10.1109/ICDAR.2017.360","DOIUrl":null,"url":null,"abstract":"The authors have developed an end-to-end system for Arabic text recognition in video frames. The end-to-end system consists of the steps for text-line detection, word segmentation and word recognition. In order to achieve high text recognition accuracy we propose a new scheme of integrated text detection-recognition scheme, where the true text-lines are detected with as higher recall rate as possible and the false words in the false lines are rejected in the successive word recognition step. We reported a recognition based transition frame detection of Arabic news captions in single channel video images. In this paper the recognition system is integrated with n-gram language model and extended to text detection/recognition of multi-channel video images. The multi-channel, multi-font performance of the system is experimentally evaluated using AcTiV-D and AcTiV-R dataset. The multi-channel text detection performance for three channels, France24, Russia Today and TunisiaNat1 is 91.29% in (F)-measure. The multi-channel, multi-font character recognition performance for these channels is 94.84% in F-measure.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"177 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.2017.360","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

The authors have developed an end-to-end system for Arabic text recognition in video frames. The end-to-end system consists of the steps for text-line detection, word segmentation and word recognition. In order to achieve high text recognition accuracy we propose a new scheme of integrated text detection-recognition scheme, where the true text-lines are detected with as higher recall rate as possible and the false words in the false lines are rejected in the successive word recognition step. We reported a recognition based transition frame detection of Arabic news captions in single channel video images. In this paper the recognition system is integrated with n-gram language model and extended to text detection/recognition of multi-channel video images. The multi-channel, multi-font performance of the system is experimentally evaluated using AcTiV-D and AcTiV-R dataset. The multi-channel text detection performance for three channels, France24, Russia Today and TunisiaNat1 is 91.29% in (F)-measure. The multi-channel, multi-font character recognition performance for these channels is 94.84% in F-measure.

查看原文本刊更多论文

视频帧中阿拉伯语文本的检测与识别

作者开发了一个端到端的系统，用于视频帧中的阿拉伯语文本识别。端到端系统包括文本行检测、分词和词识别三个步骤。为了达到较高的文本识别准确率，本文提出了一种综合文本检测-识别方案，该方案以尽可能高的召回率检测出真实的文本行，并在后续的单词识别步骤中剔除假行中的假单词。我们报道了一种基于识别的单通道视频图像中阿拉伯语新闻字幕的过渡帧检测方法。本文将该识别系统与n-gram语言模型相结合，并将其扩展到多通道视频图像的文本检测/识别。利用AcTiV-D和AcTiV-R数据集对系统的多通道、多字体性能进行了实验评估。France24、Russia Today和TunisiaNat1三个频道的多通道文本检测性能为91.29% (F)-measure。这些通道的多通道、多字体字符识别性能在F-measure中为94.84%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)

自引率

0.00%

发文量