{"title":"Detection and Recognition of Arabic Text in Video Frames","authors":"W. Ohyama, Seiya Iwata, T. Wakabayashi, F. Kimura","doi":"10.1109/ICDAR.2017.360","DOIUrl":null,"url":null,"abstract":"The authors have developed an end-to-end system for Arabic text recognition in video frames. The end-to-end system consists of the steps for text-line detection, word segmentation and word recognition. In order to achieve high text recognition accuracy we propose a new scheme of integrated text detection-recognition scheme, where the true text-lines are detected with as higher recall rate as possible and the false words in the false lines are rejected in the successive word recognition step. We reported a recognition based transition frame detection of Arabic news captions in single channel video images. In this paper the recognition system is integrated with n-gram language model and extended to text detection/recognition of multi-channel video images. The multi-channel, multi-font performance of the system is experimentally evaluated using AcTiV-D and AcTiV-R dataset. The multi-channel text detection performance for three channels, France24, Russia Today and TunisiaNat1 is 91.29% in (F)-measure. The multi-channel, multi-font character recognition performance for these channels is 94.84% in F-measure.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"177 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.2017.360","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
The authors have developed an end-to-end system for Arabic text recognition in video frames. The end-to-end system consists of the steps for text-line detection, word segmentation and word recognition. In order to achieve high text recognition accuracy we propose a new scheme of integrated text detection-recognition scheme, where the true text-lines are detected with as higher recall rate as possible and the false words in the false lines are rejected in the successive word recognition step. We reported a recognition based transition frame detection of Arabic news captions in single channel video images. In this paper the recognition system is integrated with n-gram language model and extended to text detection/recognition of multi-channel video images. The multi-channel, multi-font performance of the system is experimentally evaluated using AcTiV-D and AcTiV-R dataset. The multi-channel text detection performance for three channels, France24, Russia Today and TunisiaNat1 is 91.29% in (F)-measure. The multi-channel, multi-font character recognition performance for these channels is 94.84% in F-measure.