{"title":"视频帧中阿拉伯语文本的检测与识别","authors":"W. Ohyama, Seiya Iwata, T. Wakabayashi, F. Kimura","doi":"10.1109/ICDAR.2017.360","DOIUrl":null,"url":null,"abstract":"The authors have developed an end-to-end system for Arabic text recognition in video frames. The end-to-end system consists of the steps for text-line detection, word segmentation and word recognition. In order to achieve high text recognition accuracy we propose a new scheme of integrated text detection-recognition scheme, where the true text-lines are detected with as higher recall rate as possible and the false words in the false lines are rejected in the successive word recognition step. We reported a recognition based transition frame detection of Arabic news captions in single channel video images. In this paper the recognition system is integrated with n-gram language model and extended to text detection/recognition of multi-channel video images. The multi-channel, multi-font performance of the system is experimentally evaluated using AcTiV-D and AcTiV-R dataset. The multi-channel text detection performance for three channels, France24, Russia Today and TunisiaNat1 is 91.29% in (F)-measure. The multi-channel, multi-font character recognition performance for these channels is 94.84% in F-measure.","PeriodicalId":433676,"journal":{"name":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","volume":"177 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Detection and Recognition of Arabic Text in Video Frames\",\"authors\":\"W. Ohyama, Seiya Iwata, T. Wakabayashi, F. Kimura\",\"doi\":\"10.1109/ICDAR.2017.360\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The authors have developed an end-to-end system for Arabic text recognition in video frames. The end-to-end system consists of the steps for text-line detection, word segmentation and word recognition. In order to achieve high text recognition accuracy we propose a new scheme of integrated text detection-recognition scheme, where the true text-lines are detected with as higher recall rate as possible and the false words in the false lines are rejected in the successive word recognition step. We reported a recognition based transition frame detection of Arabic news captions in single channel video images. In this paper the recognition system is integrated with n-gram language model and extended to text detection/recognition of multi-channel video images. The multi-channel, multi-font performance of the system is experimentally evaluated using AcTiV-D and AcTiV-R dataset. The multi-channel text detection performance for three channels, France24, Russia Today and TunisiaNat1 is 91.29% in (F)-measure. The multi-channel, multi-font character recognition performance for these channels is 94.84% in F-measure.\",\"PeriodicalId\":433676,\"journal\":{\"name\":\"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)\",\"volume\":\"177 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDAR.2017.360\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.2017.360","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Detection and Recognition of Arabic Text in Video Frames
The authors have developed an end-to-end system for Arabic text recognition in video frames. The end-to-end system consists of the steps for text-line detection, word segmentation and word recognition. In order to achieve high text recognition accuracy we propose a new scheme of integrated text detection-recognition scheme, where the true text-lines are detected with as higher recall rate as possible and the false words in the false lines are rejected in the successive word recognition step. We reported a recognition based transition frame detection of Arabic news captions in single channel video images. In this paper the recognition system is integrated with n-gram language model and extended to text detection/recognition of multi-channel video images. The multi-channel, multi-font performance of the system is experimentally evaluated using AcTiV-D and AcTiV-R dataset. The multi-channel text detection performance for three channels, France24, Russia Today and TunisiaNat1 is 91.29% in (F)-measure. The multi-channel, multi-font character recognition performance for these channels is 94.84% in F-measure.