Z. Malik, Ali Mirza, A. Bennour, I. Siddiqi, Chawki Djeddi
{"title":"Video Script Identification Using a Combination of Textural Features","authors":"Z. Malik, Ali Mirza, A. Bennour, I. Siddiqi, Chawki Djeddi","doi":"10.1109/SITIS.2015.15","DOIUrl":null,"url":null,"abstract":"This paper presents a system for script recognition of the text appearing in video frames. The textual content in videos is generally extracted and recognized for development of text based indexing and retrieval systems. If the text in videos appears only in a single script, the output of text detector is directly fed to a video Optical Character Recognition (OCR) system for recognition. However, in cases where text may appear in multiple scripts, a script recognition module is required to recognize the script of the text so that it can be processed by the respective OCR. We propose a video script recognition system that considers text in each script as a unique texture. A number of texture measures are extracted from text blocks and an artificial neural network is trained to learn to distinguish between different scripts. The system evaluated on video text blocks in five different scripts (Arabic, English, Urdu, Hindi and Chinese) reported promising recognition rates. In addition to the performance of individual textural features, different combinations of texture measures were investigated which realized interesting results.","PeriodicalId":128616,"journal":{"name":"2015 11th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 11th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SITIS.2015.15","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
This paper presents a system for script recognition of the text appearing in video frames. The textual content in videos is generally extracted and recognized for development of text based indexing and retrieval systems. If the text in videos appears only in a single script, the output of text detector is directly fed to a video Optical Character Recognition (OCR) system for recognition. However, in cases where text may appear in multiple scripts, a script recognition module is required to recognize the script of the text so that it can be processed by the respective OCR. We propose a video script recognition system that considers text in each script as a unique texture. A number of texture measures are extracted from text blocks and an artificial neural network is trained to learn to distinguish between different scripts. The system evaluated on video text blocks in five different scripts (Arabic, English, Urdu, Hindi and Chinese) reported promising recognition rates. In addition to the performance of individual textural features, different combinations of texture measures were investigated which realized interesting results.