Script recognition in images with complex backgrounds

Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005. Pub Date : 2005-12-21 DOI:10.1109/ISSPIT.2005.1577163

J. Gllavata, Bernd Freisleben

引用次数: 30

Abstract

The extraction of textual information from images and videos is an important task for automatic content-based indexing and retrieval purposes. To extract text from images or videos coming from unknown international sources, it is necessary to know the script beforehand in order to employ suitable text segmentation and optical character recognition (OCR) methods. In this paper, we present an approach for discriminating between Latin and Ideographic script. The proposed approach proceeds as follows: first, the text present in an image is localized. Then, a set of low-level features is extracted from the localized text image. Finally, based on the extracted features, the decision about the type of the script is made using a k-nearest neighbour classifier. Initial experimental results for a set of images containing text of different scripts demonstrate the good performance of the proposed solution

查看原文本刊更多论文

具有复杂背景图像的脚本识别

从图像和视频中提取文本信息是实现基于内容的自动索引和检索的重要任务。为了从未知国际来源的图像或视频中提取文本，需要事先了解文本，以便采用合适的文本分割和光学字符识别(OCR)方法。本文提出了一种区分拉丁文和表意文字的方法。本文提出的方法如下:首先，对图像中的文本进行定位。然后，从定位后的文本图像中提取一组低级特征。最后，基于提取的特征，使用k近邻分类器决定脚本的类型。对一组包含不同脚本文本的图像进行了初步实验，结果表明该方法具有良好的性能

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.

自引率

0.00%

发文量