WEB Image Classification Based on the Fusion of Image and Text Classifiers

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Pub Date : 2007-09-23 DOI:10.1109/ICDAR.2007.264

P. R. Kalva, F. Enembreck, Alessandro Lameiras Koerich

引用次数: 23

Abstract

This paper presents a novel method for the classification of images that combines information extracted from the images and contextual information. The main hypothesis is that contextual information related to an image can contribute in the image classification process. First, independent classifiers are designed to deal with images and text. From the images color, shape and texture features are extracted. These features are used with a neural network (NN) classifier to carry out image classification. On the other hand, contextual information is processed and used with a Naive Bayes (NB) classifier. At the end, the outputs of both classifiers are combined through heuristic rules. Experimental results on a database of more than 5,000 HTML documents have shown that the combination of classifiers provides a meaningful improvement (about 16%) in the correct image classification rate relative to the results provided by the NN classifier alone.

查看原文本刊更多论文

基于图像和文本分类器融合的WEB图像分类

本文提出了一种将图像提取信息与上下文信息相结合的图像分类方法。主要假设是与图像相关的上下文信息可以在图像分类过程中发挥作用。首先，设计独立的分类器来处理图像和文本。从图像中提取颜色、形状和纹理特征。将这些特征与神经网络(NN)分类器一起进行图像分类。另一方面，使用朴素贝叶斯(NB)分类器处理和使用上下文信息。最后，通过启发式规则将两个分类器的输出组合起来。在超过5000个HTML文档的数据库上的实验结果表明，相对于单独使用NN分类器提供的结果，分类器的组合在正确的图像分类率方面提供了有意义的提高(约16%)。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

自引率

0.00%

发文量