P. R. Kalva, F. Enembreck, Alessandro Lameiras Koerich
WEB Image Classification Based on the Fusion of Image and Text Classifiers
Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), 2007-09-23
DOI: 10.1109/ICDAR.2007.264 (https://doi.org/10.1109/ICDAR.2007.264)
Citations: 23
Abstract
This paper presents a novel method for image classification that combines information extracted from the images with contextual information. The main hypothesis is that contextual information related to an image can contribute to the image classification process. First, independent classifiers are designed for images and for text. Color, shape, and texture features are extracted from the images and used with a neural network (NN) classifier to carry out image classification. The contextual information, in turn, is processed and used with a Naive Bayes (NB) classifier. Finally, the outputs of both classifiers are combined through heuristic rules. Experimental results on a database of more than 5,000 HTML documents show that the combination of classifiers provides a meaningful improvement (about 16%) in the correct image classification rate relative to the NN classifier alone.
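The abstract does not state the heuristic rules used to combine the two classifiers, so the following is only a minimal sketch of what a rule-based fusion of this kind might look like. The function names (`top_label`, `fuse`), the `min_text_conf` threshold, and the rules themselves are assumptions made for illustration, not the authors' method. Each classifier is assumed to output a dictionary of per-class scores in [0, 1].

```python
from typing import Dict, Tuple


def top_label(scores: Dict[str, float]) -> Tuple[str, float]:
    """Return the highest-scoring class and its score."""
    label = max(scores, key=lambda k: scores[k])
    return label, scores[label]


def fuse(image_scores: Dict[str, float],
         text_scores: Dict[str, float],
         min_text_conf: float = 0.6) -> str:
    """Combine image (NN) and text (NB) outputs with hypothetical rules.

    Rule 1: if both classifiers agree, keep the shared label.
    Rule 2: if they disagree, prefer the text label only when it is
            confident enough (>= min_text_conf) and more confident
            than the image classifier.
    Rule 3: otherwise fall back to the image label.
    These rules are illustrative assumptions, not taken from the paper.
    """
    img_label, img_conf = top_label(image_scores)
    txt_label, txt_conf = top_label(text_scores)

    if img_label == txt_label:
        return img_label
    if txt_conf >= min_text_conf and txt_conf > img_conf:
        return txt_label
    return img_label


if __name__ == "__main__":
    # Hypothetical scores for one image and its surrounding HTML text.
    nn_out = {"people": 0.55, "landscape": 0.45}
    nb_out = {"people": 0.10, "landscape": 0.90}
    print(fuse(nn_out, nb_out))
```

In this sketch the image classifier remains the default decision and the text classifier overrides it only when it is clearly more confident, which is one simple way to let contextual information correct uncertain image-only predictions.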