Fisher Kernels for Handwritten Word-spotting

2009 10th International Conference on Document Analysis and Recognition Pub Date : 2009-07-26 DOI:10.1109/ICDAR.2009.16

F. Perronnin, José A. Rodríguez-Serrano

引用次数: 41

Abstract

The Fisher kernel is a generic framework which combines the benefits of generative and discriminative approaches to pattern classification. In this contribution, we propose to apply this framework to handwritten word-spotting. Given a word image and a keyword generative model, the idea is to generate a vector which describes how the parameters of the keyword model should be modified to best fit the word image.This vector can then be used as the input of a discriminative classifier. We compare the performance of the proposed approach with that of a generative baseline on a challenging real-world dataset of customer letters. When the kernel used by the classifier is linear, the performance improvement is marginal but the proposed system is approximately 15 times faster than the baseline. If we use a non-linear kernel devised for this task, we obtain a 15\% relative reduction of the error but the detector is approximately 15 times slower.

查看原文本刊更多论文

用于手写单词识别的Fisher核函数

Fisher核是一个通用框架，它结合了模式分类的生成和判别方法的优点。在这篇文章中，我们建议将这个框架应用于手写单词识别。给定一个单词图像和一个关键字生成模型，其思想是生成一个矢量，该矢量描述了如何修改关键字模型的参数以最适合单词图像。这个向量可以用作判别分类器的输入。我们将所提出的方法的性能与在具有挑战性的真实客户信件数据集上生成基线的性能进行了比较。当分类器使用的内核是线性的时，性能的提高是微不足道的，但是所提出的系统比基线快了大约15倍。如果我们使用为这项任务设计的非线性内核，我们可以获得15%的相对误差减少，但检测器的速度大约要慢15倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 10th International Conference on Document Analysis and Recognition

自引率

0.00%

发文量