Augmented text character proposals and convolutional neural networks for text spotting from scene images

2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR) Pub Date : 2015-11-01 DOI:10.1109/ACPR.2015.7486493

Alessandro Zamberletti, I. Gallo, L. Noce

引用次数: 14

Abstract

In this work we propose a novel method for text spotting from scene images based on augmented Multi-resolution Maximally Stable Extremal Regions and Convolutional Neural Networks. The goal of this work is augmenting text character proposals to maximize their coverage rate over text elements in scene images, to obtain satisfying text detection rates without the need of using very deep architectures nor large amount of training data. Using simple and fast geometric transformations on multi-resolution proposals our system achieves good results for several challenging datasets while also being computationally efficient to train and test on a desktop computer.

查看原文本刊更多论文

增强文本字符建议和卷积神经网络在场景图像文本识别中的应用

在这项工作中，我们提出了一种基于增强多分辨率最大稳定极值区域和卷积神经网络的场景图像文本识别新方法。这项工作的目标是增强文本字符建议，以最大限度地提高它们对场景图像中文本元素的覆盖率，从而在不需要使用非常深的架构或大量训练数据的情况下获得令人满意的文本检测率。在多分辨率建议上使用简单快速的几何变换，我们的系统在几个具有挑战性的数据集上取得了良好的结果，同时在台式计算机上进行训练和测试的计算效率也很高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR)

自引率

0.00%

发文量