Augmented text character proposals and convolutional neural networks for text spotting from scene images

2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR) Pub Date : 2015-11-01 DOI:10.1109/ACPR.2015.7486493

Alessandro Zamberletti, I. Gallo, L. Noce


Citations: 14

Abstract


Augmented text character proposals and convolutional neural networks for text spotting from scene images

In this work we propose a novel method for text spotting from scene images, based on augmented Multi-resolution Maximally Stable Extremal Regions and Convolutional Neural Networks. The goal is to augment text character proposals so as to maximize their coverage rate over text elements in scene images, obtaining satisfactory text detection rates without requiring very deep architectures or large amounts of training data. By applying simple and fast geometric transformations to multi-resolution proposals, our system achieves good results on several challenging datasets while remaining computationally efficient to train and test on a desktop computer.
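The abstract does not say which geometric transformations are used, so the following is only an illustrative sketch of the general idea: a character proposal box is expanded into several scaled and shifted variants, and coverage of a text element is measured as the best overlap any variant achieves. The names `augment`, `iou`, and `best_coverage`, the scale and shift values, and the use of intersection-over-union as the coverage measure are all assumptions made for this example, not details taken from the paper.

```python
from typing import List, Sequence, Tuple

# A box is (x, y, width, height); this representation is an assumption.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    inter_w = max(0.0, min(ax + aw, bx + bw) - max(ax, bx))
    inter_h = max(0.0, min(ay + ah, by + bh) - max(ay, by))
    inter = inter_w * inter_h
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


def augment(
    box: Box,
    scales: Sequence[float] = (0.8, 1.0, 1.25),   # hypothetical values
    shifts: Sequence[float] = (-0.1, 0.0, 0.1),   # fractions of box size
) -> List[Box]:
    """Generate scaled and shifted variants of one proposal box."""
    x, y, w, h = box
    cx, cy = x + w / 2, y + h / 2
    variants = []
    for s in scales:
        nw, nh = w * s, h * s
        for dx in shifts:
            for dy in shifts:
                ncx, ncy = cx + dx * w, cy + dy * h
                variants.append((ncx - nw / 2, ncy - nh / 2, nw, nh))
    return variants


def best_coverage(proposals: Sequence[Box], target: Box) -> float:
    """Best overlap that any proposal achieves with the target box."""
    return max(iou(p, target) for p in proposals)
```

In this sketch the unmodified box is always among the variants, so the augmented set can never cover a target worse than the original proposal does; the trade-off is that each proposal becomes 27 candidates for the classifier to score.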

