Text detection in natural scene images using two masks filtering

Houssem Turki, Mohamed Ben Halima, A. Alimi
{"title":"Text detection in natural scene images using two masks filtering","authors":"Houssem Turki, Mohamed Ben Halima, A. Alimi","doi":"10.1109/AICCSA.2016.7945644","DOIUrl":null,"url":null,"abstract":"Text detection in natural scenes holds great importance in the field of research and still remains a challenge because of size, various fonts, line orientation, different illumination conditions, weak character and complex background in image. The contribution of the proposed method is filtering out complex backgrounds by utilizing two masks filtering based on text confidence map in the first step and multi-channel maximally stable extremal regions (MSERs) in the second step. Both steps are designed to enhancement, maximize capacity of zones text pixels candidates to distinguish text boxes from the rest of the image. Then non-text components are filtered by the classification of character candidate based on Support Vector Machines (SVM) using HOG features. The false positives are eliminated by geometrical properties of text blocks. Finally we apply boundary box localization after a stage of word grouping. The proposed method has been evaluated on ICDAR 2013 scene text detection competition dataset and the encouraging experiments results demonstrate the robustness of our method.","PeriodicalId":448329,"journal":{"name":"2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICCSA.2016.7945644","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

Text detection in natural scenes holds great importance in the field of research and still remains a challenge because of size, various fonts, line orientation, different illumination conditions, weak character and complex background in image. The contribution of the proposed method is filtering out complex backgrounds by utilizing two masks filtering based on text confidence map in the first step and multi-channel maximally stable extremal regions (MSERs) in the second step. Both steps are designed to enhancement, maximize capacity of zones text pixels candidates to distinguish text boxes from the rest of the image. Then non-text components are filtered by the classification of character candidate based on Support Vector Machines (SVM) using HOG features. The false positives are eliminated by geometrical properties of text blocks. Finally we apply boundary box localization after a stage of word grouping. The proposed method has been evaluated on ICDAR 2013 scene text detection competition dataset and the encouraging experiments results demonstrate the robustness of our method.
文本检测在自然场景图像中使用两个蒙版滤波
自然场景中的文本检测在研究领域中占有重要地位,但由于图像的大小、字体、线条方向、光照条件不同、字符弱、背景复杂等原因,文本检测仍然是一个挑战。该方法的贡献在于首先利用基于文本置信度图的双掩模滤波和第二步利用多通道最大稳定极值区域(mser)滤除复杂背景。这两个步骤都旨在增强,最大化区域文本像素候选的容量,以区分文本框与图像的其余部分。然后利用HOG特征,利用支持向量机(SVM)对候选字符进行分类,过滤非文本成分;通过文本块的几何特性消除了误报。最后在分组后进行边界盒定位。在ICDAR 2013场景文本检测竞争数据集上对该方法进行了评估,实验结果表明了该方法的鲁棒性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信