文本检测在自然场景图像中使用两个蒙版滤波

2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA) Pub Date : 2016-11-01 DOI:10.1109/AICCSA.2016.7945644

Houssem Turki, Mohamed Ben Halima, A. Alimi

{"title":"文本检测在自然场景图像中使用两个蒙版滤波","authors":"Houssem Turki, Mohamed Ben Halima, A. Alimi","doi":"10.1109/AICCSA.2016.7945644","DOIUrl":null,"url":null,"abstract":"Text detection in natural scenes holds great importance in the field of research and still remains a challenge because of size, various fonts, line orientation, different illumination conditions, weak character and complex background in image. The contribution of the proposed method is filtering out complex backgrounds by utilizing two masks filtering based on text confidence map in the first step and multi-channel maximally stable extremal regions (MSERs) in the second step. Both steps are designed to enhancement, maximize capacity of zones text pixels candidates to distinguish text boxes from the rest of the image. Then non-text components are filtered by the classification of character candidate based on Support Vector Machines (SVM) using HOG features. The false positives are eliminated by geometrical properties of text blocks. Finally we apply boundary box localization after a stage of word grouping. The proposed method has been evaluated on ICDAR 2013 scene text detection competition dataset and the encouraging experiments results demonstrate the robustness of our method.","PeriodicalId":448329,"journal":{"name":"2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Text detection in natural scene images using two masks filtering\",\"authors\":\"Houssem Turki, Mohamed Ben Halima, A. Alimi\",\"doi\":\"10.1109/AICCSA.2016.7945644\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Text detection in natural scenes holds great importance in the field of research and still remains a challenge because of size, various fonts, line orientation, different illumination conditions, weak character and complex background in image. The contribution of the proposed method is filtering out complex backgrounds by utilizing two masks filtering based on text confidence map in the first step and multi-channel maximally stable extremal regions (MSERs) in the second step. Both steps are designed to enhancement, maximize capacity of zones text pixels candidates to distinguish text boxes from the rest of the image. Then non-text components are filtered by the classification of character candidate based on Support Vector Machines (SVM) using HOG features. The false positives are eliminated by geometrical properties of text blocks. Finally we apply boundary box localization after a stage of word grouping. The proposed method has been evaluated on ICDAR 2013 scene text detection competition dataset and the encouraging experiments results demonstrate the robustness of our method.\",\"PeriodicalId\":448329,\"journal\":{\"name\":\"2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AICCSA.2016.7945644\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICCSA.2016.7945644","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

摘要

自然场景中的文本检测在研究领域中占有重要地位，但由于图像的大小、字体、线条方向、光照条件不同、字符弱、背景复杂等原因，文本检测仍然是一个挑战。该方法的贡献在于首先利用基于文本置信度图的双掩模滤波和第二步利用多通道最大稳定极值区域(mser)滤除复杂背景。这两个步骤都旨在增强，最大化区域文本像素候选的容量，以区分文本框与图像的其余部分。然后利用HOG特征，利用支持向量机(SVM)对候选字符进行分类，过滤非文本成分;通过文本块的几何特性消除了误报。最后在分组后进行边界盒定位。在ICDAR 2013场景文本检测竞争数据集上对该方法进行了评估，实验结果表明了该方法的鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Text detection in natural scene images using two masks filtering

Text detection in natural scenes holds great importance in the field of research and still remains a challenge because of size, various fonts, line orientation, different illumination conditions, weak character and complex background in image. The contribution of the proposed method is filtering out complex backgrounds by utilizing two masks filtering based on text confidence map in the first step and multi-channel maximally stable extremal regions (MSERs) in the second step. Both steps are designed to enhancement, maximize capacity of zones text pixels candidates to distinguish text boxes from the rest of the image. Then non-text components are filtered by the classification of character candidate based on Support Vector Machines (SVM) using HOG features. The false positives are eliminated by geometrical properties of text blocks. Finally we apply boundary box localization after a stage of word grouping. The proposed method has been evaluated on ICDAR 2013 scene text detection competition dataset and the encouraging experiments results demonstrate the robustness of our method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)

自引率

0.00%

发文量