Text detection in natural scene images using two masks filtering
Authors: Houssem Turki, Mohamed Ben Halima, A. Alimi
Published in: 2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA), 2016-11-01
DOI: https://doi.org/10.1109/AICCSA.2016.7945644
Text detection in natural scenes is an important research problem and remains challenging because of variations in character size, font, line orientation, and illumination, as well as weak characters and complex image backgrounds. The contribution of the proposed method is to filter out complex backgrounds with two mask filters: the first is based on a text confidence map, and the second on multi-channel maximally stable extremal regions (MSERs). Both steps are designed to enhance the candidate text pixel zones and maximize their capacity to distinguish text boxes from the rest of the image. Non-text components are then filtered out by classifying character candidates with a Support Vector Machine (SVM) using HOG features. Remaining false positives are eliminated using the geometrical properties of text blocks. Finally, we apply bounding box localization after a word grouping stage. The proposed method has been evaluated on the ICDAR 2013 scene text detection competition dataset, and the encouraging experimental results demonstrate its robustness.
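The abstract does not give the geometric rules or the grouping procedure, so the following is only a minimal sketch of what the last two stages (geometric false-positive filtering and word grouping into bounding boxes) might look like. The `Box` type, the function names, and every threshold (`min_area`, `min_aspect`, `max_aspect`, `max_gap_ratio`, `min_overlap_ratio`) are hypothetical and chosen for illustration; they are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Box:
    """Axis-aligned bounding box of one character candidate (hypothetical type)."""
    x: int  # left edge
    y: int  # top edge
    w: int  # width
    h: int  # height

    @property
    def right(self) -> int:
        return self.x + self.w

    @property
    def bottom(self) -> int:
        return self.y + self.h


def plausible_character(box: Box, min_area: int = 30,
                        min_aspect: float = 0.1,
                        max_aspect: float = 2.5) -> bool:
    """Geometric test on one candidate. Thresholds are illustrative assumptions."""
    if box.w <= 0 or box.h <= 0:
        return False
    aspect = box.w / box.h
    return box.w * box.h >= min_area and min_aspect <= aspect <= max_aspect


def same_word(word: Box, char: Box, max_gap_ratio: float = 0.6,
              min_overlap_ratio: float = 0.5) -> bool:
    """Assumed rule: `char` joins `word` if the horizontal gap is small and the
    vertical overlap is large, both relative to the smaller of the two heights."""
    min_h = min(word.h, char.h)
    gap = char.x - word.right
    overlap = min(word.bottom, char.bottom) - max(word.y, char.y)
    return gap <= max_gap_ratio * min_h and overlap >= min_overlap_ratio * min_h


def merge(a: Box, b: Box) -> Box:
    """Smallest box that encloses both inputs."""
    x0, y0 = min(a.x, b.x), min(a.y, b.y)
    x1, y1 = max(a.right, b.right), max(a.bottom, b.bottom)
    return Box(x0, y0, x1 - x0, y1 - y0)


def group_into_words(candidates: List[Box]) -> List[Box]:
    """Drop implausible candidates, then greedily merge left to right."""
    chars = sorted((b for b in candidates if plausible_character(b)),
                   key=lambda b: b.x)
    words: List[Box] = []
    for char in chars:
        for i, word in enumerate(words):
            if same_word(word, char):
                words[i] = merge(word, char)
                break
        else:
            words.append(char)  # no existing word is close enough
    return words


candidates = [
    Box(10, 10, 8, 12), Box(20, 10, 8, 12), Box(30, 11, 8, 12),  # first word
    Box(100, 10, 8, 12), Box(110, 10, 8, 12),                    # second word
    Box(60, 5, 200, 3),   # long thin streak, rejected by aspect ratio
    Box(50, 50, 2, 2),    # tiny speck, rejected by area
]
words = group_into_words(candidates)
print(words)
```

In this sketch the streak and the speck are discarded before grouping, and the five remaining candidates collapse into two word boxes. A greedy left-to-right merge is only one possible grouping strategy; it assumes roughly horizontal text and would need adapting for the varied line orientations the abstract mentions.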