基于区域增长算法的广告展示板内容提取

Ramesh M. Badiger, Manjunath Y. Kammar, Ningappa T. Pujar
{"title":"基于区域增长算法的广告展示板内容提取","authors":"Ramesh M. Badiger, Manjunath Y. Kammar, Ningappa T. Pujar","doi":"10.1109/ICAECCT.2016.7942603","DOIUrl":null,"url":null,"abstract":"In recent years portable camera devices have gained increased popularity and embedded visual processing, text extraction from natural scene images like advertisement display boards, government office boards has become a key problem in everyday lives. The problem is challenging in nature due to variations in the font size and color, text alignment, illumination change and reflections. Here a novel method for extraction of text from advertisement display boards using Region growing algorithm is proposed. The proposed algorithm is generally composed of four stages, the colored image is converted into grayscale image and canny edge method is used to detect the edges of the image, the edge detected image is preprocessed by applying morphological operations and rule based method is used to remove the non text objects based on width, height and area, later finding the centroid point of the connected component of identified objects and finally proposed algorithm region growing method is used to start extracting the characters. The method is robust and insensitive to noise, blur, variation in font size and style, color, uneven thickness, and varying lightning conditions. The text extraction accuracy of 90.94% is achieved.","PeriodicalId":6629,"journal":{"name":"2016 IEEE International Conference on Advances in Electronics, Communication and Computer Technology (ICAECCT)","volume":"17 1","pages":"303-307"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Content extraction from advertisement display boards utilizing Region growing algorithm\",\"authors\":\"Ramesh M. Badiger, Manjunath Y. Kammar, Ningappa T. Pujar\",\"doi\":\"10.1109/ICAECCT.2016.7942603\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years portable camera devices have gained increased popularity and embedded visual processing, text extraction from natural scene images like advertisement display boards, government office boards has become a key problem in everyday lives. The problem is challenging in nature due to variations in the font size and color, text alignment, illumination change and reflections. Here a novel method for extraction of text from advertisement display boards using Region growing algorithm is proposed. The proposed algorithm is generally composed of four stages, the colored image is converted into grayscale image and canny edge method is used to detect the edges of the image, the edge detected image is preprocessed by applying morphological operations and rule based method is used to remove the non text objects based on width, height and area, later finding the centroid point of the connected component of identified objects and finally proposed algorithm region growing method is used to start extracting the characters. The method is robust and insensitive to noise, blur, variation in font size and style, color, uneven thickness, and varying lightning conditions. The text extraction accuracy of 90.94% is achieved.\",\"PeriodicalId\":6629,\"journal\":{\"name\":\"2016 IEEE International Conference on Advances in Electronics, Communication and Computer Technology (ICAECCT)\",\"volume\":\"17 1\",\"pages\":\"303-307\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE International Conference on Advances in Electronics, Communication and Computer Technology (ICAECCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAECCT.2016.7942603\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Advances in Electronics, Communication and Computer Technology (ICAECCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAECCT.2016.7942603","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

近年来,便携式摄像设备越来越普及,嵌入式视觉处理,从广告展板、政府办公板等自然场景图像中提取文本已成为人们日常生活中的关键问题。由于字体大小和颜色、文本对齐、照明变化和反射的变化,这个问题本质上是具有挑战性的。本文提出了一种基于区域增长算法的广告展示板文本提取方法。该算法一般由四个阶段组成:将彩色图像转换为灰度图像,使用canny边缘法对图像进行边缘检测,对检测到的图像进行形态学预处理,并使用基于规则的方法对基于宽度、高度和面积的非文本对象进行去除;然后找到识别对象的连通分量的质心点,最后采用提出的算法区域生长法开始提取特征。该方法具有鲁棒性,对噪声、模糊、字体大小和样式变化、颜色、厚度不均匀和闪电条件变化不敏感。文本提取准确率达到90.94%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Content extraction from advertisement display boards utilizing Region growing algorithm
In recent years portable camera devices have gained increased popularity and embedded visual processing, text extraction from natural scene images like advertisement display boards, government office boards has become a key problem in everyday lives. The problem is challenging in nature due to variations in the font size and color, text alignment, illumination change and reflections. Here a novel method for extraction of text from advertisement display boards using Region growing algorithm is proposed. The proposed algorithm is generally composed of four stages, the colored image is converted into grayscale image and canny edge method is used to detect the edges of the image, the edge detected image is preprocessed by applying morphological operations and rule based method is used to remove the non text objects based on width, height and area, later finding the centroid point of the connected component of identified objects and finally proposed algorithm region growing method is used to start extracting the characters. The method is robust and insensitive to noise, blur, variation in font size and style, color, uneven thickness, and varying lightning conditions. The text extraction accuracy of 90.94% is achieved.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信