Integrating Text Recognition for Overlapping Text Detection in Maps

N. Nazari, Tianxiang Tan, Yao-Yi Chiang
{"title":"Integrating Text Recognition for Overlapping Text Detection in Maps","authors":"N. Nazari, Tianxiang Tan, Yao-Yi Chiang","doi":"10.2352/ISSN.2470-1173.2016.17.DRR-061","DOIUrl":null,"url":null,"abstract":"Detecting overlapping text from map images is a challenging problem. Previous algorithms generally assume specific cartographic styles (e.g., road shapes and text format) and are difficult to adjust for handling different map types. In this paper, we build on our previous text recognition work, Strabo, to develop an algorithm for detecting overlapping characters from non-text symbols. We call this algorithm Overlapping Text Detection (OTD). OTD uses the recognition results and locations of detected text labels (from Strabo) to detect potential areas that contain overlapping text. Next, OTD classifies these areas as either text or non-text regions based on their shape descriptions (including the ratio of number of foreground pixels to area size, number of connected components, and number of holes). The average precision and recall of OTD in classifying text and non-text regions were 77% and 86%, respectively. We show that OTD improved the precision and recall of text detection in Strabo by 19% and 41%, respectively, and produced higher accuracy compared to a state-ofthe-art text/graphic separation algorithm.","PeriodicalId":152377,"journal":{"name":"Document Recognition and Retrieval","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Document Recognition and Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2352/ISSN.2470-1173.2016.17.DRR-061","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

Detecting overlapping text from map images is a challenging problem. Previous algorithms generally assume specific cartographic styles (e.g., road shapes and text format) and are difficult to adjust for handling different map types. In this paper, we build on our previous text recognition work, Strabo, to develop an algorithm for detecting overlapping characters from non-text symbols. We call this algorithm Overlapping Text Detection (OTD). OTD uses the recognition results and locations of detected text labels (from Strabo) to detect potential areas that contain overlapping text. Next, OTD classifies these areas as either text or non-text regions based on their shape descriptions (including the ratio of number of foreground pixels to area size, number of connected components, and number of holes). The average precision and recall of OTD in classifying text and non-text regions were 77% and 86%, respectively. We show that OTD improved the precision and recall of text detection in Strabo by 19% and 41%, respectively, and produced higher accuracy compared to a state-ofthe-art text/graphic separation algorithm.
集成文本识别的地图重叠文本检测
从地图图像中检测重叠文本是一个具有挑战性的问题。以前的算法通常假设特定的地图样式(例如,道路形状和文本格式),并且难以调整以处理不同的地图类型。在本文中,我们以之前的文本识别工作Strabo为基础,开发了一种从非文本符号中检测重叠字符的算法。我们称这种算法为重叠文本检测(OTD)。OTD使用识别结果和检测到的文本标签的位置(来自Strabo)来检测包含重叠文本的潜在区域。接下来,OTD根据这些区域的形状描述(包括前景像素与区域大小的比例、连接组件的数量和孔的数量)将这些区域分类为文本区域或非文本区域。OTD对文本和非文本区域分类的平均准确率和召回率分别为77%和86%。研究表明,与现有的文本/图形分离算法相比,OTD将Strabo中文本检测的准确率和召回率分别提高了19%和41%,并产生了更高的准确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信