Modified Stroke Width Transform for Thai Text Detection

Taravichet Titijaroonroj
{"title":"Modified Stroke Width Transform for Thai Text Detection","authors":"Taravichet Titijaroonroj","doi":"10.23919/INCIT.2018.8584869","DOIUrl":null,"url":null,"abstract":"Stroke width transform (SWT) is widely used for detecting the candidate text in a natural scene image before forwarding it to character recognition to convert to editable text. It is an important part of an OCR application. However, the main problem affecting its performance is inconstant stroke width. This problem originates from inaccurate stroke width map. Due to this inaccurate map, the candidate text may get rejected in the SWT filtering step even though it is so. This leads to lower SWT performance. In order to solve this problem, we propose a modified stroke width transform (MSWT). This method divides a stroke width map from the candidate object into several sub-regions and computes the variance and mean of each sub-region instead of those of the whole stroke width map. This can reduce the destructive effect from inaccurately computed stroke width. Based on 100 images of natural scene with embedded Thai text, an experiment was conducted to comparatively evaluate the performance of MSWT against SWT in Thai text detection. The experimental results show that the recall rate of the proposed method was higher than that of the conventional SWT method.","PeriodicalId":144271,"journal":{"name":"2018 International Conference on Information Technology (InCIT)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Information Technology (InCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/INCIT.2018.8584869","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Stroke width transform (SWT) is widely used for detecting the candidate text in a natural scene image before forwarding it to character recognition to convert to editable text. It is an important part of an OCR application. However, the main problem affecting its performance is inconstant stroke width. This problem originates from inaccurate stroke width map. Due to this inaccurate map, the candidate text may get rejected in the SWT filtering step even though it is so. This leads to lower SWT performance. In order to solve this problem, we propose a modified stroke width transform (MSWT). This method divides a stroke width map from the candidate object into several sub-regions and computes the variance and mean of each sub-region instead of those of the whole stroke width map. This can reduce the destructive effect from inaccurately computed stroke width. Based on 100 images of natural scene with embedded Thai text, an experiment was conducted to comparatively evaluate the performance of MSWT against SWT in Thai text detection. The experimental results show that the recall rate of the proposed method was higher than that of the conventional SWT method.
修改的笔画宽度变换的泰国文本检测
笔画宽度变换(SWT)广泛用于检测自然场景图像中的候选文本,然后将其转发给字符识别以转换为可编辑文本。它是OCR应用程序的重要组成部分。但影响其性能的主要问题是行程宽度不恒定。这一问题的根源在于笔画宽度图不准确。由于这种不准确的映射,候选文本可能会在SWT过滤步骤中被拒绝,尽管事实就是如此。这将导致较低的SWT性能。为了解决这个问题,我们提出了一种改进的笔画宽度变换(MSWT)。该方法将候选对象的笔画宽度图分成若干个子区域,计算每个子区域的方差和平均值,而不是计算整个笔画宽度图的方差和平均值。这可以减少不准确计算的冲程宽度的破坏性影响。基于100幅嵌入了泰文文本的自然场景图像,通过实验比较了MSWT和SWT在泰文文本检测中的性能。实验结果表明,该方法的查全率高于传统的SWT方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信