Scene Text Detection Based on Text Stroke Components.

Xinyue Hou, Pengsen Cheng, Hongyu Gao, Xin Li, Jiayong Liu
{"title":"Scene Text Detection Based on Text Stroke Components.","authors":"Xinyue Hou, Pengsen Cheng, Hongyu Gao, Xin Li, Jiayong Liu","doi":"10.1142/S0129065725500200","DOIUrl":null,"url":null,"abstract":"<p><p>The detection of scene text holds significant importance across a variety of application scenarios. However, previous methods were insufficient for detecting and recognizing text instances, such as variations in text size, chaotic background and diverse text orientations. To address these challenges, this paper proposes a novel methodology based on Text Stroke Components (TSC). The method leverages Harris corner detection to identify critical points of text strokes, such as endpoints, turning points, and curvatures. By analyzing the clustered regions of these points, the approach effectively localizes text characters. To enhance the detection process, a transparency parameter [Formula: see text] is introduced to control the fusion between original images and corner-detection images. This improves the localization of key stroke points, and reduces background noise interference. The proposed method is evaluated through extensive experiments, demonstrating superior performance compared to existing scene text detectors. Furthermore, the method is jointly trained with the ABINet recognition model across all stages. Comprehensive experiments conducted on 13 datasets reveal that this approach significantly outperforms SOTA methods. These results underscore the advantages of using text stroke components for key-point localization through the corner detection algorithm in scene text detection.</p>","PeriodicalId":94052,"journal":{"name":"International journal of neural systems","volume":"35 5","pages":"2550020"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of neural systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/S0129065725500200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The detection of scene text holds significant importance across a variety of application scenarios. However, previous methods were insufficient for detecting and recognizing text instances, such as variations in text size, chaotic background and diverse text orientations. To address these challenges, this paper proposes a novel methodology based on Text Stroke Components (TSC). The method leverages Harris corner detection to identify critical points of text strokes, such as endpoints, turning points, and curvatures. By analyzing the clustered regions of these points, the approach effectively localizes text characters. To enhance the detection process, a transparency parameter [Formula: see text] is introduced to control the fusion between original images and corner-detection images. This improves the localization of key stroke points, and reduces background noise interference. The proposed method is evaluated through extensive experiments, demonstrating superior performance compared to existing scene text detectors. Furthermore, the method is jointly trained with the ABINet recognition model across all stages. Comprehensive experiments conducted on 13 datasets reveal that this approach significantly outperforms SOTA methods. These results underscore the advantages of using text stroke components for key-point localization through the corner detection algorithm in scene text detection.

基于文本笔画分量的场景文本检测。
场景文本的检测在各种应用场景中都具有重要意义。然而,以往的方法在检测和识别文本实例方面存在不足,如文本大小的变化、背景的混乱和文本方向的变化等。为了解决这些问题,本文提出了一种基于文本笔画分量(TSC)的新方法。该方法利用哈里斯角点检测来识别文本笔画的关键点,如端点、转折点和曲率。通过分析这些点的聚类区域,该方法可以有效地定位文本字符。为了增强检测过程,引入透明度参数[公式:见文]来控制原始图像与角点检测图像的融合。这提高了键行程点的定位,减少了背景噪声干扰。通过大量的实验对该方法进行了评估,与现有的场景文本检测器相比,该方法具有更好的性能。此外,该方法还与ABINet识别模型进行了跨阶段的联合训练。在13个数据集上进行的综合实验表明,该方法明显优于SOTA方法。这些结果强调了在场景文本检测中,通过角点检测算法使用文本笔画分量进行关键点定位的优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信