Scene Text Detection Based on Text Stroke Components.

IF 6.4

International journal of neural systems Pub Date : 2025-05-01 DOI:10.1142/S0129065725500200

Xinyue Hou, Pengsen Cheng, Hongyu Gao, Xin Li, Jiayong Liu

{"title":"Scene Text Detection Based on Text Stroke Components.","authors":"Xinyue Hou, Pengsen Cheng, Hongyu Gao, Xin Li, Jiayong Liu","doi":"10.1142/S0129065725500200","DOIUrl":null,"url":null,"abstract":"<p><p>The detection of scene text holds significant importance across a variety of application scenarios. However, previous methods were insufficient for detecting and recognizing text instances, such as variations in text size, chaotic background and diverse text orientations. To address these challenges, this paper proposes a novel methodology based on Text Stroke Components (TSC). The method leverages Harris corner detection to identify critical points of text strokes, such as endpoints, turning points, and curvatures. By analyzing the clustered regions of these points, the approach effectively localizes text characters. To enhance the detection process, a transparency parameter [Formula: see text] is introduced to control the fusion between original images and corner-detection images. This improves the localization of key stroke points, and reduces background noise interference. The proposed method is evaluated through extensive experiments, demonstrating superior performance compared to existing scene text detectors. Furthermore, the method is jointly trained with the ABINet recognition model across all stages. Comprehensive experiments conducted on 13 datasets reveal that this approach significantly outperforms SOTA methods. These results underscore the advantages of using text stroke components for key-point localization through the corner detection algorithm in scene text detection.</p>","PeriodicalId":94052,"journal":{"name":"International journal of neural systems","volume":"35 5","pages":"2550020"},"PeriodicalIF":6.4000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of neural systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/S0129065725500200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The detection of scene text holds significant importance across a variety of application scenarios. However, previous methods were insufficient for detecting and recognizing text instances, such as variations in text size, chaotic background and diverse text orientations. To address these challenges, this paper proposes a novel methodology based on Text Stroke Components (TSC). The method leverages Harris corner detection to identify critical points of text strokes, such as endpoints, turning points, and curvatures. By analyzing the clustered regions of these points, the approach effectively localizes text characters. To enhance the detection process, a transparency parameter [Formula: see text] is introduced to control the fusion between original images and corner-detection images. This improves the localization of key stroke points, and reduces background noise interference. The proposed method is evaluated through extensive experiments, demonstrating superior performance compared to existing scene text detectors. Furthermore, the method is jointly trained with the ABINet recognition model across all stages. Comprehensive experiments conducted on 13 datasets reveal that this approach significantly outperforms SOTA methods. These results underscore the advantages of using text stroke components for key-point localization through the corner detection algorithm in scene text detection.

查看原文本刊更多论文

基于文本笔画分量的场景文本检测。

场景文本的检测在各种应用场景中都具有重要意义。然而，以往的方法在检测和识别文本实例方面存在不足，如文本大小的变化、背景的混乱和文本方向的变化等。为了解决这些问题，本文提出了一种基于文本笔画分量（TSC）的新方法。该方法利用哈里斯角点检测来识别文本笔画的关键点，如端点、转折点和曲率。通过分析这些点的聚类区域，该方法可以有效地定位文本字符。为了增强检测过程，引入透明度参数[公式：见文]来控制原始图像与角点检测图像的融合。这提高了键行程点的定位，减少了背景噪声干扰。通过大量的实验对该方法进行了评估，与现有的场景文本检测器相比，该方法具有更好的性能。此外，该方法还与ABINet识别模型进行了跨阶段的联合训练。在13个数据集上进行的综合实验表明，该方法明显优于SOTA方法。这些结果强调了在场景文本检测中，通过角点检测算法使用文本笔画分量进行关键点定位的优势。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International journal of neural systems

自引率

0.00%

发文量