Xinyue Hou, Pengsen Cheng, Hongyu Gao, Xin Li, Jiayong Liu
{"title":"Scene Text Detection Based on Text Stroke Components.","authors":"Xinyue Hou, Pengsen Cheng, Hongyu Gao, Xin Li, Jiayong Liu","doi":"10.1142/S0129065725500200","DOIUrl":null,"url":null,"abstract":"<p><p>The detection of scene text holds significant importance across a variety of application scenarios. However, previous methods were insufficient for detecting and recognizing text instances, such as variations in text size, chaotic background and diverse text orientations. To address these challenges, this paper proposes a novel methodology based on Text Stroke Components (TSC). The method leverages Harris corner detection to identify critical points of text strokes, such as endpoints, turning points, and curvatures. By analyzing the clustered regions of these points, the approach effectively localizes text characters. To enhance the detection process, a transparency parameter [Formula: see text] is introduced to control the fusion between original images and corner-detection images. This improves the localization of key stroke points, and reduces background noise interference. The proposed method is evaluated through extensive experiments, demonstrating superior performance compared to existing scene text detectors. Furthermore, the method is jointly trained with the ABINet recognition model across all stages. Comprehensive experiments conducted on 13 datasets reveal that this approach significantly outperforms SOTA methods. These results underscore the advantages of using text stroke components for key-point localization through the corner detection algorithm in scene text detection.</p>","PeriodicalId":94052,"journal":{"name":"International journal of neural systems","volume":"35 5","pages":"2550020"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International journal of neural systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/S0129065725500200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The detection of scene text holds significant importance across a variety of application scenarios. However, previous methods were insufficient for detecting and recognizing text instances, such as variations in text size, chaotic background and diverse text orientations. To address these challenges, this paper proposes a novel methodology based on Text Stroke Components (TSC). The method leverages Harris corner detection to identify critical points of text strokes, such as endpoints, turning points, and curvatures. By analyzing the clustered regions of these points, the approach effectively localizes text characters. To enhance the detection process, a transparency parameter [Formula: see text] is introduced to control the fusion between original images and corner-detection images. This improves the localization of key stroke points, and reduces background noise interference. The proposed method is evaluated through extensive experiments, demonstrating superior performance compared to existing scene text detectors. Furthermore, the method is jointly trained with the ABINet recognition model across all stages. Comprehensive experiments conducted on 13 datasets reveal that this approach significantly outperforms SOTA methods. These results underscore the advantages of using text stroke components for key-point localization through the corner detection algorithm in scene text detection.