An efficient and effective text spotter for characters in natural scene images based on an improved YOLOv5 model

International Conference on Artificial Intelligence, Virtual Reality, and Visualization Pub Date : 2023-03-01 DOI:10.1117/12.2667388

Quanxing Xu, Guanyi Zheng, Wanglong Ren, Xin Li, Zhuo Yang, Zhicheng Huang

引用次数: 0

Abstract

Traditional scene text spotters aim to detect and recognize entire words or sentences in natural scene images, however, the detection and recognition of every single character is also as important as the spotting of unifying words or sentences in one image. There are few specialized methods to spot single character in scene text spotting, and some word-based methods can not recognize a series of characters in images if they can not be spelled as a correct word. In addition, some early models can only detect or recognize texts which are horizontal and distinctive. We realize that it is necessary to improve some existing models for achieving the goal of spotting characters, therefore, we propose a novel method based on an improved YOLOv5 model to accomplish the character-level spotting. It’s worth noting that this method can spots characters not only in regular texts but also in irregular texts (curved texts and oriented texts).

查看原文本刊更多论文

基于改进的YOLOv5模型的自然场景图像中字符的高效文本识别器

传统的场景文本识别技术的目标是对自然场景图像中的整个单词或句子进行检测和识别，但对单个字符的检测和识别与对单个图像中的统一单词或句子的检测和识别同样重要。在场景文本识别中，很少有专门的方法来识别单个字符，一些基于单词的方法如果不能将图像中的一系列字符拼写为正确的单词，则无法识别这些字符。此外，一些早期的模型只能检测或识别水平和独特的文本。我们意识到有必要对现有的一些模型进行改进，以实现字符的识别目标，因此，我们提出了一种基于改进的YOLOv5模型的新方法来实现字符级的识别。值得注意的是，该方法不仅可以识别规则文本中的字符，还可以识别不规则文本(曲面文本和定向文本)中的字符。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Conference on Artificial Intelligence, Virtual Reality, and Visualization

自引率

0.00%

发文量