An efficient and effective text spotter for characters in natural scene images based on an improved YOLOv5 model

Quanxing Xu, Guanyi Zheng, Wanglong Ren, Xin Li, Zhuo Yang, Zhicheng Huang
{"title":"An efficient and effective text spotter for characters in natural scene images based on an improved YOLOv5 model","authors":"Quanxing Xu, Guanyi Zheng, Wanglong Ren, Xin Li, Zhuo Yang, Zhicheng Huang","doi":"10.1117/12.2667388","DOIUrl":null,"url":null,"abstract":"Traditional scene text spotters aim to detect and recognize entire words or sentences in natural scene images, however, the detection and recognition of every single character is also as important as the spotting of unifying words or sentences in one image. There are few specialized methods to spot single character in scene text spotting, and some word-based methods can not recognize a series of characters in images if they can not be spelled as a correct word. In addition, some early models can only detect or recognize texts which are horizontal and distinctive. We realize that it is necessary to improve some existing models for achieving the goal of spotting characters, therefore, we propose a novel method based on an improved YOLOv5 model to accomplish the character-level spotting. It’s worth noting that this method can spots characters not only in regular texts but also in irregular texts (curved texts and oriented texts).","PeriodicalId":137914,"journal":{"name":"International Conference on Artificial Intelligence, Virtual Reality, and Visualization","volume":"183 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Artificial Intelligence, Virtual Reality, and Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2667388","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Traditional scene text spotters aim to detect and recognize entire words or sentences in natural scene images, however, the detection and recognition of every single character is also as important as the spotting of unifying words or sentences in one image. There are few specialized methods to spot single character in scene text spotting, and some word-based methods can not recognize a series of characters in images if they can not be spelled as a correct word. In addition, some early models can only detect or recognize texts which are horizontal and distinctive. We realize that it is necessary to improve some existing models for achieving the goal of spotting characters, therefore, we propose a novel method based on an improved YOLOv5 model to accomplish the character-level spotting. It’s worth noting that this method can spots characters not only in regular texts but also in irregular texts (curved texts and oriented texts).
基于改进的YOLOv5模型的自然场景图像中字符的高效文本识别器
传统的场景文本识别技术的目标是对自然场景图像中的整个单词或句子进行检测和识别,但对单个字符的检测和识别与对单个图像中的统一单词或句子的检测和识别同样重要。在场景文本识别中,很少有专门的方法来识别单个字符,一些基于单词的方法如果不能将图像中的一系列字符拼写为正确的单词,则无法识别这些字符。此外,一些早期的模型只能检测或识别水平和独特的文本。我们意识到有必要对现有的一些模型进行改进,以实现字符的识别目标,因此,我们提出了一种基于改进的YOLOv5模型的新方法来实现字符级的识别。值得注意的是,该方法不仅可以识别规则文本中的字符,还可以识别不规则文本(曲面文本和定向文本)中的字符。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信