Scene Text Recognition with Single-Point Decoding Network

Lei Chen, Haibo Qin, Shi-Xue Zhang, Chun Yang, Xucheng Yin
{"title":"Scene Text Recognition with Single-Point Decoding Network","authors":"Lei Chen, Haibo Qin, Shi-Xue Zhang, Chun Yang, Xucheng Yin","doi":"10.48550/arXiv.2209.01914","DOIUrl":null,"url":null,"abstract":", Abstract. In recent years, attention-based scene text recognition methods have been very popular and attracted the interest of many researchers. Attention-based methods can adaptively focus attention on a small area or even single point during decoding, in which the attention matrix is nearly one-hot distribution. Furthermore, the whole feature maps will be weighted and summed by all attention matrices during inference, caus-ing huge redundant computations. In this paper, we propose an efficient attention-free Single-Point Decoding Network (dubbed SPDN) for scene text recognition, which can replace the traditional attention-based decoding network. Specifically, we propose Single-Point Sampling Module (SPSM) to efficiently sample one key point on the feature map for decoding one character. In this way, our method can not only precisely locate the key point of each character but also remove redundant computations. Based on SPSM, we design an efficient and novel single-point decoding network to replace the attention-based decoding network. Extensive experiments on publicly available benchmarks verify that our SPDN can greatly improve decoding efficiency without sacrificing performance.","PeriodicalId":155654,"journal":{"name":"CAAI International Conference on Artificial Intelligence","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"CAAI International Conference on Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2209.01914","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

, Abstract. In recent years, attention-based scene text recognition methods have been very popular and attracted the interest of many researchers. Attention-based methods can adaptively focus attention on a small area or even single point during decoding, in which the attention matrix is nearly one-hot distribution. Furthermore, the whole feature maps will be weighted and summed by all attention matrices during inference, caus-ing huge redundant computations. In this paper, we propose an efficient attention-free Single-Point Decoding Network (dubbed SPDN) for scene text recognition, which can replace the traditional attention-based decoding network. Specifically, we propose Single-Point Sampling Module (SPSM) to efficiently sample one key point on the feature map for decoding one character. In this way, our method can not only precisely locate the key point of each character but also remove redundant computations. Based on SPSM, we design an efficient and novel single-point decoding network to replace the attention-based decoding network. Extensive experiments on publicly available benchmarks verify that our SPDN can greatly improve decoding efficiency without sacrificing performance.
场景文本识别与单点解码网络
、抽象。近年来,基于注意力的场景文本识别方法得到了广泛的应用,并引起了许多研究者的兴趣。基于注意的方法在解码过程中可以自适应地将注意力集中在小区域甚至单点上,注意矩阵几乎是一个热点分布。此外,在推理过程中,整个特征映射将被所有注意矩阵加权和求和,导致大量的冗余计算。本文提出了一种高效的场景文本识别无注意力单点解码网络(SPDN),可以取代传统的基于注意力的解码网络。具体而言,我们提出了单点采样模块(SPSM),以有效地对特征映射上的一个关键点进行采样,以解码一个字符。这样,我们的方法不仅可以精确地定位每个字符的关键点,而且可以消除冗余计算。基于SPSM,我们设计了一种高效新颖的单点解码网络来取代基于注意力的解码网络。在公开可用的基准测试上进行的大量实验验证了我们的SPDN可以在不牺牲性能的情况下大大提高解码效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信