Arbitrary Scene Text Detection with Bezier Proposal

Yuan-Po Chen, Yihong Li
{"title":"Arbitrary Scene Text Detection with Bezier Proposal","authors":"Yuan-Po Chen, Yihong Li","doi":"10.1109/acait53529.2021.9731235","DOIUrl":null,"url":null,"abstract":"Scene text detection is widely studied in natural language processing since 2016, in which arbitrary scene text detection is always the difficulty. At present, to deal with the problem of how to detect arbitrary shape text, the semantic segmentation-based methods are widely used, but the post-processing and label generation operations are complex. Sparse R-CNN is a novel object detection framework with simple process and high accuracy, which can simplify post-processing by bipartite graph matching loss. Therefore, an arbitrary shape text detect method without any post-process based on Bezier proposal with Sparse R-CNN is proposed. Firstly, the feature pyramid network with attention mechanism is used to extract features, and then the processed features go into the Sparse R-CNN detection head to get the score and coordinates, and finally the detection results are visualized according to the score. The results on ICDAR2015 and CTW1500 datasets show that our method can detect arbitrary text effectively, and our method have higher accuracy and higher speed than other methods.","PeriodicalId":173633,"journal":{"name":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","volume":"68 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 5th Asian Conference on Artificial Intelligence Technology (ACAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/acait53529.2021.9731235","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Scene text detection is widely studied in natural language processing since 2016, in which arbitrary scene text detection is always the difficulty. At present, to deal with the problem of how to detect arbitrary shape text, the semantic segmentation-based methods are widely used, but the post-processing and label generation operations are complex. Sparse R-CNN is a novel object detection framework with simple process and high accuracy, which can simplify post-processing by bipartite graph matching loss. Therefore, an arbitrary shape text detect method without any post-process based on Bezier proposal with Sparse R-CNN is proposed. Firstly, the feature pyramid network with attention mechanism is used to extract features, and then the processed features go into the Sparse R-CNN detection head to get the score and coordinates, and finally the detection results are visualized according to the score. The results on ICDAR2015 and CTW1500 datasets show that our method can detect arbitrary text effectively, and our method have higher accuracy and higher speed than other methods.
任意场景文本检测与贝塞尔建议
自2016年以来,场景文本检测在自然语言处理领域得到了广泛的研究,其中任意场景文本检测一直是难点。目前,针对任意形状文本的检测问题,广泛采用基于语义分割的方法,但其后处理和标签生成操作较为复杂。稀疏R-CNN是一种新的目标检测框架,过程简单,精度高,可以简化二部图匹配损失的后处理。因此,本文提出了一种基于Bezier建议和稀疏R-CNN的无后处理任意形状文本检测方法。首先利用具有注意机制的特征金字塔网络提取特征,然后将处理后的特征输入到Sparse R-CNN检测头中得到分数和坐标,最后根据分数将检测结果可视化。在ICDAR2015和CTW1500数据集上的实验结果表明,我们的方法可以有效地检测任意文本,并且比其他方法具有更高的准确率和更快的速度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信