Fused Confidence for Scene Text Detection via Intersection-over-Union

Guo-lin Zhang, L. Ge, Yu-nuo Yang, Yu-qi Liu, Kexue Sun
{"title":"Fused Confidence for Scene Text Detection via Intersection-over-Union","authors":"Guo-lin Zhang, L. Ge, Yu-nuo Yang, Yu-qi Liu, Kexue Sun","doi":"10.1109/ICCT46805.2019.8947307","DOIUrl":null,"url":null,"abstract":"CNN-based scene text detection methods have achieved superior results. They are mostly implemented on the architecture of full convolution networks and non-maximum suppression (NMS) which combines two tasks of text classification and localization. However, in the NMS procedure, most filter the bounding boxes according to the classification confidence. This makes appropriately those well-located text boxes suppressed during NMS. In this paper, we propose an intersection-over-union (IOU) network to predict the IOU between the bounding box and the matched ground-truth. Then, the predicted IOU as localization confidence will be fused with the classification confidence. Furthermore, in the NMS, the classification confidence is replaced by the fused confidence as the ranking standard to preserve the accurately located text boxes. We experimented on the ICDAR2011 and ICDAR2013 datasets, the results show that the method proposed in this paper can effectively improve the accuracy of text detection.","PeriodicalId":306112,"journal":{"name":"2019 IEEE 19th International Conference on Communication Technology (ICCT)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 19th International Conference on Communication Technology (ICCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCT46805.2019.8947307","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

CNN-based scene text detection methods have achieved superior results. They are mostly implemented on the architecture of full convolution networks and non-maximum suppression (NMS) which combines two tasks of text classification and localization. However, in the NMS procedure, most filter the bounding boxes according to the classification confidence. This makes appropriately those well-located text boxes suppressed during NMS. In this paper, we propose an intersection-over-union (IOU) network to predict the IOU between the bounding box and the matched ground-truth. Then, the predicted IOU as localization confidence will be fused with the classification confidence. Furthermore, in the NMS, the classification confidence is replaced by the fused confidence as the ranking standard to preserve the accurately located text boxes. We experimented on the ICDAR2011 and ICDAR2013 datasets, the results show that the method proposed in this paper can effectively improve the accuracy of text detection.
基于交叉合并的场景文本检测融合置信度
基于cnn的场景文本检测方法取得了优异的效果。它们大多是在全卷积网络和非最大抑制(NMS)的架构上实现的,NMS结合了文本分类和定位两个任务。然而,在NMS过程中,大多数是根据分类置信度对边界框进行过滤。这使得在NMS期间适当地抑制了那些位置良好的文本框。在本文中,我们提出了一个交叉-超并(IOU)网络来预测边界框与匹配的真值之间的IOU。然后,将预测的IOU作为定位置信度与分类置信度融合。此外,在NMS中,将分类置信度替换为融合置信度作为排序标准,以保留准确定位的文本框。在ICDAR2011和ICDAR2013数据集上进行了实验,结果表明本文提出的方法可以有效地提高文本检测的准确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信