Research on Text Detection Algorithm based on Improved FPN

Huibai Wang, Shaoxian Feng
{"title":"Research on Text Detection Algorithm based on Improved FPN","authors":"Huibai Wang, Shaoxian Feng","doi":"10.1109/IAEAC54830.2022.9929716","DOIUrl":null,"url":null,"abstract":"In recent years, due to the increasingly application of machine vision in various aspects, the topic of text recognition in actual scenes has gradually become a research hot spot of machine vision research. For images with complex backgrounds, the first thing to do is to accurately locate the position of the target text, and then the text content can be efficiently identified. However, as far as the current text detection algorithms based on deep learning are concerned, there are still problems such as incomplete extraction of text feature regions, and wrong detection of images as text regions. Therefore, this paper proposes an improved DB algorithm to solve the problems of the variable shape and complex background of the text area in the label text detection task, so that the algorithm can achieve better detection effect and better performance in complex scenes. The content of the article is mainly from the following main aspects:firstly, the current situation of text detection algorithms is introduced, then the improvement of the DB algorithm with ResNet-50 as the backbone network is proposed, and finally mPA (mean Average Precision) is used as the evaluation of text detection. By comparing the various detection algorithms, it is found that the improved algorithm has significantly improved the detection accuracy and recall rate, and the model speed is also faster.","PeriodicalId":349113,"journal":{"name":"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )","volume":"40 2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 6th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC )","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IAEAC54830.2022.9929716","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In recent years, due to the increasingly application of machine vision in various aspects, the topic of text recognition in actual scenes has gradually become a research hot spot of machine vision research. For images with complex backgrounds, the first thing to do is to accurately locate the position of the target text, and then the text content can be efficiently identified. However, as far as the current text detection algorithms based on deep learning are concerned, there are still problems such as incomplete extraction of text feature regions, and wrong detection of images as text regions. Therefore, this paper proposes an improved DB algorithm to solve the problems of the variable shape and complex background of the text area in the label text detection task, so that the algorithm can achieve better detection effect and better performance in complex scenes. The content of the article is mainly from the following main aspects:firstly, the current situation of text detection algorithms is introduced, then the improvement of the DB algorithm with ResNet-50 as the backbone network is proposed, and finally mPA (mean Average Precision) is used as the evaluation of text detection. By comparing the various detection algorithms, it is found that the improved algorithm has significantly improved the detection accuracy and recall rate, and the model speed is also faster.
基于改进FPN的文本检测算法研究
近年来,由于机器视觉在各个方面的应用越来越多,实际场景中的文本识别课题逐渐成为机器视觉研究的一个研究热点。对于背景复杂的图像,首先要做的是准确定位目标文本的位置,然后才能高效地识别文本内容。然而,就目前基于深度学习的文本检测算法而言,仍然存在文本特征区域提取不完整、图像作为文本区域检测错误等问题。因此,本文提出了一种改进的DB算法来解决标签文本检测任务中文本区域形状多变、背景复杂的问题,使算法在复杂场景下能够达到更好的检测效果和性能。本文的内容主要从以下几个主要方面展开:首先介绍了文本检测算法的现状,然后提出了以ResNet-50为骨干网对DB算法的改进,最后采用mPA (mean Average Precision)作为文本检测的评价标准。通过比较各种检测算法,发现改进后的算法显著提高了检测准确率和召回率,模型速度也更快。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信